Text Analysis example counting sentences, paragraphs, words just like MS word does

I want to count the number of sentence parargraphs and words and character and then number of characters without spaces and report onthis into simple text boxes imagine like MS word does a wordcount i want this functonality from my VB app in a text box.

Cheers
RanoldinAsked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

NBrownohCommented:
for characters without spaces do this:
YourVal = len(replace(YourText, " ", ""))

for the number of words genericaly just do this:
Tem = replace(YourText, " ", "")
YourVal = Val(Len(YourText) - Len(Tem))

and for sentances do this:
Tem = replace(YourText, ".", "")
YourVal = Val(Len(YourText) - Len(Tem))
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
monvelasquezCommented:
The best way to do this is probably by using the RegExp object

a nice discussion about RegExp is at
    http://www.juicystudio.com/tutorial/vb/regexp.asp

In your project references add "Microsoft VBScript Regular Expressions"

here's are some samples

Function CountWords(ByVal Text As String) As Long
    Dim re As New RegExp
    re.Pattern = "\b\w+\b"
    re.Global = True
    CountWords = re.Execute(Text).Count
End Function

Function CountCharsNoSpaces(ByVal Text As String) As Long
    Dim re As New RegExp
    re.Pattern = "\s"
    re.Global = True
    CountChars = re.Execute(Text).Count
End Function


Function CountChars(ByVal Text As String) As Long
    Dim re As New RegExp
    re.Pattern = "."
    re.Global = True
    CountChars = re.Execute(Text).Count
End Function


0
RanoldinAuthor Commented:
What about the VBClrf character some reason it counts them as characters too?

Thanks for speedy respons though NBrownoh

I would also like to ask is it possible to do this on a string variable

myChar = Replace(strTemp, vbCrLf, "")
myChar = Replace(strTemp, " ", "")


frmAnalysis.txtNumberOfCharacters.Text = Len(myChar)
to also count the length of the string after the two above things have been done

0
Ultimate Tool Kit for Technology Solution Provider

Broken down into practical pointers and step-by-step instructions, the IT Service Excellence Tool Kit delivers expert advice for technology solution providers. Get your free copy now.

EDDYKTCommented:
Do this

strTemp = Replace(strTemp, vbCr, "")
strTemp = Replace(strTemp, vbLf, "")
strTemp = Replace(strTemp, " ", "")
0
Dang123Commented:
Listening . . .
0
RanoldinAuthor Commented:
Who is Dang 123 BTW ?? where you come from i have a feeling i know why u are listening hehe

THankyou for speedy responses guys as ever appreciate it loads so i thought id split the points up failry i got help from all three of you rather than from just one person :PP


Cheers
0
Dang123Commented:
Ranoldin,
    I am in New Jersey. My users are working on specs for a program I will be working on soon that will require me to do Text Analysis, it will be an in-house only program. I just wanted to be able to find this question easily in the future, and since EE gives a list of questions you comment into . . . .

    Am I who you were thinking of?   ; )

    Glad you got what you needed, and thanks for letting me listen in.

Dang123

0
RanoldinAuthor Commented:
Hehe no you are not the person lol i thought it was someone i knew locally to me hehe.

Text Analysis is ok i am continuing to develop Paragraphs counting and pages ugghies ))


Confusing this is VBClrf and VBcr lol

mean practically the same thing but of course vbClrf is a key nto a character oh well lol

Cheers Ranoldin ))
0
Dang123Commented:
Ranoldin,
    Your right, it can be a bit confusing. Basically, vbCrLf represents two characters (13 and 10) this is from the old teletype terminals. Carriage return (13) would move the print head back to the start of the line and Linefeed (10) would advance the paper by one line. While this has no real effect on the screens we use today, most programs still use this character combination to represent a line break. A few try to save the character length by using just Carriage return (13) (vbCr) or just Linefeed (10) (vbLf) so you need to be aware of any of the three ways to spot a line break.

Good luck, and thanks again for letting me listen in.

Dang123

0
RanoldinAuthor Commented:
Gusy id like to extend this question dont know if u will be able to help how would i go about counting Paragrahs in a text box ??
0
Dang123Commented:
1. Replace all vbCrLf & vbCrLf with vbCrLf (may need to do it a few times depending on your users typing)

2. Loop through the text counting vbCrLf

3. If the last characters in the text are vbCrLf, you have your count otherwise add 1 to the get your count.

0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Visual Basic Classic

From novice to tech pro — start learning today.

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.