Solved

VBA search paragraph and extract block of text

Posted on 2009-07-02
Last Modified: 2012-05-07
Hello, I am running an Access VBA subroutine that opens a Word file, searches it for an opening token, and extracts the text from there until the closing token appears. I am new to Word programming but not to Excel or Access. Does anyone have any examples or solutions? Thanks
Question by:allenlo77
 
LVL 15

Expert Comment

by:weinberk
ID: 24767493
Is the start "token" the same as the end token?
Is there more than one set of start and end tokens?
I'll whip up some code, just let me know.  Won't take but a minute.
 

Author Comment

by:allenlo77
ID: 24767506
Could be, doesn't have to be. Your call!
 
LVL 15

Accepted Solution

by:
weinberk earned 500 total points
ID: 24767890
This will spit out the found text in a MsgBox and assign it to a variable.
I used ">>" as the starting token and "<<" as the ending token.  If this isn't what you're using, change the
.Text = "\>\>*\<\<"
line to be the characters you're using.  You need to escape (\) special characters; ">" and "<" are special because MatchWildcards is turned on.
Also, at the end, where I trimmed off the tokens using Left and Right, you'll need to replace the 2 with the length of the starting token and the 4 with the combined length of the start and end tokens.
 
This should do what you need.  You can loop if you need to find multiple occurrences.
Hope this helps.

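Sub Find()
    Dim sFoundText As String

    Selection.Find.ClearFormatting

    With Selection.Find
        ' Wildcard pattern: ">>", any run of characters, then "<<".
        ' ">" and "<" are wildcard operators, so each one is escaped with "\".
        .Text = "\>\>*\<\<"
        .Replacement.Text = ""
        .Forward = True
        .Wrap = wdFindContinue
        .Format = False
        .MatchCase = False
        .MatchWholeWord = False
        .MatchWildcards = True
        .MatchSoundsLike = False
        .MatchAllWordForms = False
    End With

    ' Execute returns False when nothing matches. Stop here in that case, because
    ' Left/Right raise "Invalid procedure call or argument" on a negative length.
    If Not Selection.Find.Execute Then
        MsgBox "No text found between the tokens."
        Exit Sub
    End If

    ' Drop the 2-character start token, then keep everything except the end token.
    sFoundText = Left(Right(Selection.Text, Len(Selection.Text) - 2), Len(Selection.Text) - 4)
    MsgBox sFoundText
End Sub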
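
For the "loop if you need to find multiple occurrences" case, here is a minimal sketch of one way it could look when run inside Word. It swaps Selection.Find for Range.Find, so it is a variation on the posted code rather than the same approach. The names ExtractAllBlocks, PrintAllBlocks, START_TOKEN and END_TOKEN are made up for illustration, and the wildcard pattern still has to be kept in sync with the two token constants by hand.

```vba
' Hypothetical helper: collects every ">>...<<" block in the active document.
Function ExtractAllBlocks() As Collection
    Const START_TOKEN As String = ">>"
    Const END_TOKEN As String = "<<"

    Dim results As New Collection
    Dim rng As Range
    Dim s As String

    Set rng = ActiveDocument.Content
    With rng.Find
        .ClearFormatting
        .Text = "\>\>*\<\<"       ' escaped form of START_TOKEN, anything, END_TOKEN
        .Forward = True
        .Wrap = wdFindStop        ' stop at the end of the document, do not wrap
        .MatchWildcards = True
    End With

    Do While rng.Find.Execute
        s = rng.Text
        ' Trim the tokens by their lengths instead of the hard-coded 2 and 4.
        results.Add Mid$(s, Len(START_TOKEN) + 1, _
                         Len(s) - Len(START_TOKEN) - Len(END_TOKEN))
        rng.Collapse wdCollapseEnd    ' resume the search after this match
    Loop

    Set ExtractAllBlocks = results
End Function

' Example caller: prints each block to the Immediate window.
Sub PrintAllBlocks()
    Dim block As Variant
    For Each block In ExtractAllBlocks()
        Debug.Print block
    Next block
End Sub
```

Because the search runs on a Range built from the whole document, it should not depend on where the cursor happens to be.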

 

Author Comment

by:allenlo77
ID: 24772166
weinberk, I am trying to extract a whole paragraph, will this work?

For example, the doc file has this

fdksljfkldjlkj >> hello my name is allen, blah blah

new paragraph blah blah blah

new paragraph blah blah. Okay we are done <<

And I'd like to extract everything between the '>>' token and '<<' token. I am looking at the code you posted and I am not even sure if it will work.
 
LVL 15

Expert Comment

by:weinberk
ID: 24772266
Yes it works.  
I pasted your sample text into a word document then ran the code I provided.  A message box pops up with the text (although boxes appear where carriage returns are since the msg box doesn't support returns like that), and the entire text is available in the variable to do whatever you want with.
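
If the boxes in the message box are a nuisance, one possible workaround is sketched below. It assumes the boxes are Word's paragraph marks, which Word stores as a single carriage return (Chr(13), vbCr), and swaps them for vbCrLf before display. ShowFoundText is a made-up name, and whether the boxes appear at all may depend on the Windows version.

```vba
' Hypothetical helper: show extracted Word text with real line breaks.
Sub ShowFoundText(ByVal sFoundText As String)
    ' Word paragraph marks are a lone vbCr; use vbCrLf for display only.
    MsgBox Replace(sFoundText, vbCr, vbCrLf)
End Sub
```

The variable itself is left untouched, so the original paragraph marks are still there if the text is written back to Word later.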

 

Author Comment

by:allenlo77
ID: 24772452
OK, it works. How do you handle white space between paragraphs? I have 4 paragraphs, each separated by a blank line.
 
LVL 15

Expert Comment

by:weinberk
ID: 24772626
You only asked to return the text, which is what my code does.  
What is it that you want the code to do with the "white space?"  In your example text, it's just returns.  This might be another topic....
 

Author Comment

by:allenlo77
ID: 24772645
Well, because there is white space between the paragraphs, it gives me an "Invalid procedure call or argument" error, so I was wondering if you have a solution for that.
 

Author Comment

by:allenlo77
ID: 24772722
Never mind, it works. The cursor cannot be inside the paragraphs between the tokens, or else it doesn't work.
 
LVL 15

Expert Comment

by:weinberk
ID: 24772987
You're right.  Replace
Selection.Find.ClearFormatting

with
 ActiveDocument.Content.Select
and give it a whirl.  
Please don't forget to accept a solution.  I've worked hard to get a solution for you.
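
Since the original question was about driving Word from Access, here is a rough sketch of how the same search might be run from an Access module. It is not the code posted above: it uses late binding (CreateObject) and Range.Find instead of Selection.Find, so it needs no reference to the Word object library and should not care where the cursor is. ExtractBlockFromFile and docPath are made-up names, wdFindStop is declared locally because Word's constants are not available under late binding, and the tokens are assumed to be ">>" and "<<" as in the thread.

```vba
' Hypothetical Access-side helper: returns the first ">>...<<" block in a Word file,
' or an empty string if nothing matches or the file cannot be opened.
Function ExtractBlockFromFile(ByVal docPath As String) As String
    Const wdFindStop As Long = 0      ' value of Word's wdFindStop constant

    Dim wordApp As Object
    Dim doc As Object
    Dim rng As Object
    Dim s As String

    Set wordApp = CreateObject("Word.Application")
    On Error GoTo CleanUp

    Set doc = wordApp.Documents.Open(FileName:=docPath, ReadOnly:=True)
    Set rng = doc.Content

    With rng.Find
        .ClearFormatting
        .Text = "\>\>*\<\<"
        .Forward = True
        .Wrap = wdFindStop
        .MatchWildcards = True
    End With

    If rng.Find.Execute Then
        s = rng.Text
        ' Strip the 2-character start token and the 2-character end token.
        ExtractBlockFromFile = Mid$(s, 3, Len(s) - 4)
    End If

CleanUp:
    ' Close without saving and shut down the hidden Word instance.
    If Not doc Is Nothing Then doc.Close False
    wordApp.Quit
End Function
```

A call such as ExtractBlockFromFile("C:\path\to\file.doc") (the path is only a placeholder) would then give Access the text as an ordinary string.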

Question has a verified solution.
