Solved

VBA search paragraph and extract block of text

Posted on 2009-07-02
10
722 Views
Last Modified: 2012-05-07
Hello, I am running an Access VBA subroutine that opens a word file, searches the file for some tokens, and extracts those texts until the closing token appears. I am new to word programming but not to excel or access. Anyone have any examples or solutions? Thanks
0
Comment
Question by:allenlo77
  • 5
  • 5
10 Comments
 
LVL 15

Expert Comment

by:weinberk
ID: 24767493
Is the start "token" the same as the end token?
Is there more than one set of start and end tokens?
I'll whip up some code, just let me know.  Won't take but a minute.
0
 

Author Comment

by:allenlo77
ID: 24767506
could be, doesnt have to be, your call!
0
 
LVL 15

Accepted Solution

by:
weinberk earned 500 total points
ID: 24767890
This will spit out the found text in a msgbox and assigns it to a variable.
I used ">>" as the starting token and ""<<" as the ending token.  If this isn't what you're using, change the
.Text = "\>\>*\<\<"
line to be the characters you're using.  You need to escape (\) special characters.
Also, at the end, where I trimmed of the tokens using left and right, you'll need to use the length of the starting token to replace the 2 and the total length of the start and end token to replace the 4.
 
This should do what you need.  You can loop if you need to find multiple occurrences.
Hope this helps.

Sub Find()
    Dim sFoundText As String
    
    Selection.Find.ClearFormatting
    
    With Selection.Find
        .Text = "\>\>*\<\<"
        .Replacement.Text = ""
        .Forward = True
        .Wrap = wdFindContinue
        .Format = False
        .MatchCase = False
        .MatchWholeWord = False
        .MatchWildcards = True
        .MatchSoundsLike = False
        .MatchAllWordForms = False
    End With
    Selection.Find.Execute
    sFoundText = Left(Right(Selection.Text, Len(Selection.Text) - 2), Len(Selection.Text) - 4)
    MsgBox sFoundText
End Sub

Open in new window

0
VMware Disaster Recovery and Data Protection

In this expert guide, you’ll learn about the components of a Modern Data Center. You will use cases for the value-added capabilities of Veeam®, including combining backup and replication for VMware disaster recovery and using replication for data center migration.

 

Author Comment

by:allenlo77
ID: 24772166
weinberk, I am trying to extract a whole paragraph, wiill this work?

For example, the doc file has this

fdksljfkldjlkj >> hello my name is allen, blah blah

new paragraph blah blah blah

new paragraph blah blah. Okay we are done <<

And I'd like to extract everything between the '>>' token and '<<' token. I am looking at the code you posted and I am not even sure if it will work.
0
 
LVL 15

Expert Comment

by:weinberk
ID: 24772266
Yes it works.  
I pasted your sample text into a word document then ran the code I provided.  A message box pops up with the text (although boxes appear where carriage returns are since the msg box doesn't support returns like that), and the entire text is available in the variable to do whatever you want with.
0
 

Author Comment

by:allenlo77
ID: 24772452
Ok it works, how do you handle white space between paragraphs? I have 4 paragraphs all separated by white space line.
0
 
LVL 15

Expert Comment

by:weinberk
ID: 24772626
You just asked to return the text, which is why my code does.  
What is it that you want the code to do with the "white space?"  In your example text, it's just returns.  This might be another topic....
0
 

Author Comment

by:allenlo77
ID: 24772645
well because there is white space between the paragraphs, it gives me an "Invalid procedure call or argument", so I was wondering if you have a solution for that
0
 

Author Comment

by:allenlo77
ID: 24772722
nevermind, it works, the mouse cursor cannot be on the paragraph in between the tokens or else it doesn't work
0
 
LVL 15

Expert Comment

by:weinberk
ID: 24772987
You're right.  Replace
Selection.Find.ClearFormatting

with
 ActiveDocument.Content.Select
and give it a whirl.  
Please don't forget to accept a solution.  I've worked hard to get a solution for you.
0

Featured Post

DevOps Toolchain Recommendations

Read this Gartner Research Note and discover how your IT organization can automate and optimize DevOps processes using a toolchain architecture.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
MS Word document > cursor placement 5 33
Regarding Notepad++ 4 43
Replacing hyphens with non-breaking hyphens only in certain numbers 2 17
word 2016 1 31
Preface: When I started this series, I used the term CommandBars because that is the Office Object class that it discusses. Unfortunately, when Microsoft introduced Office 2007, they replaced the standard Commandbar menus with "The Ribbon" and rem…
Microsoft Word is a program we have all encountered at some point, but very few of us have dug deep into its full scope of features, let alone customized it to suit our needs. Luckily making the ribbon (aka toolbar, first introduced in Word 2007) wo…
This video teaches the viewer how to align pictures around text while keeping the text properly aligned in the document.
This video shows where to find templates, what they are used for, and how to create and save a custom template using Microsoft Word.

777 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question