Thanks for the help, but it was saying "cannot find the Regular expression: [[:cntrl:]].*\n.*\n.*\n.*\
I started this in Excel and used macros, but it turns out the files were way too large for Excel to handle; it was cutting off two-thirds of each file and I didn't even realize it.
I want to delete this section
JEFFERSON COUNTY, WV
REAL PROPERTY ASSESSMENT SYSTEM
11/30/2007 INITIAL ASSESSMENT ROLL PAGE: 1
02:32 PM FOR TAXYEAR 2007 AA151PTD
Date Range: TO
DISTRICT 01 - BOLIVAR CORPORATION
OWNER NAME-1 / NAME-2 LAND
BK IMPROVEMENTS
TX IN CARE OF ACCOUNT PROPERTY TAX MINERALS EX
PARCEL YR OWNER NUMBER DESCRIPTION CLS CD
Eventually I also want to have each section of the files on one line to use in a database. It seems that each section ends with HOMESTEAD.
I attached an example txt file of what I'm working with.
I'm stuck here.
Applying the above regular expression to the document you attached matches all sections. You could also try the expression
\f.*\n.*\n.*\n.*\n\n\n.*\n
which is the same as above, but with the control character class replaced by the form feed character. For me, both expressions give the same result.
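For anyone who wants to try the form feed expression outside an editor's Replace dialog, here is a minimal Python sketch. It is not the method used in this thread, just the same expression applied with Python's `re` module. The sample text and the function name `strip_repeated_headers` are made up for illustration; the real header layout in the attached file may differ.

```python
import re

# The expression from the post above. In Python's re module, "." does not
# match a newline by default, so each ".*" consumes exactly one line.
HEADER_RE = re.compile(r"\f.*\n.*\n.*\n.*\n\n\n.*\n")


def strip_repeated_headers(text: str) -> str:
    """Delete every header block that starts with a form feed character."""
    return HEADER_RE.sub("", text)


# Hypothetical sample: the first header has no form feed in front of it,
# the repeated header on the next "page" does.
sample = (
    "COUNTY HEADER\nrecord 1\n"
    "\fCOUNTY HEADER\nSYSTEM NAME\nDATE PAGE: 2\nTIME\n\n\nDISTRICT\n"
    "record 2\n"
)

print(repr(strip_repeated_headers(sample)))
```

The first header survives because nothing before it matches `\f`, which is the behavior described in the answer.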
_______________
Nayer Naguib
Yes, I can see the reason. Each section (between two consecutive form feed characters) contains seven data records. The last record in each section is not followed by an empty line. In order to solve this issue, you need to replace the first regular expression (\f.*\n.*\n.*\n.*\n\n\n.*\
Note: If you want the last record in the file to be formatted during the second step, just add a newline character at the end of the file.
_______________
Nayer Naguib
by: nayernaguib, posted on 2008-03-26 at 05:24:03, ID: 21210809
Replacing the following regular expression with nothing in the whole document will delete all headers except for the first one:
n\n\n.*\n.*\n.*\n.*\n.*
n\(.*\)\n\(.*\)\n\n
[[:cntrl:]].*\n.*\n.*\n.*\
The [[:cntrl:]] character class matches the form feed ("FF") byte at the beginning of each section. The "FF" byte does not occur before the first header in the document, which is why the first header is not deleted by the above replacement.
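If you want to confirm where the "FF" bytes sit in a file before running the replacement, a small Python check like the following may help. It is only a sketch: Python's `re` module does not support POSIX classes such as [[:cntrl:]], so this looks for the form feed character directly. The helper name `form_feed_offsets` and the sample text are hypothetical.

```python
import unicodedata

FORM_FEED = "\f"  # the "FF" byte, ASCII code 12


def form_feed_offsets(text: str) -> list[int]:
    """Return the offset of every form feed character in the text."""
    return [i for i, ch in enumerate(text) if ch == FORM_FEED]


# Hypothetical sample: only the second header is preceded by a form feed.
sample = "HEADER\nrecord 1\n\fHEADER\nrecord 2\n"

# Form feed is a control character (Unicode category "Cc").
print(ord(FORM_FEED), unicodedata.category(FORM_FEED))
print(form_feed_offsets(sample))
```

An empty list for the start of the file is expected, since the first header has no form feed in front of it.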
Next, replacing the regular expression
^\(01.*\)\n\(.*\)\n\(.*\)\
with
\1 \2 \3 \4
will remove the newline characters from each record, except for the last one, as it is followed by an EOF rather than a newline.
Do not forget to check the "Regular expression" check box in the Replace dialog.
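The same joining step could be sketched in Python as below. The expression shown in this post is cut off, so the pattern here is an assumption, not the original: it supposes a record starts with "01", spans four lines, and is followed by an empty line, which matches the four back-references in the replacement. Ending each joined record with a newline is also a choice made for this sketch, and the sample records are placeholders.

```python
import re

# ASSUMED layout (not the original expression): a record starts with "01",
# spans four lines, and is followed by one empty line.
RECORD_RE = re.compile(r"^(01.*)\n(.*)\n(.*)\n(.*)\n\n", re.MULTILINE)


def join_records(text: str) -> str:
    """Put each matched four-line record on a single line."""
    return RECORD_RE.sub(r"\1 \2 \3 \4\n", text)


# Placeholder data: the last record has no empty line after it.
sample = (
    "01 0001 OWNER A\nADDRESS A\nLOT A\nHOMESTEAD\n\n"
    "01 0002 OWNER B\nADDRESS B\nLOT B\nHOMESTEAD\n"
)

print(join_records(sample))         # last record is left untouched
print(join_records(sample + "\n"))  # extra newline lets it match too
```

The second call shows why appending a newline at the end of the file lets the last record be joined as well.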
_______________
Nayer Naguib