[Last Call] Learn about multicloud storage options and how to improve your company's cloud strategy. Register Now

x
?
Solved

Parse text file, trim each line, remove carriage returns, add CRLF after delimiter character?

Posted on 2007-12-04
6
Medium Priority
?
7,672 Views
Last Modified: 2013-11-07
I believe this is a fairly simple question, but unfortunately I have no experience working with text files so I'm asking for help rather than figuring it out myself.

I need to read in a text file (it's going to be 5 megs up to a couple hundred).

I need to do the following with it
1) I need to remove the trailing spaces from each line.
2) Delete all carriage returns and line feeds.
3) Add a Carriage Return/ Line Feed after any instance of "</END??>" where the ?? can be anything.

I believe each line is around 1000 bytes.  Speed doesn't really matter.
I mention the byte count since I think this is a large file and should use something like a string buffer no? Not really familiar with anything other than ordinary strings.

I'll gladly bump up the point value for a quick quality solution.
0
Comment
Question by:Brothernod
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 3
  • 2
6 Comments
 
LVL 55

Expert Comment

by:Jaime Olivares
ID: 20409231
let's assume you read each line into a String object:
       Dim line as String

To remove new lines and CRs, use:
       line = line.Replace(chr(10).ToString(), "")
       line = line.Replace(char(13).ToString(), "")

Then remove trailing spaces
       line = line.TrimEnd(null);

about the </END??> issue, you can do it with an algorithm:
      Dim i as Integer = 0
      Do
             i = line.IndexOf("</END", i)
             If i < 0 Then Exit Do
             i = line.IndexOf(">", i)
             If i < 0 Then Exit Do
             line = line.Insert(i+1, char(13) & chr(10))
      While True
0
 
LVL 19

Expert Comment

by:weellio
ID: 20409421
here ya go
Dim source, target
source = "myfile.txt"
target = "myfile2.txt"
 
Dim fso, f, f2
dim x, y
Const ForReading = 1, ForWriting = 2
 
Set fso = CreateObject("Scripting.FileSystemObject")
If fso.FileExists(source) Then
   Dim inputLine, outputLine
   Set f = fso.OpenTextFile(source, ForReading, False)
   Set f2 = fso.OpenTextFile(target, ForWriting, True)
   While Not f.AtEndOfStream
      inputLine = f.ReadLine
			outputline = parseme(inputLine)
      f2.Write outputline
   Wend
   f.Close
   f2.Close
   Set f2 = Nothing
   Set f = Nothing
   MsgBox "Done"
Else
   MsgBox source, vbOKOnly, "Source File Not Found"
End If
Set fso = Nothing
 
function parseme(strtext)
' remove trailing lines
	parseme = rtrim(strtext)
 
'remove carriage returns
	parseme = Replace(parseme, vbCr, "")
 
'remove line feeds
	parseme = Replace(parseme, vblf, "")
 
'find "</END??>" and add vbcrlf
	x = instr(parseme, "</END") 'if exists find a location 
'loop to find multiple '</end"'s
	do until x = 0 
			y = instr(x, parseme, ">")
			parseme = left(parseme, y+1) & vbcrlf & mid(parseme, y+1) 'starting on character past the ">"
			x = instr(y, parseme, "</END") 
	loop
end function

Open in new window

0
 
LVL 19

Expert Comment

by:weellio
ID: 20409426
hmm,.. i did a simple vbscript,.. but i'm sure you don't really have to change it to make it work with vb

just name the file something.vbs and edit the 'inputfile and outputfilenames.
or setup as arguments or something.
0
Veeam Disaster Recovery in Microsoft Azure

Veeam PN for Microsoft Azure is a FREE solution designed to simplify and automate the setup of a DR site in Microsoft Azure using lightweight software-defined networking. It reduces the complexity of VPN deployments and is designed for businesses of ALL sizes.

 
LVL 19

Accepted Solution

by:
weellio earned 1000 total points
ID: 20409435
tiny change
Dim source, target
source = "myfile.txt"
target = "myfile2.txt"
 
Dim fso, f, f2
dim x, y
Const ForReading = 1, ForWriting = 2
 
Set fso = CreateObject("Scripting.FileSystemObject")
If fso.FileExists(source) Then
   Dim inputLine, outputLine
   Set f = fso.OpenTextFile(source, ForReading, False)
   Set f2 = fso.OpenTextFile(target, ForWriting, True)
   While Not f.AtEndOfStream
      inputLine = f.ReadLine
			outputline = parseme(inputLine)
      f2.Write outputline
   Wend
   f.Close
   f2.Close
   Set f2 = Nothing
   Set f = Nothing
   MsgBox "Done"
Else
   MsgBox source, vbOKOnly, "Source File Not Found"
End If
Set fso = Nothing
 
function parseme(strtext)
' remove trailing lines
	parseme = rtrim(strtext)
 
'remove carriage returns
	parseme = Replace(parseme, vbCr, "")
 
'remove line feeds
	parseme = Replace(parseme, vblf, "")
 
'find "</END??>" and add vbcrlf
	x = instr(parseme, "</END") 'if exists find a location 
'loop to find multiple '</end"'s
	do until x = 0 
			y = instr(x, parseme, ">")
			parseme = left(parseme, y) & vbcrlf & mid(parseme, y+1) 'starting on character past the ">"
			x = instr(y, parseme, "</END") 
	loop
end function

Open in new window

0
 
LVL 1

Author Comment

by:Brothernod
ID: 20411400
I won't lie, I was really hoping for a nice clean vb.net example.

BUT

I needed this quick, and weelio did provide a quick complete solution.

Thank you.
0
 
LVL 1

Author Closing Comment

by:Brothernod
ID: 31412783
It's in VBA not VB.NET but I guess it was close enough since I'm in a hurry :)
0

Featured Post

Enroll in October's Free Course of the Month

Do you work with and analyze data? Enroll in October's Course of the Month for 7+ hours of SQL training, allowing you to quickly and efficiently store or retrieve data. It's free for Premium Members, Team Accounts, and Qualified Experts!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Welcome my friends to the second instalment and follow-up to our Minify and Concatenate Your Scripts and Stylesheets (http://www.experts-exchange.com/Programming/Languages/.NET/ASP.NET/A_4334-Minify-and-Concatenate-Your-Scripts-and-Stylesheets.html)…
It’s quite interesting for me as I worked with Excel using vb.net for some time. Here are some topics which I know want to share with others whom this might help. First of all if you are working with Excel then you need to Download the Following …
This course is ideal for IT System Administrators working with VMware vSphere and its associated products in their company infrastructure. This course teaches you how to install and maintain this virtualization technology to store data, prevent vuln…
In this video you will find out how to export Office 365 mailboxes using the built in eDiscovery tool. Bear in mind that although this method might be useful in some cases, using PST files as Office 365 backup is troublesome in a long run (more on t…

650 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question