Solved

Search for text and delete from file

Posted on 2009-05-08
6
678 Views
Last Modified: 2012-05-06
How do I search for a string of text in a file, using wild card, and then delete the entire line from the file....

if line contains "My name is *", delete line from file
0
Comment
Question by:MichelleLacy
6 Comments
 
LVL 20

Expert Comment

by:ddayx10
ID: 24341899
You might try looking at this post, it gives pretty sound idea of the limitations of what you are trying to do and a solution:

http://www.experts-exchange.com/Programming/Programming_Languages/Visual_Basic/VB_Controls/Q_21696282.html
0
 
LVL 13

Expert Comment

by:iHadi
ID: 24341906
Hi

try the following code (I'm assuming your not opening text files over 10s of megabytes):
string filePath = "your file path";
 

string fileContent = File.ReadAllText(filePath);
 

fileContent = Regex.Replace(fileContent, "My name is.*", "", RegexOptions.Multiline | RegexOptions.IgnoreCase);
 

File.WriteAllText("your file path", fileContent);

Open in new window

0
 
LVL 7

Expert Comment

by:zwei
ID: 24344365
iHadi, your code doesn't delete the whole line.

How about this modification:
string filePath = "file path";

string fileContent = File.ReadAllText(filePath);

String[] lines = fileContent.Split('\n');

fileContent = "";
 

foreach(String line in lines) 

{

    if (! Regex.Match(line, ".*My name is.*", RegexOptions.IgnoreCase).Success)

    {

        fileContent += line + '\n';

    }

}
 

File.WriteAllText(filePath, fileContent);

Open in new window

0
Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

 
LVL 9

Accepted Solution

by:
tculler earned 500 total points
ID: 24344986
Unfortunately, there is no way to actually "delete" something from a file. You have a few choices; I'll give you an un-optimized version of one of them (which will work for any size file, theoretically). Using Regex is somewhat overkill in this situation, "split" won't work correctly in case you have different types of line deliniations (\n, \r, etc), and File.ReadAllLines() will read any file into memory FIRST, potentially using up way to much memory. The solution below should be able to 'dodge' any of these potential problems. However, I did not optimize it completely because the goal of it is clarity. Modify it as you wish.
void RemoveFromFile(System.IO.FileInfo mySource)

{

    System.String output = mySource.FullName;

    mySource.MoveTo(mySource.Name + "_TEMP" + mySource.Extension);
 

    String line = null;

    using (System.IO.StreamReader reader = new System.IO.StreamReader(mySource.OpenRead()))

    using (System.IO.StreamWriter writer = new System.IO.StreamWriter(System.IO.File.OpenWrite(output)))

    {

        while (!reader.EndOfStream)

        {

            line = reader.ReadLine();
 

            if (!line.Contains("My name is "))

            { writer.WriteLine(line); }

        }

        writer.Flush();

    }
 

    mySource.Delete();

}

Open in new window

0
 
LVL 7

Expert Comment

by:zwei
ID: 24345038
@tculler: Good solution, reading and writing at the same time. I think I saw an example of that in ddayx10's link. I would still recommend to use regexes though (to allow for future modifications). This code would probably need to have .ToLower() on every line and "My name is ".
0
 
LVL 9

Expert Comment

by:tculler
ID: 24380749
I never check links, and question askers usually don't either. If they're asking a question, they should have already researched using such obvious resources.

I didn't do a ToLower(), or use regex, because I felt that from the question asked the problem was specific enough to target in on one thing, do that one thing, and do it very well. RegEx is awesome, but does require a bit of overhead when using it (though it will save time, and probably boost efficiency, in more complex situations). As long as you know this won't change, this solution is fairly optimized and expandable. Of course, if there is more content you must avoid, RegEx may become a necessity (though you would just replace the call to .Contains to a Regex call). Let me know if you'd like to see the implementation of my solution using RegEx, and I'll whip one up.
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Calculating holidays and working days is a function that is often needed yet it is not one found within the Framework. This article presents one approach to building a working-day calculator for use in .NET.
Performance in games development is paramount: every microsecond counts to be able to do everything in less than 33ms (aiming at 16ms). C# foreach statement is one of the worst performance killers, and here I explain why.
With the power of JIRA, there's an unlimited number of ways you can customize it, use it and benefit from it. With that in mind, there's bound to be things that I wasn't able to cover in this course. With this summary we'll look at some places to go…
Learn how to create flexible layouts using relative units in CSS.  New relative units added in CSS3 include vw(viewports width), vh(viewports height), vmin(minimum of viewports height and width), and vmax (maximum of viewports height and width).

863 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

28 Experts available now in Live!

Get 1:1 Help Now