Solved

Search for text and delete from file

Posted on 2009-05-08
Last Modified: 2012-05-06
How do I search for a string of text in a file, using a wildcard, and then delete the entire line from the file?

if line contains "My name is *", delete line from file
Question by:MichelleLacy
6 Comments
 
LVL 20

Expert Comment

by:ddayx10
ID: 24341899
You might try looking at this post. It gives a pretty sound idea of the limitations of what you are trying to do, along with a solution:

http://www.experts-exchange.com/Programming/Programming_Languages/Visual_Basic/VB_Controls/Q_21696282.html
 
LVL 13

Expert Comment

by:iHadi
ID: 24341906
Hi

Try the following code (I'm assuming you're not opening text files larger than tens of megabytes):
using System.IO;
using System.Text.RegularExpressions;

string filePath = "your file path";

string fileContent = File.ReadAllText(filePath);

// Blanks the text from "My name is" to the end of each matching line (case-insensitive).
fileContent = Regex.Replace(fileContent, "My name is.*", "", RegexOptions.Multiline | RegexOptions.IgnoreCase);

File.WriteAllText(filePath, fileContent);

 
LVL 7

Expert Comment

by:zwei
ID: 24344365
iHadi, your code doesn't delete the whole line.

How about this modification:
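using System.Collections.Generic;
using System.IO;
using System.Text.RegularExpressions;

string filePath = "file path";
string fileContent = File.ReadAllText(filePath);
string[] lines = fileContent.Split('\n');
List<string> kept = new List<string>();

foreach (string line in lines)
{
    // Keep only the lines that do not contain the phrase (case-insensitive).
    if (!Regex.IsMatch(line, "My name is", RegexOptions.IgnoreCase))
    {
        kept.Add(line);
    }
}

// Join rather than append '\n' per line, so no extra trailing newline is written.
File.WriteAllText(filePath, string.Join("\n", kept.ToArray()));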
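
For comparison, the whole-line deletion can also be expressed as a single Regex.Replace call by anchoring the pattern at the start of each line and consuming the line ending as well. This is only a sketch: the class and method names (LineRemover, RemoveMatchingLines) are made up for illustration, and it assumes the file uses \n or \r\n line endings and fits comfortably in memory.

```csharp
using System.Text.RegularExpressions;

static class LineRemover
{
    // Hypothetical helper: removes every line containing "My name is"
    // (case-insensitive), together with its line terminator.
    public static string RemoveMatchingLines(string content)
    {
        // ^ matches at each line start (Multiline); [^\r\n]* stays within one line;
        // (\r?\n|\z) consumes the line ending, or the end of the text for a final line.
        return Regex.Replace(
            content,
            @"^[^\r\n]*My name is[^\r\n]*(\r?\n|\z)",
            "",
            RegexOptions.Multiline | RegexOptions.IgnoreCase);
    }
}
```

It could be used as File.WriteAllText(filePath, LineRemover.RemoveMatchingLines(File.ReadAllText(filePath))).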

 
LVL 9

Accepted Solution

by:
tculler earned 1500 total points
ID: 24344986
Unfortunately, there is no way to actually "delete" something from a file. You have a few choices; I'll give you an un-optimized version of one of them (which will work for a file of any size, theoretically). Using Regex is somewhat overkill in this situation, "split" won't work correctly if the file has different types of line delimiters (\n, \r, etc.), and File.ReadAllLines() will read the whole file into memory FIRST, potentially using up way too much memory. The solution below should be able to 'dodge' all of these potential problems. However, I did not optimize it completely because the goal is clarity. Modify it as you wish.
void RemoveFromFile(System.IO.FileInfo mySource)
{
    // Remember the original path, then move the source aside to a temp file
    // in the same directory (FullName keeps it out of the working directory).
    System.String output = mySource.FullName;
    mySource.MoveTo(output + "_TEMP");

    using (System.IO.StreamReader reader = mySource.OpenText())
    using (System.IO.StreamWriter writer = new System.IO.StreamWriter(output))
    {
        System.String line;
        // ReadLine returns null at end of file and accepts \n, \r and \r\n endings.
        while ((line = reader.ReadLine()) != null)
        {
            if (!line.Contains("My name is "))
            { writer.WriteLine(line); }
        }
    }

    // Remove the temp copy once the filtered file has been written.
    mySource.Delete();
}

 
LVL 7

Expert Comment

by:zwei
ID: 24345038
@tculler: Good solution, reading and writing at the same time. I think I saw an example of that in ddayx10's link. I would still recommend using regexes though (to allow for future modifications). To match case-insensitively, this code would probably need .ToLower() on both each line and "My name is ".
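
As an alternative to calling .ToLower() on both strings, a case-insensitive check can be done with String.IndexOf and a StringComparison value, which avoids allocating a lowered copy of every line. A minimal sketch; the LineFilter class and ContainsPhrase method are hypothetical names, not part of anyone's posted code:

```csharp
using System;

static class LineFilter
{
    // Hypothetical helper: true if 'line' contains 'phrase', ignoring case.
    public static bool ContainsPhrase(string line, string phrase)
    {
        // IndexOf returns -1 when the phrase is not found.
        return line.IndexOf(phrase, StringComparison.OrdinalIgnoreCase) >= 0;
    }
}
```

In the streaming loop, the condition would then read if (!LineFilter.ContainsPhrase(line, "My name is ")).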
 
LVL 9

Expert Comment

by:tculler
ID: 24380749
I never check links, and question askers usually don't either. If they're asking a question, they should have already researched using such obvious resources.

I didn't do a ToLower(), or use regex, because I felt that the question was specific enough to home in on one thing, do that one thing, and do it very well. RegEx is awesome, but it does carry a bit of overhead (though it will save time, and probably boost efficiency, in more complex situations). As long as you know the requirement won't change, this solution is fairly optimized and expandable. Of course, if there is more content you must filter out, RegEx may become a necessity (though you would just replace the call to .Contains with a Regex call). Let me know if you'd like to see the implementation of my solution using RegEx, and I'll whip one up.
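
For readers wondering what swapping .Contains for a Regex call might look like, here is one possible sketch. It is not tculler's own implementation: the class and method names, the "_TEMP" suffix handling, and the choice of RegexOptions.IgnoreCase are all assumptions made for illustration.

```csharp
using System.IO;
using System.Text.RegularExpressions;

static class RegexLineRemover
{
    // Hypothetical helper: copies lines from reader to writer,
    // skipping every line that matches the pattern.
    public static void CopyWithoutMatches(TextReader reader, TextWriter writer, Regex pattern)
    {
        string line;
        // ReadLine returns null once the input is exhausted.
        while ((line = reader.ReadLine()) != null)
        {
            if (!pattern.IsMatch(line))
            {
                writer.WriteLine(line);
            }
        }
    }

    // Streams the file through the filter, so only one line is held in memory at a time.
    public static void RemoveFromFile(string path, string pattern)
    {
        Regex regex = new Regex(pattern, RegexOptions.IgnoreCase);
        string tempPath = path + "_TEMP"; // assumed temp-file naming

        File.Move(path, tempPath);
        using (StreamReader reader = new StreamReader(tempPath))
        using (StreamWriter writer = new StreamWriter(path))
        {
            CopyWithoutMatches(reader, writer, regex);
        }
        File.Delete(tempPath);
    }
}
```

A call such as RegexLineRemover.RemoveFromFile("your file path", "My name is .*") would then cover the wildcard case from the question. The pattern is compiled once outside the loop, which keeps the per-line cost down.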
Question has a verified solution.