Solved

Search for text and delete from file

Posted on 2009-05-08
Last Modified: 2012-05-06
How do I search a file for a string of text, using a wildcard, and then delete the entire matching line from the file?

if line contains "My name is *", delete line from file
Question by:MichelleLacy

Expert Comment

by:ddayx10
You might try looking at this post; it gives a pretty sound idea of the limitations of what you are trying to do, and a solution:

http://www.experts-exchange.com/Programming/Programming_Languages/Visual_Basic/VB_Controls/Q_21696282.html

Expert Comment

by:iHadi
Hi,

Try the following code (I'm assuming you're not opening text files larger than tens of megabytes):
string filePath = "your file path";
string fileContent = File.ReadAllText(filePath);
fileContent = Regex.Replace(fileContent, "My name is.*", "", RegexOptions.Multiline | RegexOptions.IgnoreCase);
File.WriteAllText(filePath, fileContent);

Expert Comment

by:zwei
iHadi, your code doesn't delete the whole line.

How about this modification:
string filePath = "file path";
string fileContent = File.ReadAllText(filePath);
String[] lines = fileContent.Split('\n');
fileContent = "";

foreach (String line in lines)
{
    if (!Regex.Match(line, ".*My name is.*", RegexOptions.IgnoreCase).Success)
    {
        fileContent += line + '\n';
    }
}

File.WriteAllText(filePath, fileContent);
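
A regex-only variant that removes the whole line is also possible. The following is a minimal sketch, assuming the file uses \n or \r\n line endings and is small enough to hold in memory; `LineRemover` and `RemoveMatchingLines` are hypothetical names that do not appear in the thread.

```csharp
using System;
using System.Text.RegularExpressions;

public static class LineRemover
{
    // Removes every line containing "My name is" (case-insensitive),
    // together with the line terminator that follows it.
    // Assumption: lines end in \n or \r\n (or the file simply ends).
    public static string RemoveMatchingLines(string content)
    {
        // ^ matches at the start of each line because of RegexOptions.Multiline.
        // [^\r\n]* keeps the match inside the current line.
        // (\r?\n|\z) consumes the terminator, or stops at the end of the input.
        return Regex.Replace(
            content,
            @"^[^\r\n]*My name is[^\r\n]*(\r?\n|\z)",
            string.Empty,
            RegexOptions.Multiline | RegexOptions.IgnoreCase);
    }
}
```

It could be used as `File.WriteAllText(filePath, LineRemover.RemoveMatchingLines(File.ReadAllText(filePath)));`. Consuming the terminator as part of the match is what keeps an empty line from being left behind.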

Accepted Solution

by:tculler (earned 500 total points)
Unfortunately, there is no way to actually "delete" something from the middle of a file; you have to rewrite the file without it. You have a few choices; I'll give you an un-optimized version of one of them (which will work for any size file, theoretically). Using Regex is somewhat overkill in this situation, "split" won't work correctly if the file uses different line delimiters (\n, \r\n, etc.), and File.ReadAllLines() will read the whole file into memory FIRST, potentially using up way too much memory. The solution below should be able to 'dodge' any of these potential problems. However, I did not optimize it completely, because the goal is clarity. Modify it as you wish.
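void RemoveFromFile(System.IO.FileInfo mySource)
{
    // Remember the original path; the filtered copy is written back here.
    System.String output = mySource.FullName;

    // Move the original aside, keeping it in the same directory.
    // MoveTo also updates mySource so that it now refers to the temporary file.
    System.String tempName = System.IO.Path.GetFileNameWithoutExtension(mySource.Name) + "_TEMP" + mySource.Extension;
    mySource.MoveTo(System.IO.Path.Combine(mySource.DirectoryName, tempName));

    String line = null;
    using (System.IO.StreamReader reader = new System.IO.StreamReader(mySource.OpenRead()))
    using (System.IO.StreamWriter writer = new System.IO.StreamWriter(System.IO.File.OpenWrite(output)))
    {
        // Stream line by line so that only one line is held in memory at a time.
        while ((line = reader.ReadLine()) != null)
        {
            if (!line.Contains("My name is "))
            { writer.WriteLine(line); }
        }
    }

    // Both streams are closed at this point, so the temporary file can be deleted.
    mySource.Delete();
}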

Expert Comment

by:zwei
@tculler: Good solution, reading and writing at the same time. I think I saw an example of that in ddayx10's link. I would still recommend using regexes, though (to allow for future modifications). To match case-insensitively, this code would probably need .ToLower() on every line and on "My name is ".
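
As an alternative to lowercasing both strings, a case-insensitive check can be done with `String.IndexOf` and a `StringComparison` value. This is only a sketch; `LineMatcher` and `ContainsIgnoreCase` are hypothetical names, not part of the code posted in this thread.

```csharp
using System;

public static class LineMatcher
{
    // Returns true if 'line' contains 'text', ignoring case,
    // without allocating lowercased copies of either string.
    public static bool ContainsIgnoreCase(string line, string text)
    {
        return line.IndexOf(text, StringComparison.OrdinalIgnoreCase) >= 0;
    }
}
```

In the streaming loop, the condition would then read `if (!LineMatcher.ContainsIgnoreCase(line, "My name is "))`.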

Expert Comment

by:tculler
I never check links, and question askers usually don't either. If they're asking a question, they should have already researched using such obvious resources.

I didn't do a ToLower(), or use regex, because I felt that, from the question asked, the problem was specific enough to home in on one thing, do that one thing, and do it very well. RegEx is awesome, but it does add a bit of overhead (though it will save time, and probably boost efficiency, in more complex situations). As long as you know this won't change, this solution is fairly optimized and expandable. Of course, if there is more content you must filter out, RegEx may become a necessity (though you would just replace the call to .Contains with a Regex call). Let me know if you'd like to see the implementation of my solution using RegEx, and I'll whip one up.
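
For reference, a regex-based version of the streaming approach might look roughly like the sketch below. It is not tculler's implementation: `WildcardLineFilter`, `FromWildcard`, and `CopyNonMatching` are hypothetical names, and it assumes that `*` is the only wildcard character that needs support.

```csharp
using System;
using System.IO;
using System.Text.RegularExpressions;

public static class WildcardLineFilter
{
    // Converts a wildcard pattern, where * stands for any run of characters,
    // into a case-insensitive Regex.
    public static Regex FromWildcard(string wildcard)
    {
        // Regex.Escape turns * into \*, which is then widened to .*
        string pattern = Regex.Escape(wildcard).Replace(@"\*", ".*");
        return new Regex(pattern, RegexOptions.IgnoreCase);
    }

    // Copies every line that does NOT match the regex from reader to writer,
    // holding only one line in memory at a time.
    public static void CopyNonMatching(TextReader reader, TextWriter writer, Regex regex)
    {
        string line;
        while ((line = reader.ReadLine()) != null)
        {
            if (!regex.IsMatch(line))
            {
                writer.WriteLine(line);
            }
        }
    }
}
```

With files, the reader and writer could be a `StreamReader` over the temporary copy and a `StreamWriter` over the original path, each inside a `using` block. Taking `TextReader` and `TextWriter` rather than file paths keeps the filter independent of where the text comes from.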