We help IT Professionals succeed at work.

C#, Regex, Extract multiple sections of text

I'm scraping some HTML that has elided sections marked by strike-through, such as

<BR>Line 1
<BR>Line 2
<BR><STRIKE>Strike 1</STRIKE>
<BR>Line 3 some text <STRIKE>Strike 2</STRIKE> more text
<BR><STRIKE>Strike 3</STRIKE>
<BR>Line 4
<BR><STRIKE>Strike 4</STRIKE>

I can write a loop that iterates over this text, removing strike-throughs one at a time, but that's lame.

You'd think I could get either a String with all strike-throughs removed, or at least a MatchCollection of everything that isn't struck through.

Kind of a fun problem...What would you do?
Comment
Watch Question

Top Expert 2007
Commented:
Doh!  I must be tired...

String result = Regex.Replace(fileContents,@"<STRIKE>(.*?)</STRIKE>","",RegexOptions.Singleline);