Grep pattern in TextWrangler

I'm trying to determine a grep pattern that will allow me to find and replace all lines that look like this:
<style type=""text/css"">/*<![CDATA[*/    span.c1 {BACKGROUND-COLOR: #c0c0c0}    /*]]>*/    </style>

Basically anything between style tags.
<style?[^>]*> will get be that first style tag, but I'm hitting a wall getting the selection to extend to the last </style>

I'm putting this in a generic linux category cause I would assume that the grep rules would be pretty close?
LVL 2
musickmannData AnalystAsked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

Gerwin Jansen, EE MVETopic Advisor Commented:
Do you want to replace the complete line or just what's in between the tags? Also: will you have different first tags and always just </style> as as closing tag? Replacing what's in between the tags could be done with sed where you have 3 RE's and where you replace the 2nd RE with whatever you want.
0
musickmannData AnalystAuthor Commented:
I am looking to remove the style tags completely. I'm not sure what sed and 3RE's refers to though.
0
serialbandCommented:
RE == Regular Expression.

sed == Stream EDitor - Command line program

You probably should have added the Mac zones since Text Wrangler is a Mac product.

You should do it in 2 passes, since that's a little easier.   If you have the first one made, then use that.  The 2nd one should just be </style>
0
The Ultimate Tool Kit for Technolgy Solution Provi

Broken down into practical pointers and step-by-step instructions, the IT Service Excellence Tool Kit delivers expert advice for technology solution providers. Get your free copy for valuable how-to assets including sample agreements, checklists, flowcharts, and more!

Gerwin Jansen, EE MVETopic Advisor Commented:
I'm assuming the style tags are across multiple lines, right?
0
musickmannData AnalystAuthor Commented:
Thanks, I was so single minded on this I didn't even think outside of the scope to sed. I haven't used the sed command much, so wouldn't know where to start there.

I think I wasn't clear enough still yet - I want to remove the style tags and everything in between. I could certainly remove the opening and closing tags in two passes, but I'm having trouble finding a way to select the two tags and everything in between to get removed.

If there is a sed command that makes this easier, I could attempt that. I imagine is something like
sed [pattern] [filename] from the terminal?
0
Gerwin Jansen, EE MVETopic Advisor Commented:
>> I want to remove the style tags and everything in between
I understand, you mean by 'everything in between' that it can be spread over several lines?

About sed: yes it's like that, you give it some patterns to search for and when found perform some actions (basically).
0
Sigurdur ArmannssonDesigner Commented:
In Text Wrangler:

Find: <style.</style> would find the whole style statement.
0
Sigurdur ArmannssonDesigner Commented:
Ooops sorry. Ignore that :D
0
Sigurdur ArmannssonDesigner Commented:
Rather: <style.+?style>
1
serialbandCommented:
I installed textwrangler and matched your example with the following.  You should try out this search pattern:

<style*[^>]*>?[^>]*>[^>]*style>

It should match your example exactly and should span multiple lines as well as some different entries.
<style type=""text/css"">/*<![CDATA[*/    span.c1 {BACKGROUND-COLOR: #c0c0c0}    /*]]>*/    </style>

<style type=""text/css"">/*<![CDATA[*/  
  span.c1 {BACKGROUND-COLOR: #c0c0c0}    /*]]>*/    </style>

<style type=""text/css"">/*<![CDATA[*/    span.c4hca,.phthpa {BACKGROUND-COLOR: #c0c0c0}    /*]]>*/    </style>
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
musickmannData AnalystAuthor Commented:
Awesome, that worked perfectly, and I don't think I would have ever gotten there on my own. I've been going over the grep references trying to figure out the syntax and just not getting it very well.

I usually try to narrate through statements like this but I guess I'm just not sure what grep is looking at. If I understand the components of the expression:
* = 0 or more previous characters
[^>] = any character that is NOT >
? = 0 or 1 previous characters

So I'm not sure how that stings together to tell it to find all the characters between the two style tags.
0
serialbandCommented:
TextWrangler's grep seems to be implemented a little differently from command line grep.  Otherwise I would have been able to match it with <style*style>.
0
Sigurdur ArmannssonDesigner Commented:
There are many different flavours of grep. Like for instance InDesign has few commands that are not available else where.

in TextWrangler <style*style> does not find anything but <style.+?style> finds any character, one or more
(except a return)  between <style and style. Actually it could be shortened down to <s.+?e>
The question mark makes the command less greedy.
0
serialbandCommented:
Then you should have gotten the points for your much simpler solution.  I only just installed TextWrangler yesterday for testing that.
0
Sigurdur ArmannssonDesigner Commented:
No problem about the points. He used your suggestion.

Try out my solution just for fun and try to make it not work. :D
0
musickmannData AnalystAuthor Commented:
Sigurdur - I'm sorry, I never even saw your comments until just now when I got the last email, I was refreshing this page yesterday and only saw the comment from serialband.

However, you're solution seems to require the string to be on a single line. In this one example, that happens to be the case, but I have some other files I'll be working with in the coming days where that will not be the case.
I'm sorry I didn't see the solution earlier, both work and do exactly what I was looking for. I tried several variants of the * .* +* etc but I guess it was the ? that I needed to use.
0
Sigurdur ArmannssonDesigner Commented:
Nevermind the points. :D

The only thing that breaks the . command is the return, but one of the things you could use to include a return if the line breaks is like this:

<style.*?\r*.*?style>

This is <style, any character zero or more time, return zero or more times, any character zero ore more times and then style> to close.
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Linux

From novice to tech pro — start learning today.