Solved

Replace HTML tags via regular expressions in OpenOffice

Posted on 2010-08-31
2
842 Views
Last Modified: 2012-05-10
I am trying to strip all html tags from a specific column in a csv document which I am opening in OpenOffice. I am trying to use the regular expression find/replace.

I tried <(.|\n)+?> to replace all html tags with an empty string, but when I do this open office just wipes out all content.

I also tried this solution http://www.zyxware.com/articles/643/strip-html-tags-from-your-openoffice-document-using-regular-expressions and the only thing it seems to be removing is occurrences of <p>&nbsp;</p> from all columns while ALL other html tags remain in the cells.

Below is an example of text that I need stripped off HTML tags.
<p>A wooden pistol with metal accents and painted red, black and green.&nbsp; These pistols were used by Tim Burton's aliens in the film <em>Mars Attacks!</em> (Warner Bros., 1996).&nbsp; The alien weapons are stolen by humans, like Tom Jones, who turn them against their invaders.&nbsp; This item was acquired directly from Warner Bros. Studio and accompanied by a certificate of authenticity from the studio and a copy of the film.&nbsp; Length, 15 inches</p>
<p>&nbsp;</p>
<object height="344" width="425">
<param name="movie" value="http://www.youtube.com/v/VYHeZCEFwhI&amp;hl=en_US&amp;fs=1?rel=0" />
<param name="allowFullScreen" value="true" />
<param name="allowscriptaccess" value="always" /><embed height="344" width="425" src="http://www.youtube.com/v/VYHeZCEFwhI&amp;hl=en_US&amp;fs=1?rel=0" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true"></embed></object>

Open in new window

0
Comment
Question by:SWB-Consulting
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
2 Comments
 
LVL 75

Accepted Solution

by:
käµfm³d   👽 earned 500 total points
ID: 33570057
Try:
</?[^>]+>

Open in new window

0
 
LVL 13

Expert Comment

by:marchent
ID: 33570077
what about this?

<[^>]*>
0

Featured Post

Technology Partners: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

As most anyone who uses or has come across them can attest to, regular expressions (regex) are a complicated bit of magic. Packed so succinctly within their cryptic syntax lies a great deal of power. It's not the "take over the world" kind of power,…
Do you hate spam? I do, and I am willing to bet you do as well. I often wonder, though, "if people hate spam so much, why do they still post their email addresses on the web?" I'm not talking about a plain-text posting here. I am referring to the fa…
Learn how to match and substitute tagged data using PHP regular expressions. Demonstrated on Windows 7, but also applies to other operating systems. Demonstrated technique applies to PHP (all versions) and Firefox, but very similar techniques will w…
Explain concepts important to validation of email addresses with regular expressions. Applies to most languages/tools that uses regular expressions. Consider email address RFCs: Look at HTML5 form input element (with type=email) regex pattern: T…

691 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question