I am trying to strip all HTML tags from a specific column in a CSV document that I have opened in OpenOffice, using the regular-expression find/replace.
I tried <(.|\n)+?> to replace all HTML tags with an empty string, but when I do this OpenOffice just wipes out all content.
I also tried this solution: http://www.zyxware.com/articles/643/strip-html-tags-from-your-openoffice-document-using-regular-expressions
but the only thing it seems to remove is occurrences of <p> </p> from all columns, while ALL other HTML tags remain in the cells.
Below is an example of the text that I need stripped of HTML tags.
<p>A wooden pistol with metal accents and painted red, black and green. These pistols were used by Tim Burton's aliens in the film <em>Mars Attacks!</em> (Warner Bros., 1996). The alien weapons are stolen by humans, like Tom Jones, who turn them against their invaders. This item was acquired directly from Warner Bros. Studio and accompanied by a certificate of authenticity from the studio and a copy of the film. Length, 15 inches</p>
<object height="344" width="425">
<param name="movie" value="http://www.youtube.com/v/VYHeZCEFwhI&hl=en_US&fs=1?rel=0" />
<param name="allowFullScreen" value="true" />
<param name="allowscriptaccess" value="always" /><embed height="344" width="425" src="http://www.youtube.com/v/VYHeZCEFwhI&hl=en_US&fs=1?rel=0" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true"></embed></object>
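
To make the desired result concrete, here is a rough Python sketch of the same stripping done outside OpenOffice. It is only an illustration under assumptions: the pattern <[^>]+> assumes no attribute value contains a literal ">", and the names strip_tags and strip_column and the 0-based column index are hypothetical, not anything OpenOffice provides.

```python
import csv
import io
import re

# Matches one tag: '<', then any run of characters that are not '>', then '>'.
# Assumption: no attribute value contains a literal '>'.
TAG_RE = re.compile(r"<[^>]+>")


def strip_tags(text):
    """Remove HTML tags from text and collapse leftover whitespace."""
    no_tags = TAG_RE.sub("", text)
    return re.sub(r"\s+", " ", no_tags).strip()


def strip_column(csv_text, column):
    """Return csv_text with tags stripped from one column (0-based index)."""
    reader = csv.reader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for row in reader:
        # Rows that are too short to have the column are written unchanged.
        if column < len(row):
            row[column] = strip_tags(row[column])
        writer.writerow(row)
    return out.getvalue()
```

Because [^>] also matches line breaks, a tag that spans several lines inside one quoted cell is removed in a single match, which is what I was trying to get from the (.|\n) alternation.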