Remove all HTML/XML tags from CSV file

KeterHD asked
Last Modified: 2012-06-27
Hola, everyone!

I've got myself a CSV file, exported from MySQL. Originally, the data served as the DB for a website. For that reason, the text inside the table contains tons of HTML/XML tags that control the layout and styling of the text on a page.

While that's all swell, now that I need to get just the text out of it, it's quite a nightmare. As there are hundreds of variations of design tags within the document, there is no way I could possibly remove them all by hand.

Could you think of a way to strip the document of all tags? I found some solutions using PHP, but I lack knowledge of it, so I can do little with them.
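For illustration, here is one way this could be sketched in Python rather than PHP, using only the standard library. It is a minimal sketch, not a definitive solution: it assumes the file is a regular comma-separated CSV that Python's `csv` module can parse, and the helper names `strip_tags` and `strip_csv` are made up for this example. Each cell is run through `html.parser.HTMLParser`, which keeps the text and drops every tag, so there is no need to list the tag variations.

```python
import csv
import io
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects only the text between tags; the tags themselves are ignored."""

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into plain characters
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_tags(cell):
    """Return the text of one CSV cell with all HTML/XML tags removed."""
    parser = _TextExtractor()
    parser.feed(cell)
    parser.close()
    # Collapse runs of whitespace left behind where tags used to be
    return " ".join("".join(parser.parts).split())


def strip_csv(src, dst):
    """Copy CSV rows from src to dst, stripping tags from every cell."""
    writer = csv.writer(dst)
    for row in csv.reader(src):
        writer.writerow([strip_tags(cell) for cell in row])


if __name__ == "__main__":
    # Small in-memory demonstration; real files would be opened with newline=""
    demo = io.StringIO('id,body\r\n1,"<div class=""x"">Hi, <i>there</i></div>"\r\n')
    out = io.StringIO()
    strip_csv(demo, out)
    print(out.getvalue())
```

For a real file, the same function could be called with two file objects opened with `newline=""` (as the `csv` documentation recommends), for example `open("export.csv", newline="", encoding="utf-8")` for reading and a second file opened with mode `"w"` for writing; both file names and the encoding are placeholders. Note that text from adjacent block tags is joined without a separator, so `<p>a</p><p>b</p>` becomes `ab`; if that matters, the tags could be replaced with a space instead.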