We help IT Professionals succeed at work.
Get Started

VBScript Remove HTML and non UTF-8 Characters

garethtnash asked
Last Modified: 2012-08-29

I have a site that accepts data from a number of different inputs, currently this data is output as an XML feed. However I'm running into issues, as the data can contain both HTML - <p style="astyle"> and non UTF-8 Characters. One of the resources that read the XML have asked us to ensure that all HTML is removed and that the remining text is correctly UTF-8 encoded.

I'm using ASP VBScript to generate the XML, is there a VBscript function that i can use to make sure that the data is as requested?

Many thanks
Watch Question
Top Expert 2004
This problem has been solved!
Unlock 2 Answers and 3 Comments.
See Answers
Why Experts Exchange?

Experts Exchange always has the answer, or at the least points me in the correct direction! It is like having another employee that is extremely experienced.

Jim Murphy
Programmer at Smart IT Solutions

When asked, what has been your best career decision?

Deciding to stick with EE.

Mohamed Asif
Technical Department Head

Being involved with EE helped me to grow personally and professionally.

Carl Webster
CTP, Sr Infrastructure Consultant
Ask ANY Question

Connect with Certified Experts to gain insight and support on specific technology challenges including:

  • Troubleshooting
  • Research
  • Professional Opinions
Did You Know?

We've partnered with two important charities to provide clean water and computer science education to those who need it most. READ MORE