Strip Out MS Word's dirty code from a string...


Does anybody have a good function that strips all MS Word-related markup out of a string and returns a clean, XHTML-compatible string?


I don't think you will find one magic script that accomplishes what you are looking for, but I would suggest reading the following article:

There's also a solution involving JavaScript: a WYSIWYG editor called Xinha.

Xinha example:

After it loads, click on the WYSIWYG editor. There's a button on the toolbar called "Clear MSOffice tags".
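For a rough idea of what a cleanup like that involves, here is a minimal JavaScript sketch. To be clear, this is not Xinha's actual code: the function name `stripWordMarkup` and the list of patterns are my own assumptions about the usual Word leftovers (conditional comments, `<o:p>` tags, `Mso*` classes, inline `style` and `lang` attributes). Regexes are only an approximation here, so for anything serious you would want a proper HTML parser or sanitizer.

```javascript
// Hypothetical helper (not part of Xinha): removes common MS Word
// markup from an HTML string using regular expressions.
function stripWordMarkup(html) {
  return html
    // Conditional comments, e.g. <!--[if gte mso 9]> ... <![endif]-->
    .replace(/<!--\[if[\s\S]*?<!\[endif\]-->/gi, "")
    // Any remaining HTML comments
    .replace(/<!--[\s\S]*?-->/g, "")
    // Embedded <style> and <xml> blocks, including their contents
    .replace(/<(style|xml)\b[\s\S]*?<\/\1>/gi, "")
    // Stray <meta> and <link> tags
    .replace(/<\/?(meta|link)\b[^>]*>/gi, "")
    // Office namespace tags such as <o:p>, </o:p>, <w:WordDocument>
    .replace(/<\/?[a-z]+:[^>]*>/gi, "")
    // class="MsoNormal" and similar (quoted or unquoted)
    .replace(/\s+class=("Mso[^"]*"|'Mso[^']*'|Mso[^\s>]*)/gi, "")
    // Inline style and lang attributes (quoted values only)
    .replace(/\s+(style|lang)=("[^"]*"|'[^']*')/gi, "");
}

// Example usage
const dirty =
  '<p class="MsoNormal" style="margin:0in">' +
  '<span lang="EN-US">Hello<o:p></o:p></span></p>';
console.log(stripWordMarkup(dirty));
```

Note that this leaves attribute-less `<span>` wrappers in place and does not fix unclosed tags, so the result is cleaner but not guaranteed to be valid XHTML.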

You can learn more about Xinha by visiting its website:

phirschybar (Author) commented:
I have used Xinha, and I have also heard good things about TinyMCE. I am still looking for a PHP function that would do it, but this is a start. Thanks.