I am trying to sanitize HTML to XHTML using HTMLTidy.
I have "£" signs in my input HTML. However these seem to be getting corrupted. In GUI Tidy I not that the "£"s are coming as "A£" in the original HTML so no wonder there is corruption.
I am actually doing this conversion in a web app and I can specify the Input and Output encoding. However whatever I try I still get corruption.
Ideas please? This is driving me mad!!!
I could change the "£"s to "£" or "£" in the input HTML if that helps.
Thanks for any help.