I have a page that I have specifically converted to UTF-8 to eliminate unwanted characters. I have verified the encoding and the page comes up fine locally in all browsers. When I parse the page with QueryPath (htmlqp) I am left with a phantom character:
U+00E2 â c3 a2 LATIN SMALL LETTER A WITH CIRCUMFLEX
in place of
U+0027 ' 27 APOSTROPHE
I've tried adding the options convert_from_encoding => utf-8 and strip_low_ascii but I'm still left with this character. Any ideas how to fix this?