Herci
asked on
Character encoding problem
I am trying to write some foreign characters into an xml file but there seems to be a problem encoding some of the characters. I am using UTF-8 for the XML header.
Following characters work fine.
ÀÁÂÃÄÇÈÉÊËÌÍÎÏÑÒÓÔÕÖÙÚÛÜàá âãäåæç
but the following characters œ c a e l n s z z
they gets converted into the following html when writing into the XML.
ć ą ę ł ń ś ź ż
How can I write them in the exact format?
Thanks.
Following characters work fine.
ÀÁÂÃÄÇÈÉÊËÌÍÎÏÑÒÓÔÕÖÙÚÛÜàá
but the following characters œ c a e l n s z z
they gets converted into the following html when writing into the XML.
ć ą ę ł ń ś ź ż
How can I write them in the exact format?
Thanks.
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
ASKER
I've still not figured out a solution for this yet and that's why it took a long time to give an update. I've decided to close this question but I will keep your answers in mind and carry on doing further research on this. Thanks a lot.
A month and a half? And you still could not respond, then you gave a bad grade? What were you expecting? Please read the grading guidelines then explain why you gave the bad grade without any response or explanation! Nobody does this at EE. What was wrong?
http://support.experts-exchange.com/customer/portal/articles/481419
http://support.experts-exchange.com/customer/portal/articles/481419
https://www.experts-exchange.com/Web_Development/Web_Languages-Standards/PHP/A_11880-Unicode-PHP-and-Character-Collisions.html
For us to offer any specific help, we would need to see the test data set and see how it interacts with the program code that creates the XML document. The numeric character entities would seem to be good "visually" when the XML is rendered by a browser, but there is nothing that inherently changes the UTF-8 characters into numeric entities without a specific programmatic step.