Unicode Problem with German characters "Umlaute"

megloff asked on
XML
Hello there!

I'm creating an application with Xerces and Xalan using the Java XML API. I use DOM to create an XML document.

Unfortunately, the special German characters known as "Umlaute" (ä, ö, ü) are not written correctly to the XML document. For example, the "ü" character is turned into the Unicode value "&#xfffd", which is not correct. The correct value would be, for example, "&#x00FC" or the character itself. So it seems that there is some kind of overflow?
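
For reference, U+FFFD is the Unicode replacement character, which decoders substitute when they meet bytes they cannot interpret. The sketch below is only an illustration of one way a "ü" can turn into U+FFFD before it ever reaches the serializer; it is an assumption, not a diagnosis of the application described here. The class and method names are hypothetical.

```java
import java.nio.charset.StandardCharsets;

public class ReplacementCharDemo {
    // Decodes the bytes as UTF-8; malformed input is replaced with U+FFFD.
    public static String decodeAsUtf8(byte[] bytes) {
        return new String(bytes, StandardCharsets.UTF_8);
    }

    // Decodes the bytes as ISO-8859-1, where every byte maps to one character.
    public static String decodeAsLatin1(byte[] bytes) {
        return new String(bytes, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // The single byte 0xFC is "ü" in ISO-8859-1 but is not valid UTF-8.
        byte[] latin1Umlaut = { (byte) 0xFC };

        String wrong = decodeAsUtf8(latin1Umlaut);
        String right = decodeAsLatin1(latin1Umlaut);

        System.out.println(Integer.toHexString(wrong.charAt(0)));
        System.out.println(Integer.toHexString(right.charAt(0)));
    }
}
```

If the string placed into the DOM already contains U+FFFD, no output encoding setting can restore the original character, so it may be worth checking how the text is read in before looking at the serializer.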

I explicitly request iso-8859-1 as the output encoding when serializing my DOM document.
For this I'm using the OutputFormat class
http://xml.apache.org/xerces-j/apiDocs/org/apache/xml/serialize/OutputFormat.html
and set the encoding with
OutputFormat.setEncoding("iso-8859-1")

After that I use the XMLSerializer to produce the output stream
http://xml.apache.org/xerces-j/apiDocs/org/apache/xml/serialize/XMLSerializer.html
XMLSerializer.setOutputFormat(OutputFormat)
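
For comparison, here is a minimal sketch of the same idea (build a DOM, serialize it as ISO-8859-1) using the JDK's built-in JAXP Transformer rather than the Xerces OutputFormat and XMLSerializer classes named above. The class name, method names, and the `greeting` element are made up for illustration. The umlauts are written as Unicode escapes so that the example does not depend on the encoding of the Java source file.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

public class Latin1DomWriter {
    // Builds a one-element DOM and serializes it to ISO-8859-1 bytes.
    public static byte[] serialize(String text) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            Element root = doc.createElement("greeting"); // hypothetical element
            root.setTextContent(text);
            doc.appendChild(root);

            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.setOutputProperty(OutputKeys.ENCODING, "ISO-8859-1");

            // Writing to a byte stream lets the serializer do the encoding itself.
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            transformer.transform(new DOMSource(doc), new StreamResult(out));
            return out.toByteArray();
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    // Serializes the text, parses the bytes again, and returns what came back.
    public static String roundTrip(String text) {
        try {
            Document parsed = DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new ByteArrayInputStream(serialize(text)));
            return parsed.getDocumentElement().getTextContent();
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        String original = "Gr\u00FC\u00DFe"; // "Grüße"
        System.out.println(original.equals(roundTrip(original)));
    }
}
```

The round trip is a quick way to tell whether characters are lost during serialization or were already damaged when the DOM was built.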


So my question is: what am I doing wrong? Why are my special characters not written correctly to the XML document? Do I have to initialize something while creating the DOM?

Thank you in advance
Mark

ASKER CERTIFIED SOLUTION
Computer101
