?
Solved

What is the logic for representing Unicode hex character 0419 as 0xD0 0x99 in Word 2007?

Posted on 2009-07-06
3
Medium Priority
?
259 Views
Last Modified: 2012-05-07
If I insert Unicode character 0419 (hex) in a Word 2007 document and then look at its byte representation in the document.xml file, it appears as <0xD0> <0x99>.

What is the logic behind this conversion?

0
Comment
Question by:wjm013
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
3 Comments
 
LVL 39

Expert Comment

by:itsmeandnobodyelse
ID: 24786005
>>>> I insert Unicode character 0419 (hex) in a Word 2007
How exactly you were doing that?

>>>> it appears as <0xD0> <0x99>.
How do you know it is the character you entered as 0419?

>>>> What is the logic behind this conversion?
Don't know. As far as I could see there is 0x0419 is quite a different character than 0xD099 (or 0x99D0 if the XML prints the bytes from left to right.


I did similar to you recently but I checked the representation in the binary .doc file using the hex editor of VS. Here the hex representations of the UNICODE chars from Lucida Unicode font were prettyly matching with that documented in the charts.
0
 

Author Comment

by:wjm013
ID: 24787278
>>>> I insert Unicode character 0419 (hex) in a Word 2007
>>>>How exactly you were doing that?
1 - Opened a Word 2007 file, chose Insert Symbol and Arial Unicode MS, chose "from Unicode (hex)", navigated to 0419 and inserted it.

2 - Saved the Word 2007 file.

3 - Added ".ZIP" to the end of the filename.

4 - Extracted the Zip files to a separate directory.

5 - Opened "document.xml" in a text editor.

6 - Navigated to the 0419 character, , and chose to view the binary which is "D0 99"
0
 

Accepted Solution

by:
wjm013 earned 0 total points
ID: 24797844

The "D0 99" represents the UTF-8 translation of the 0419 character as specifed
in the XML tag at the top of document.xml:

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
0

Featured Post

[Webinar] Lessons on Recovering from Petya

Skyport is working hard to help customers recover from recent attacks, like the Petya worm. This work has brought to light some important lessons. New malware attacks like this can take down your entire environment. Learn from others mistakes on how to prevent Petya like worms.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Do you ever need to create a 20 page Word document for some testing purpose? Are you tired of copying & pasting old boring "lorem ipsum" text over and over again, increasing font size and line space in order to make the document 20+ pages long? Look…
This article shows how to get a list of available printers for display in a drop-down list, and then to use the selected printer to print an Access report or a Word document filled with Access data, using different syntax as needed for working with …
Learn how to make your own table of contents in Microsoft Word using paragraph styles and the automatic table of contents tool. We'll be using the paragraph styles in Word’s Home toolbar to help you create a table of contents. Type out your initial …
This Experts Exchange video Micro Tutorial shows how to tell Microsoft Office that a word is NOT spelled correctly. Microsoft Office has a built-in, main dictionary that is shared by Office apps, including Excel, Outlook, PowerPoint, and Word. When …
Suggested Courses

765 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question