Solved

What is the logic for representing Unicode hex character 0419 as 0xD0 0x99 in Word 2007?

Posted on 2009-07-06
Last Modified: 2012-05-07
If I insert Unicode character 0419 (hex) in a Word 2007 document and then look at its byte representation in the document.xml file, it appears as <0xD0> <0x99>.

What is the logic behind this conversion?

Question by:wjm013

Expert Comment

by:itsmeandnobodyelse
>>>> I insert Unicode character 0419 (hex) in a Word 2007
How exactly did you do that?

>>>> it appears as <0xD0> <0x99>.
How do you know it is the character you entered as 0419?

>>>> What is the logic behind this conversion?
I don't know. As far as I can see, 0x0419 is quite a different character from 0xD099 (or 0x99D0, if the XML prints the bytes from left to right).


I did something similar recently, but I checked the representation in the binary .doc file using the hex editor in VS. There, the hex representations of the Unicode characters from the Lucida Unicode font matched those documented in the charts quite well.


Author Comment

by:wjm013
>>>> I insert Unicode character 0419 (hex) in a Word 2007
>>>> How exactly did you do that?
1 - Opened a Word 2007 file, chose Insert Symbol and Arial Unicode MS, chose "from Unicode (hex)", navigated to 0419 and inserted it.

2 - Saved the Word 2007 file.

3 - Added ".ZIP" to the end of the filename.

4 - Extracted the Zip files to a separate directory.

5 - Opened "document.xml" in a text editor.

6 - Navigated to the 0419 character, Й, and chose to view its bytes, which are "D0 99".

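For anyone who wants to repeat steps 3 to 6 without renaming and unzipping by hand, here is a minimal Python sketch. It assumes the main document part sits at `word/document.xml` inside the archive, which is where Word normally puts it; the function names and the `example.docx` file name are made up for illustration.

```python
import zipfile


def read_document_xml(docx_path: str) -> bytes:
    """Return the raw bytes of word/document.xml from a .docx archive.

    A .docx file is a ZIP archive, so no renaming to .zip is needed.
    The member name is an assumption based on Word's usual layout.
    """
    with zipfile.ZipFile(docx_path) as archive:
        return archive.read("word/document.xml")


def find_encoded_char(raw_xml: bytes, char: str, encoding: str = "utf-8") -> int:
    """Return the byte offset of char's encoded form in raw_xml, or -1."""
    return raw_xml.find(char.encode(encoding))


# Usage (hypothetical file name):
#   raw = read_document_xml("example.docx")
#   offset = find_encoded_char(raw, "\u0419")
#   if offset >= 0:
#       print(raw[offset:offset + 2].hex(" "))
```

The search works on raw bytes rather than decoded text, so what it finds is exactly what a hex view of the file would show.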

Accepted Solution

by:wjm013

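The "D0 99" is the UTF-8 encoding of the 0419 character. UTF-8 stores code points from 0080 to 07FF in two bytes of the form 110xxxxx 10xxxxxx, and the 11 bits of 0419 (10000 011001) fill those slots to give D0 99. UTF-8 is the encoding specified in the XML declaration at the top of document.xml: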

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
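
To make the bit shuffling concrete, here is a small Python sketch that encodes a single code point by hand and checks it against Python's built-in encoder. The helper name `utf8_encode_code_point` is my own, not anything from Word or a library; it is only meant to illustrate the standard UTF-8 layout.

```python
def utf8_encode_code_point(cp: int) -> bytes:
    """Encode one Unicode code point as UTF-8 by hand (illustrative only)."""
    if cp < 0:
        raise ValueError("code point must be non-negative")
    if cp < 0x80:
        # 1 byte: 0xxxxxxx
        return bytes([cp])
    if cp < 0x800:
        # 2 bytes: 110xxxxx 10xxxxxx (this is the case for 0x0419)
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    if cp < 0x10000:
        if 0xD800 <= cp <= 0xDFFF:
            raise ValueError("surrogates cannot be encoded in UTF-8")
        # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([
            0xE0 | (cp >> 12),
            0x80 | ((cp >> 6) & 0x3F),
            0x80 | (cp & 0x3F),
        ])
    if cp <= 0x10FFFF:
        # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
        return bytes([
            0xF0 | (cp >> 18),
            0x80 | ((cp >> 12) & 0x3F),
            0x80 | ((cp >> 6) & 0x3F),
            0x80 | (cp & 0x3F),
        ])
    raise ValueError("code point out of Unicode range")


encoded = utf8_encode_code_point(0x0419)
print(encoded.hex(" "))                     # d0 99
print(encoded == "\u0419".encode("utf-8"))  # True
```

For 0x0419 the top five bits (10000) go into the first byte after the 110 prefix, giving 0xD0, and the low six bits (011001) go into the second byte after the 10 prefix, giving 0x99.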