Solved

What is the logic for representing Unicode hex character 0419 as 0xD0 0x99 in Word 2007?

Posted on 2009-07-06
3
243 Views
Last Modified: 2012-05-07
If I insert Unicode character 0419 (hex) in a Word 2007 document and then look at its byte representation in the document.xml file, it appears as <0xD0> <0x99>.

What is the logic behind this conversion?

0
Comment
Question by:wjm013
  • 2
3 Comments
 
LVL 39

Expert Comment

by:itsmeandnobodyelse
ID: 24786005
>>>> I insert Unicode character 0419 (hex) in a Word 2007
How exactly you were doing that?

>>>> it appears as <0xD0> <0x99>.
How do you know it is the character you entered as 0419?

>>>> What is the logic behind this conversion?
Don't know. As far as I could see there is 0x0419 is quite a different character than 0xD099 (or 0x99D0 if the XML prints the bytes from left to right.


I did similar to you recently but I checked the representation in the binary .doc file using the hex editor of VS. Here the hex representations of the UNICODE chars from Lucida Unicode font were prettyly matching with that documented in the charts.
0
 

Author Comment

by:wjm013
ID: 24787278
>>>> I insert Unicode character 0419 (hex) in a Word 2007
>>>>How exactly you were doing that?
1 - Opened a Word 2007 file, chose Insert Symbol and Arial Unicode MS, chose "from Unicode (hex)", navigated to 0419 and inserted it.

2 - Saved the Word 2007 file.

3 - Added ".ZIP" to the end of the filename.

4 - Extracted the Zip files to a separate directory.

5 - Opened "document.xml" in a text editor.

6 - Navigated to the 0419 character, , and chose to view the binary which is "D0 99"
0
 

Accepted Solution

by:
wjm013 earned 0 total points
ID: 24797844

The "D0 99" represents the UTF-8 translation of the 0419 character as specifed
in the XML tag at the top of document.xml:

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

In Easy String Encryption Using CryptoAPI in C++ (http://www.experts-exchange.com/viewArticle.jsp?aid=1193) I described how to encrypt text and recommended that the encrypted text be stored as a series of hexadecimal digits -- because cyphertext may…
This is written from a 'VBA for MS Word' perspective, but I am sure it applies to most other MS Office components where VBA is used.  One thing that really bugs me is slow code, ESPECIALLY when it's mine!  In programming there are so many ways to…
This video walks the viewer through the process of creating an MLA formatted document, as well as a bibliography with citations.
Learn how to create and modify your own paragraph styles in Microsoft Word. This can be helpful when wanting to make consistently referenced styles throughout a document or template.

920 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

17 Experts available now in Live!

Get 1:1 Help Now