

Fonts converting to Chinese, seemingly at random

Posted on 2014-12-21
Medium Priority
Last Modified: 2015-03-03
Here's a strange one that I don't expect many answers to, but I'm throwing it out there.

I have a client whose outgoing email (Outlook 2010, on POP3 to their ISP) looks normal when it is sent. But when someone replies (anyone), every instance of apostrophe-m in the original message the client typed is replaced with the Chinese characters 鈥檓

The emails don't leave that way, but they come back that way. The text the other person types in their reply is affected too. It's not just one sender; it seems to be any sender who replies.

So yes, a weird one. Hopefully someone has run across this in their vast experience and has a suggestion.

By the way, if you Google search for:  鈥檓
you will see tons and tons of examples where I'm was replaced with I鈥檓
Question by:FocIS
LVL 58

Expert Comment

ID: 40511706
What character encoding do you have set on the email?

Author Comment

ID: 40512104
Checked: "Automatically select encoding for outgoing messages"
Selected: Western European (ISO)

I should mention that every other character in the emails appears perfectly fine.
LVL 58

Expert Comment

ID: 40512108
I would set a meta tag to specify UTF-8.
Without any traceability it's hard to know where the characters are getting converted, but the symptom you describe is usually associated with character encoding.

LVL 16

Expert Comment

ID: 40512850
I agree with Gary that using UTF-8 encoding should avoid the problem, and that the underlying problem is associated with character encoding.

I suspect (but can't prove from the example texts) that the root cause may be that the 'apostrophe' character used in the original source document:

Is not the standard ASCII apostrophe character, which is decimal code 39, hexadecimal 27.
Is instead (perhaps courtesy of something like Word) the 'right single quotation mark', sometimes referred to as one of the 'curly quote' characters. This is Unicode code point U+2019, but in the Windows ANSI code set it is mapped to decimal code 146, hexadecimal 92, a value that Unicode and other code sets reserve for the (little used) C1 control-code characters.
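
The numbers quoted above can be checked with a few lines of Python. This is only a minimal sketch: the variable names are made up for illustration, and "cp1252" is Python's codec name for the Windows ANSI code page being described.

```python
import unicodedata

straight = "'"        # ASCII apostrophe
curly = "\u2019"      # right single quotation mark

# Print the code point and official Unicode name of each character.
for ch in (straight, curly):
    print(f"U+{ord(ch):04X}", ord(ch), unicodedata.name(ch))

# In Windows-1252 the curly quote is stored as one byte.
cp1252_bytes = curly.encode("cp1252")
print(cp1252_bytes.hex(), cp1252_bytes[0])
```

The last line prints the hexadecimal and decimal values of the single byte that Windows-1252 uses for the curly quote.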
LVL 16

Expert Comment

ID: 40512863
Not sure whether or not E-E will display the following correctly, but here goes:

ASCII/Unicode code-point U+0027 is character '
Unicode code-point U+2019 is character ’
LVL 16

Expert Comment

ID: 40512874
... and in the Western European (ISO) coded character set (otherwise known as ISO 8859-1), hexadecimal 92 is not a graphic character; it is (as in the Unicode super-set) reserved for one of the C1 control-code characters.
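
As a rough illustration of that difference, the same byte can be decoded under both code sets in Python. This is a sketch only; "latin-1" and "cp1252" are Python's codec names for ISO 8859-1 and the Windows ANSI code page, and the variable names are hypothetical.

```python
import unicodedata

byte_92 = b"\x92"

as_latin1 = byte_92.decode("latin-1")   # ISO 8859-1 reading
as_cp1252 = byte_92.decode("cp1252")    # Windows ANSI reading

# Unicode general category: "Cc" is a control character,
# "Pf" is final (closing) quote punctuation.
print(f"U+{ord(as_latin1):04X}", unicodedata.category(as_latin1))
print(f"U+{ord(as_cp1252):04X}", unicodedata.category(as_cp1252))
```

So a receiver that treats the byte as ISO 8859-1 gets an invisible control character, while one that treats it as Windows-1252 gets the curly quote.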
LVL 16

Accepted Solution

DansDadUK earned 2000 total points
ID: 40514669
A few more diagnostics:

Saving this web page, then viewing it within a hexadecimal editor shows that the several instances of the characters 鈥檓 are each represented by the hexadecimal code e988a5e6aa93.

Looking at this in more detail:

hexadecimal e988a5 is the UTF-8 representation of the 16-bit Unicode value U+9225, which is the character 鈥
hexadecimal e6aa93 is the UTF-8 representation of the 16-bit Unicode value U+6a93, which is the character 檓
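
For anyone who wants to repeat this check without a hexadecimal editor, a short Python sketch can decode the same six bytes. The hex string is the one quoted from the saved page; the variable names are illustrative.

```python
# The six bytes found in the saved web page.
raw = bytes.fromhex("e988a5e6aa93")

# Decode them as UTF-8 and list the resulting code points.
text = raw.decode("utf-8")
code_points = [f"U+{ord(ch):04X}" for ch in text]

print(text, code_points)
```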

Note that the most significant byte of the first 16-bit value (U+9225) is 0x92, which (as mentioned earlier) is the code point associated with the 'right single quotation mark' in the Windows ANSI coded character set (code page 1252).
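
Another way to probe where the conversion happens is to take the curly quote plus "m", encode it as UTF-8, and then read those bytes back under a Chinese code page. The sketch below assumes GBK (Python's "gbk" codec) as the code page doing the misreading. That is an assumption for the experiment, not something established in this thread.

```python
original = "\u2019m"                      # right single quotation mark + m

utf8_bytes = original.encode("utf-8")     # bytes e2 80 99 6d
misread = utf8_bytes.decode("gbk")        # same bytes, read as GBK

print(utf8_bytes.hex(), misread)

# Reversing the two steps recovers the original text.
repaired = misread.encode("gbk").decode("utf-8")
print(repaired == original)
```

If the output matches the characters seen in the replies, that would point towards a UTF-8 message being labelled or interpreted as GBK somewhere along the reply path.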

Your easiest way of checking whether the above has anything to do with your symptoms is to switch off use of smart (curly) quotes in the text editor (probably Word) used by your Outlook user.
See support page for details.
LVL 16

Expert Comment

ID: 40641836
... and I've just come across this rather good article, entitled Unicode, PHP, and Character Collisions, written by Ray Paseur, which provides some more background on such character encoding problems.


Question has a verified solution.
