Link to home
Start Free TrialLog in
Avatar of chima
chima

asked on

Notepad++ Encoding of various language

Hello
I have some English text, and I am using google translate to get the text for the language I want.  I then copy and paste this text into Notepad++.  Before I paste I select the proper encoding.  The text is pasted properly.
The problem is in saving the file.  The file will be a close caption, therefore I save it as a .srt file.  When I go back and open the file the text is not correct.  The same problem occurs when I save it as a text file.
I created an Arabic file/srt file and this one opens correctly each and every time.  
Could it be that Notepad++ does not work with the other encoding settings?
ASKER CERTIFIED SOLUTION
Avatar of _3mp3ror_
_3mp3ror_
Flag of Thailand image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Avatar of chima
chima

ASKER

_3mp3ror_ I apologize for the delay.  Upon posting this question, I remembered that I had to save it as a UTF-8 file (because I had done it before), and off I went trying to make my files work.  Some did and others did not work.  Little did I think that the font would be a factor.
So now, if I may ask; Might you know of a working app to tests the files I have?  The app that I am testing may not be working, meaning that my file are okay, but the app is not.
I'll check more into the font relative to the files I have that do not work.
One of the files is Chinese traditional that is not working, any thoughts?
thanks
Avatar of chima

ASKER

Thank you by: _3mp3ror_, the UTF-8 is the setting to save the various languages.
Maybe the file was made with GB18030 encoding ?
How does GB18030 differ from Unicode?
http://stackoverflow.com/questions/223360/how-does-gb18030-differ-from-unicode

Chinese Character Encoding Standards (to help you understand the specs in third-party font and application packages)
http://www.pinyinjoe.com/pinyin/encoding.htm

http://www.dynalab.com/language/en-US/Products/GB18030_Standard.aspx

Before the release of GB18030, most of us only used the GB2312 standard. However, there are only 6,000+ Simplified Chinese characters in GB2312. It is absolutely not good enough for the document communications between China and outside world. The GB18030 standard includes more than 27,484 Chinese characters. It covers both the Simplified and Traditional Chinese characters on one single platform, this should definitely solve the conversion problems between the GB and BIG5 encodings.


http://sourceforge.net/tracker/index.php?func=detail&aid=3178272&group_id=95717&atid=612385
Need Support For GB18030 - ID: 3178272

I'm editing a text document that's written in GB18030 (a Chinese character set standard).
However, Notepad++ only supports GB2312, some characters cannot be displayed.
Please add support for GB18030.
Thanks!

There is anohter editor that supports this encoding but it's somewhat more complicated to use
http://tech.groups.yahoo.com/group/vim/message/101980

Notepad++ has Big5 support (Encoding > Character Sets > Chinese > Big5)
Big5 is a Chinese character encoding method used in Taiwan, Hong Kong, and Macau for Traditional Chinese characters.

If the file is GB18030 encoded you could try this tool that will convert it to Unicode
http://www.microsoft.com/download/en/details.aspx?id=5503

After the support package is successfully installed, the gbunicnv.exe and ms4bsp.dll will be installed under SystemDrive\Program Files\GB18030Tools\folder.