Link to home
Start Free TrialLog in
Avatar of Tsirapi28
Tsirapi28

asked on

ASCII to UTF8 Conversion of flat files

Hello.

I have a flat file that contains ASCII characters mixed with Japanese Kanji. I am tryin gto get an application to read this flat file, but it is unable to read the Kanji, unless I convert the flat file into UTF8 format.

The file is on a Windows 2003 server machine. Does anyone know / has a working program that will convert ASCII to UTF8?

Thanks
Tagari28.
Avatar of catoaguilar
catoaguilar

Avatar of Tsirapi28

ASKER

Thanks, but are there any - sources recommendations about writting C / Java / VB to convert ACSII to UTF8?
How can an ASCII file contain Kanji ?
Such characters requre a Unicode variant - such as UTF-8. It's more likely that the file is encoded in ISO-10646 or something.

Are you able to identify which one?
There's also a program called 'iconv'. It converts from many charsets to many. But there will sure be specific japanese charcode converters around..

On windows, the plain file is probably in Shift-Jis (SJIS) format. If not, try JIS or EUC-JP.

The great and free JWPCE editor will probably be able to open it anyway.. from that you can save to many format.
http://www.physics.ucla.edu/~grosenth/c_main.html#DOWNLOAD
Thanks to all for your input.

How ASCII contains Kanji?

Here is an extract of the file I'm working with. The code characters are Kanji.

0836KM0ZDM•Û‘SŠÇ—‚R‚O‚O†’n
0836KM0ZNAŒv‘••Û‘S‚Q•ªÍŒv
0836KM0DLE“d‹C•Û‘S
0836KM0FDFƒIƒtƒTƒCƒg•Û‘S
0836KM0FCN‘•’u•Û‘S‚R‚O‚O†’n
0836KM0IXP‹ZpŒŸ¸‚R‚O‚O†’n

This file comes in as .txt format. I guess my first question is how does one tell what is this file encoded in ASCII, UNICODE or what and then how can I feed this file through a program that will produce a UTF8 output. I do not care to see printable charatcers as long as it is in UTF8. My goal is to then feed the file into an application to process the data.


ASKER CERTIFIED SOLUTION
Avatar of BigRat
BigRat
Flag of France image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
I am using Delphi where u can use the function UTF8toAnsi function. Hope this helps u.
SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial