asked on

ASCII to UTF8 Conversion of flat files

Hello.

I have a flat file that contains ASCII characters mixed with Japanese Kanji. I am tryin gto get an application to read this flat file, but it is unable to read the Kanji, unless I convert the flat file into UTF8 format.

The file is on a Windows 2003 server machine. Does anyone know / has a working program that will convert ASCII to UTF8?

Thanks
Tagari28.

catoaguilar

try this:

http://www.parallelgraphics.com/products/cortona/utfconverter/

Tsirapi28

ASKER

Thanks, but are there any - sources recommendations about writting C / Java / VB to convert ACSII to UTF8?

InteractiveMind

How can an ASCII file contain Kanji ?

InteractiveMind

Such characters requre a Unicode variant - such as UTF-8. It's more likely that the file is encoded in ISO-10646 or something.

Are you able to identify which one?

Favago

There's also a program called 'iconv'. It converts from many charsets to many. But there will sure be specific japanese charcode converters around..

On windows, the plain file is probably in Shift-Jis (SJIS) format. If not, try JIS or EUC-JP.

The great and free JWPCE editor will probably be able to open it anyway.. from that you can save to many format.
http://www.physics.ucla.edu/~grosenth/c_main.html#DOWNLOAD

Tsirapi28

ASKER

Thanks to all for your input.

How ASCII contains Kanji?

Here is an extract of the file I'm working with. The code characters are Kanji.

0836KM0ZDM•Û‘SŠÇ—‚R‚O‚O†’n
0836KM0ZNAŒv‘••Û‘S‚Q•ªÍŒv
0836KM0DLE“d‹C•Û‘S
0836KM0FDFƒIƒtƒTƒCƒg•Û‘S
0836KM0FCN‘•’u•Û‘S‚R‚O‚O†’n
0836KM0IXP‹ZpŒŸ¸‚R‚O‚O†’n

This file comes in as .txt format. I guess my first question is how does one tell what is this file encoded in ASCII, UNICODE or what and then how can I feed this file through a program that will produce a UTF8 output. I do not care to see printable charatcers as long as it is in UTF8. My goal is to then feed the file into an application to process the data.

ASKER CERTIFIED SOLUTION

BigRat

membership

This solution is only available to members.

To access this solution, you must be a member of Experts Exchange.

Start Free Trial

Shruti_Basnet

I am using Delphi where u can use the function UTF8toAnsi function. Hope this helps u.

SOLUTION

RNMcLean

membership

This solution is only available to members.

To access this solution, you must be a member of Experts Exchange.

Start Free Trial