Transliteration of non-ASCII (7-bit) characters
Posted on 2005-04-06
Flat files are created on a Unix server whose charset is ISO-8859-5.
These files are then transferred by FTP to another Unix server whose charset is ISO-8859-1.
I need to transliterate “пред” to “pred”.
We have created two mapping files:
one holds the Cyrillic characters and the other the corresponding English characters.
The problem I am facing is that the Cyrillic characters are not recognised under the ISO-8859-1 character set.
On the second Unix server, if I try to read the mapping file, the characters are not read correctly.
The same problem occurs with the file to be transliterated.
One solution we tried was to use the binary (byte) values of the characters and do the transliteration in Java. This did not work.
I also tried to do the transliteration with a sed script file. This approach failed as well.
The Unix server where the flat files need to be transliterated does not support charset ISO-8859-5.
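To make the byte-value idea concrete, here is a minimal sketch of what I think it should look like in Java. It is only an illustration under assumptions, not our actual code: the class name ByteTransliterator and the romanization table are placeholders I made up (the real table would come from the two mapping files), and it assumes the file still contains the original ISO-8859-5 bytes (for example after a binary-mode FTP transfer). It never asks the platform to decode ISO-8859-5, it only looks at the raw byte values: in ISO-8859-5 the letters А..Я are bytes 0xB0..0xCF and а..я are bytes 0xD0..0xEF.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

final class ByteTransliterator {
    // Latin replacements for а..я in ISO-8859-5 byte order (0xD0..0xEF).
    // This romanization is an assumption; replace it with the mapping-file contents.
    // The hard sign and soft sign (index 26 and 28) are simply dropped here.
    private static final String[] LOWER = {
        "a", "b", "v", "g", "d", "e", "zh", "z", "i", "j", "k", "l", "m", "n", "o", "p",
        "r", "s", "t", "u", "f", "kh", "ts", "ch", "sh", "shch", "", "y", "", "e", "yu", "ya"
    };

    // Upper-case the first letter of a replacement, e.g. "zh" becomes "Zh".
    private static String capitalize(String s) {
        if (s.isEmpty()) {
            return s;
        }
        return Character.toUpperCase(s.charAt(0)) + s.substring(1);
    }

    // Transliterate raw ISO-8859-5 bytes to an ASCII string without any charset decoding.
    static String transliterate(byte[] input) {
        StringBuilder out = new StringBuilder(input.length);
        for (byte b : input) {
            int u = b & 0xFF;                       // treat the byte as unsigned
            if (u < 0x80) {
                out.append((char) u);               // plain 7-bit ASCII passes through
            } else if (u >= 0xD0 && u <= 0xEF) {
                out.append(LOWER[u - 0xD0]);        // а..я
            } else if (u >= 0xB0 && u <= 0xCF) {
                out.append(capitalize(LOWER[u - 0xB0])); // А..Я
            } else if (u == 0xF1) {
                out.append("yo");                   // ё
            } else if (u == 0xA1) {
                out.append("Yo");                   // Ё
            } else {
                out.append('?');                    // anything not covered by the table
            }
        }
        return out.toString();
    }

    public static void main(String[] args) throws IOException {
        if (args.length == 2) {
            // Read the input as raw bytes so the server locale never gets involved.
            byte[] raw = Files.readAllBytes(Paths.get(args[0]));
            String ascii = transliterate(raw);
            Files.write(Paths.get(args[1]), ascii.getBytes(StandardCharsets.US_ASCII));
        } else {
            // Demo: the four ISO-8859-5 bytes of "пред".
            byte[] demo = {(byte) 0xDF, (byte) 0xE0, (byte) 0xD5, (byte) 0xD4};
            System.out.println(transliterate(demo));
        }
    }
}
```

After compiling, this would be run as "java ByteTransliterator input.txt output.txt". If the bytes in the file are no longer the ISO-8859-5 values (for example because something re-encoded the file during transfer), a table like this cannot work, which may be part of what went wrong for us.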
Please let me know if anyone has worked on character transliteration.
Thanks a lot in advance.