[Last Call] Learn how to a build a cloud-first strategyRegister Now

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 139
  • Last Modified:

Windows IME .dic file format

I would like to understand the encoding of Window's dic file.
I would like to be able to read/write/understand the data these file contains. I am assuming that these files are some sort of database and that no matter the language it is a similar format.

If you have installed Japanese on your computer then one such file will exist in your windows directory:

%winroot%\IME\IMJP8_1\Dicts\IMJPTK.DIC


Thanks
ff


(I have already seen the following pages and they do not help me much:
http://msdn.microsoft.com/library/en-us/wcemain4/html/cmtskCreatingDictionaryFile.asp?frame=true

http://msdn.microsoft.com/library/en-us/wcemain4/html/cmrefJapaneseIME30Part-of-SpeechCodes.asp?frame=true
)

============================================================
Deleted, with no points refunded
12/25/2004 12:35AM PST

modulo
Community Support Moderator
============================================================
0
funkyfinger
Asked:
funkyfinger
  • 2
1 Solution
 
funkyfingerAuthor Commented:
Btw,

As far as I can tell the correct name for this file is a "binary IME dictionary file".

I believe that these files are not used for window's spell check.
0
 
virmaiorCommented:
They are not used in spellcheck

I'm not  sure what the format is but if I knew it then I would try to port Jdict over since  that would just be awesome for when I'm typing in kana
0
 
funkyfingerAuthor Commented:
This is a round about way of doing it but it will get you (me) the information you (I) want. It will not contain all the data however, you (I.. ok I'm going to stop talking in first person to myself from this point, because obviously I'm not reading it and hopefully you find this information valuable.) .. you will not get the type of word contained by the Japanese character (the database also contains if the word is a noun, verb or place) but every thing else even the radical incoding.
Here's how:
Start Character Map, select advanced view, start Spy++, use Visual Basic 6.0 (because I know how but you hate .Net) and the SendMessage API to get the text cotained within the list boxes.
Use the Group By select box to select radicals, kana, etc...
This will popup another window with the title "Group By"
Write a program that selects each item listed in this window, this control might not be a select list so using an API that simulates a mouse click might be a eaiser (but longer) process. Next use the WM_gettext message to get the sub grouping of data from the character map window. Remember that each character is a wide character and that Unicode is not 2 bytes.
Alos know that each window might have scroll bars so that is a nastly little problem as well.
The rest you will have to do on your own (store data in DB)
Good Luck
0
 
moduloCommented:
Closed, 400 points refunded.

modulo
Community Support Moderator
Experts Exchange
0

Featured Post

Keep up with what's happening at Experts Exchange!

Sign up to receive Decoded, a new monthly digest with product updates, feature release info, continuing education opportunities, and more.

  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now