j_stone
asked on
Visual Basic spell check with a twist / an OCR problem
I know that you can use the Microsoft Word spell checker in Visual Basic, but I am not sure that what I need to do is possible:
I am working on an OCR application that recognizes forms based on the text on them and bookmarks them in a PDF.
The program works pretty well on high-quality scans, but when the scan quality degrades, so does the output of the OCR engine I am using (MODI).
I need a way to automatically replace words that are not in the dictionary with suggested words of equal length (these seem to be the closest to the actual pre-OCR words) or, failing that, with the first suggested word.
Any idea how I can do spell checking and replacing without user intervention? (The text isn't saved anywhere; it is just used as a tool for recognizing a given page.) I have looked for OCR programs that have this feature, and LEADTOOLS seems to do what I want, with better OCR than MODI, but I don't really have several thousand dollars to spend on a hobby program.
So what I am looking for is a spell checker that I can force matches on, or a free OCR program that will do best matching.
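To make the "forced match" idea concrete, here is a rough sketch in Python (used only for illustration; the thread itself is about Visual Basic). It assumes you have your own word list, and it uses the standard library's `difflib` as a stand-in for a real spell checker. The name `force_match` and the 0.6 similarity cutoff are made up for this example and would need tuning on real OCR output.

```python
import difflib

def force_match(word, dictionary, cutoff=0.6):
    """Return `word` if it is known, otherwise the closest dictionary word.

    Equal-length candidates are tried first; if none is similar enough,
    the whole dictionary is searched. `cutoff` is an assumed threshold.
    """
    lower = word.lower()
    if lower in dictionary:
        return word
    # Prefer candidates with the same number of characters as the OCR'ed word.
    same_length = sorted(w for w in dictionary if len(w) == len(lower))
    matches = difflib.get_close_matches(lower, same_length, n=1, cutoff=cutoff)
    if not matches:
        # Fall back to the best match of any length.
        matches = difflib.get_close_matches(lower, sorted(dictionary), n=1, cutoff=cutoff)
    # If nothing is close enough, leave the word untouched.
    return matches[0] if matches else word

words = {"invoice", "number", "total", "date"}
fixed = [force_match(w, words) for w in "inv0ice numbr tota1".split()]
```

No user interaction is needed: every unknown word is either replaced by its best match or left as it was.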
ASKER
I can already spell check the document, but it requires user intervention, which really won't work for a 150-page TIFF file... your code seems to require the same interaction.
When it comes up with a list of 20 suggestions how do you suggest one gets forced? Just take the first one?
Perhaps you can duplicate the AutoCorrect options: create a dictionary of AutoCorrect items. This will involve a learning period: as you spell check OCR output, add the common, repeatable errors to the AutoCorrect dictionary. Then walk through your document word by word, replacing anything found in the AutoCorrect keys with the corresponding AutoCorrect value.
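A minimal sketch of that AutoCorrect-style pass, written in Python for illustration rather than Visual Basic. The table entries and the name `apply_autocorrect` are hypothetical; in practice the table would be built up from errors you actually see in your OCR output.

```python
import re

# Hypothetical table: known OCR misreads (keys) mapped to the intended words.
autocorrect = {
    "lnvoice": "invoice",
    "tota1": "total",
}

def apply_autocorrect(text, table):
    """Replace every word found in `table` with its corrected value."""
    def fix(match):
        word = match.group(0)
        # Lookup is case-insensitive; the replacement takes the table's casing.
        return table.get(word.lower(), word)
    # Treat runs of letters and digits as words so "tota1" is seen as one token.
    return re.sub(r"[A-Za-z0-9]+", fix, text)

cleaned = apply_autocorrect("lnvoice tota1: 42", autocorrect)
```

This only fixes errors that are already in the table, so it works best for mistakes the OCR engine makes repeatedly.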
ASKER
Corey2:
> When it comes up with a list of 20 suggestions how do you suggest one gets forced? Just take the first one?
Either the first choice, or the first choice with the same number of characters (this seems to give the more correct answer).
The program is supposed to represent phrases, so if a word is not in the dictionary, it was most likely not OCR'ed correctly.
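The selection rule described above can be sketched as follows (Python for illustration). The `suggestions` list is assumed to come, in order, from whatever spell checker is in use; with Word automation that would presumably be the collection returned by `GetSpellingSuggestions`. The function name `pick_replacement` is made up for this example.

```python
def pick_replacement(word, suggestions):
    """Pick a replacement for a misrecognized word without user input.

    Rule: take the first suggestion with the same number of characters as
    `word`; if there is none, take the first suggestion; if the list is
    empty, keep the word as it is.
    """
    for candidate in suggestions:
        if len(candidate) == len(word):
            return candidate
    return suggestions[0] if suggestions else word

# Equal-length suggestion exists: it wins even though it is not listed first.
a = pick_replacement("c1ient", ["cent", "client", "clients"])
# No equal-length suggestion: fall back to the first one.
b = pick_replacement("adclress", ["address", "addresses"])
```

Run this only on words the spell checker flags as unknown, so correctly recognized words are never touched.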
ASKER
Just got a chance to try this tonight... found the code hidden in what c0ldfyr3 wrote.
corey2