Transposing text from a scanned document

G'day All,

I have to build an application that will allow a user to take a scanned document and transpose the text on the document into a structured format. The scanned document will be a hand-written survey form. I have to devise a way to allow the user to easily navigate through the document (through each of the form fields) and re-type the handwritten text. Basically the user would "tab" to each of the fields and the tabbing process would change the display to show the corresponding field text, and zoom the document as appropriate so the user can easily read the text. Given that I know the exact structure of the survey form, I was thinking of scanning the document as a PDF and programmatically adding bookmarks at runtime (using iText), or scanning as a TIFF and zooming to a particular X,Y coordinate.

Can anyone offer any further suggestions as to how I might accomplish this task? I'm not looking for code, rather some high-level ideas and possibly some Java libraries that may help. Thanks very much.
stuart-the-legendAsked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

CEHJCommented:
>>I was thinking of scanning the document as a PDF

Is that possible?
0
stuart-the-legendAuthor Commented:
Absolutely. My scanner will do that for me now.
0
CEHJCommented:
Yes, but i mean as a pdf *document*. Producing a pdf file is trivial - it'd be an image essentially
0
Cloud Class® Course: MCSA MCSE Windows Server 2012

This course teaches how to install and configure Windows Server 2012 R2.  It is the first step on your path to becoming a Microsoft Certified Solutions Expert (MCSE).

stuart-the-legendAuthor Commented:
Ah, sorry, misunderstood you. Yes, the PDF basically contains an image representation of the scanned form. I have toyed with idea of OCR'ing the form during the scan process, but I don't think the technology is capable of dealing with hand-written script (not if my handwriting is anything to go by).
0
CEHJCommented:
I don't see how you can do anything at all programmatically *unless* you can OCR it ...
0
sancjCommented:
Sounds like your looking for some kind of scan/index front end app. Look at Kofax Capture, EMC Captiva or the like. You can create a template and it will zoom to that area on each tab to the linked field, which the operator can type in the read info. There is also some ICR capability in Kofax as well.  Depending on the specifics AUTOSTORE QuickCapturePro, from  Notable Solutions may work too.
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Java

From novice to tech pro — start learning today.