Link to home
Start Free TrialLog in
Avatar of rossc
rossc

asked on

OCR Control

Is there an ActiveX control available that will return the text from a TIF file?  I have a list of TIF files that I need to OCR into text.  The text will be used in a VB application to rename the original TIF file.  The file names are then stored in a database which can be searched.  This is a very simple document management system.
Avatar of muffinthedog
muffinthedog

rossc,
    scuse my ignorance for asking exactly you are trying to do...

You trying to get the PS language info or something?  If so I have code that could easily be converted to VB.
Avatar of rossc

ASKER

muffinthedog,

The documents I am scanning to TIF files have a customer number and invoice number printed on them.  I use the customer number
and invoice number as part of a file name I assign to the scanned TIF file.  The new name of the TIF file is stored in a database which allows lookups on all customer invoices if you know the customer number or invoice number.  The entire path to each scanned document includes the document type, the date, the customer number, and the invoice number.  The date and document type are known before scanning, the customer number and invoice number are unique for each document and must be obtained by OCRing each scanned image.
ASKER CERTIFIED SOLUTION
Avatar of mark2150
mark2150

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
rossc,
    see if I got this right, scanning printed invoices to build an image database of sales? Penciled invoices? No other data available on sales?

The proposed answer is probably the best under the circumstances without a considerable investment in equipment, which, given the task, would seem to be out of the question for whatever organization this is for.

dum qwestion.. funds available to job out the scanning, or cost prohibited?

Thought you might have some format where you could just read the headers and write the Date/Time,ImageDescription... of the files.
Avatar of rossc

ASKER

The solution you describe is probably what I will end up doing.  
A barcode wedge scanner could be used to save some data entry time but I would like to avoid the data entry entirely if possible.  Did you try any programs that OCR barcodes?  The accuracy of OCRing barcodes should be pretty good.

Also, is there an advantage to using .PDF files instead of .TIF files?

Thanks for any advice.
.PDF's are easier to create/manage in my book. .PDF goes straight to the web without modification and are fairly compact depending on content. .PDF reader is *free* so your only software cost is at point of scan.

It's kind of silly to OCR a barcode. *WAND* it and be done with it.

Make up "scan sheets" and laminate them. Have the commonest responses to your software on them and you'll get good data entry speed. If your forms have barcodes on them mount the scanner looking down and as you wave the document under the scanner it'll automatically read.

The big advantage to .PDF was that we hooked up the "copier" program on the scanner to print to .PDF. Then all the user had to do was set in a huge stack of docs and click on the COPY button. After a bit they're prompted for a file name for the batch and that, as they say, is that.

M

email me at:

mark_lambert@ntsc.navy.mil

and I'll send you a copy of the module that pulls out the notes from an Adobe .PDF and converts that data into an Access database.

M