multi document OCR converter

We have a multitude of files we need to OCR, some are images, others PDF. I can find tools which can do these one at a time, but need something that could do hundreds if you point the software in the direction of a folder full. I do not trust online converters as the docs may contain sensitive information. Please let me know of anything that may meet our criteria.
LVL 3
pma111Asked:
Who is Participating?
 
masnrockCommented:
You could take a look at something like CVISION
0
 
Joe Winograd, Fellow&MVEDeveloperCommented:
Hi pma,
This EE article explains how to do it:
Batch Conversion of PDF, TIFF, and Other Image Formats via Command Line Interface to PDF, PDF Searchable, and TIFF with Power PDF Advanced

It meets all of your criteria, namely:
• can OCR a multitude of files in batch
• input files can be PDFs and many other image formats
• local installation of software, i.e., not online

In addition, in terms of your comment to "point the software in the direction of a folder full", it has a Watched Folder feature that I discuss in this 5-minute EE video Micro Tutorial:
Convert Scanned Image-Only PDF Files to PDF Searchable Image Files via OCR with Power PDF Advanced

Also, it has other advanced features that you may find useful, such as Bates Numbering/Stamping, discussed in this other 5-minute EE video Micro Tutorial:
Bates Stamping/Numbering of PDF Files with Power PDF Advanced

As a disclaimer, I want to emphasize that I have no affiliation with this company and no financial interest in it whatsoever. I am simply a happy user/customer. Regards, Joe
0
 
viki2000Commented:
Along the years I have tried several important OCR software and not all meet expectations as the quality to identify the characters.
One of the best, at which I remained, is Abbyy FineReader.
I never needed huge number of files as your request, but Abbyy has the batch converter, only that I never tried it personally:
https://www.abbyy.com/en-eu/finereader/automate-conversion/

You may try the 30 days trial before you buy it.
0
 
Joe Winograd, Fellow&MVEDeveloperCommented:
Hi pma111,
Following up on viki2000's comment, I agree that ABBYY FineReader is a very good product. There are, indeed, many good OCR packages out there. Here's a relatively recent post that I made at EE discussing several of them (including ABBYY FineReader):
https://www.experts-exchange.com/questions/29056027/Variability-in-quality-and-accuracy-of-OCR.html#a42290281

Btw, I have many products that perform OCR, including ABBYY FineReader, Adobe Acrobat, OmniPage, PaperPort, PDF-XChange Editor, Power PDF, and others. If you'd like to post one of your PDF files, being careful to ensure that it doesn't have any private/sensitive information in it, I'll use several of my OCR packages on it to give you a comparison of OCR accuracy. Of course, the other issue for you is to "do hundreds" at a time, or a "folder full" — some OCR packages can do that, some can't. Regards, Joe
0
 
Joe Winograd, Fellow&MVEDeveloperCommented:
This is one of those questions that asks for suggestions of a product to perform a specific function. As such, there are no right or wrong answers, as long as the recommended product performs the desired function. In this case, I selected the posts where appropriate products were recommended. There is no "Best" answer in this case, so I selected the first post with a recommended product as the Accepted Solution and all the others as Assisted Solutions, but I split the points evenly among the participating experts. Regards, Joe
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.