Solved

OCR software not working well - need something GOOD!

Posted on 2010-09-16
5
857 Views
Last Modified: 2013-12-27
Because of the number of legal/sensitive documents that need to be OCRed I will need something "good" that actually "works." I tried using the 30 day trial of Foxit Phantom (/w OCR package) but it didnt do a very good job.

However, this might be the issue. If it was originally made and designed with PDF, it OCRes perfectly.
BUT if its a hardcopy document, or either a hardcopy scanned into a jpg or PDF and then OCRed... it turns out horrible.

It doesnt keep the formatting, there is all kinds of huge white spaces that have to be formatted/fixed, and it doesnt keep any of the fonts or font sizes or even bolded characters.

Any genius recommend something GOOD and actually works as intended? Also if it has a 30 day trial too that would be nice (that way I can make sure it actually works!).

Thank you geniuses!
HappyT
0
Comment
Question by:TheHappyTech
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
  • 2
5 Comments
 
LVL 31

Accepted Solution

by:
captain earned 400 total points
ID: 33696471
Hi

I would stick with Acrobat. It simply works.

trial:
http://www.adobe.com/products/acrobatpro/tryout.html

capt.
0
 

Author Comment

by:TheHappyTech
ID: 33697376
One word... "WOW"

It really works. Not only did it match take the text that was slightly crooked on the page (cause of the way it was scanned) and straightened it, it even recognized big fonts and bold etc.

I think this might be it! (Even though its expensive....)
0
 
LVL 31

Expert Comment

by:captain
ID: 33697500
I always look at it from the perspective of time cost vs. One- off investment.

Acrobat really is one, if not the best out there. It is wide spread hence well supported and it usually saves you time being fast and doing the job well.

I guess it comes down to how often you need to do such a task, but from your previous experience you already know the time cost on working with products that do not perform well.

As you mention 'numbers' of documents and that you qill be unlikely to need toupgrade from v9 for a good 2 years for the purposes you use it for, it may be cost effective.

Hth
capt.
0
 
LVL 11

Assisted Solution

by:Amila Hendahewa
Amila Hendahewa earned 100 total points
ID: 33706818
0
 

Author Closing Comment

by:TheHappyTech
ID: 33845449
Works!
0

Featured Post

Free Tool: Path Explorer

An intuitive utility to help find the CSS path to UI elements on a webpage. These paths are used frequently in a variety of front-end development and QA automation tasks.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

The Adobe PDF proprietary file format is recognized as secure and formulated. But these PDF files are also prone to corruption and any external threat like virus attacks, improper storage can hit PDF file integrity.This type of damages can make cruc…
In a previously published article (http://www.experts-exchange.com/articles/10331/Automatic-Duplex-Scanning-in-PaperPort-Versions-11-12-14.html) here at Experts Exchange, I explained how to achieve duplex (double-sided) scanning in Nuance's PaperPor…
In this fourth video of the Xpdf series, we discuss and demonstrate the PDFinfo utility, which retrieves the contents of a PDF's Info Dictionary, as well as some other information, including the page count. We show how to isolate the page count in a…
We often encounter PDF files that are pure images, that is, they do not have text characters, but instead contain only raster graphics. The most common causes of this are document scanning software and faxing software/services that create image-only…

624 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question