OCR software to scan and make live text from an existing PDF file

Posted on 2011-10-07
Last Modified: 2012-05-12
Is there any software out there that can import an existing PDF and convert it to live text for editing?
Question by:ZionTech1
    LVL 11

    Expert Comment

    LVL 51

    Accepted Solution

    A couple of clarifications. Many PDF files already contain text, so they don't need to be OCR'ed to create the text – it's already there. The only PDF files that need to be OCR'ed to create text are those containing just images. So if you have an existing PDF file with text, the real issue is how to edit it. For that you'll need either a product that can directly edit a PDF file or a product that converts the PDF file into an editable format, such as a Word file (or a low-end approach to the latter is to use a PDF reader, like Adobe Reader, and simply copy/paste the text).

    If you need OCR to create the text from image-only PDF files, there are many good packages out there. Two highly-regarded ones are ABBYY FineReader and Nuance's OmniPage:

    Another approach is to use an imaging/scanning package, such as Nuance's PaperPort:

    PaperPort can take an image-only PDF and via a <Save As> command automatically invoke OCR on it and create a PDF Searchable Image file, which contains both the image and a layer of text created by the OCR (btw, under the covers, PaperPort utilizes OmniPage OCR). The latest version is PP14, which just came out in August. The main enhancement is cloud support, which you probably don't need. The new version is fairly expensive, but you can get the previous version, which is 12 (yes, they were superstitious and skipped 13), as a download at Newegg for $39.99:

    The Newegg download is likely to be 12.0. Do not install that. Instead, read my EE article on how to upgrade to 12.1 (free!):

    As a disclaimer, I want to emphasize that I have no affiliation with any companies mentioned in this post, or any financial interest in them whatsoever. Regards, Joe
    LVL 30

    Expert Comment

    by:Paul Sauvé
    If you already have an all-in-one printer (print/fax/scan/photocopy) or a stand-alone scanner, then you may have received this soptware with your hardware.

    For example, I have a Brother Multi-functional printer and it came with PaperPort which allows me to scan pdf files wwith text images in them.

    Author Comment

    M3rc74 and paulsauve, thank you for answering a question I did not ask. Your efforts are much appreciated.

    The question was "Is there any software out there that can import an EXISTING PDF and convert it to live text for editing". The importance on the EXISTING PDF part. Meaning that the PDF was scanned in as an image or from any other source.

    joewinograd: has completely and thoroughly answered my question. Thank you.

    Author Closing Comment

    LVL 30

    Expert Comment

    by:Paul Sauvé
    M3rc74 and paulsauve, thank you for answering a question I did not ask

    Excuse me: "For example, I have a Brother Multi-functional printer and it came with PaperPort which allows me to scan pdf files with text images in them."

    I guess I'm a little dazed and confused, especially since I mentioned the same software as joewinograd - i.e. PaperPort.

    Please remember that we are volunteers and we do this for the pleasure of helping out. I don't think your sarcasm is appropriate. It's bit like cutting someone off in your car then yelling them! I'm not asking for points, I'm asking you to be polite!  I really don't need the grief.

    Thank you for your understanding.

    Featured Post

    What Security Threats Are You Missing?

    Enhance your security with threat intelligence from the web. Get trending threat insights on hackers, exploits, and suspicious IP addresses delivered to your inbox with our free Cyber Daily.

    Join & Write a Comment

    *Adobe Acrobat 9 was used for this article. Particular steps may vary depending on software versions. 1. Create a framework of your form in Word, leaving space where you’d ultimately like the Adobe fields to appear.  (Note: I use the blank lines …
    This article discusses the PaperPort 14 Scanner Connection Tool, which Nuance provides at no charge in order to fix scanning problems in Windows 8. Furthermore, users of PaperPort 14 in Windows 7 and Windows 10 have reported that the tool works in t…
    This video is the second in a two-part series that discusses PaperPort's "Send To Bar" feature . The first video tutorial ( explains the purpose of the Send To Bar, how to use it, and how to hide unwanted …
    Learn how to automatically add page numbers in your next InDesign project. This can be very helpful in multi-page books and magazines that you are designing. Make sure your Pages window visible.:  In the document you wish to add page numbers to. Act…

    731 members asked questions and received personalized solutions in the past 7 days.

    Join the community of 500,000 technology professionals and ask your questions.

    Join & Ask a Question

    Need Help in Real-Time?

    Connect with top rated Experts

    17 Experts available now in Live!

    Get 1:1 Help Now