<

Watch Convert Scanned Image-Only PDF Files to PDF Searchable Image Files via OCR with Power PDF Advanced

Posted on
50,019 Points
919 Views
1 Endorsement
Last Modified:
Community Pick: Many members of our community have endorsed this article.
Editor's Choice: This article has been selected by our editors as an exceptional contribution.
Experience Level: Beginner
4:55
Joe Winograd
50+ years in computers
Development•Sales
CIO•Document Imaging
EE — FELLOW 2017
MVE 2015,2016,2018
RENOWNED 2018,2019
CERTIFIED GOLD 2020
In this video, we show how to convert an image-only PDF file into a PDF Searchable Image file, that is, a file with both the image (typically from scanning) and text, which is created in an automated fashion with Optical Character Recognition (OCR) software. To do this, we will set up a Watched Folder, such that whenever an image-only PDF file arrives in the Watched Folder, it will automatically be converted to a PDF Searchable Image file. We will achieve this using Power PDF, the newest product from the Document Imaging division of Nuance Communications. There are two editions of Power PDF — Standard and Advanced. The Watched Folder feature is in the Advanced edition only.

Update on 10-Sep-2020

Two points in this update:

(1) As noted in the paragraph above and in the video, Power PDF was, at the time of this video's publication, the newest product from the Document Imaging division of Nuance Communications. Since then, Kofax Inc. has acquired Nuance's Document Imaging division, as discussed in this EE article and video about PaperPort Version 14.7:

Free in-place upgrade of Nuance PaperPort Professional 14.5 to Kofax PaperPort Professional 14.7
Free upgrade of Nuance PaperPort Professional 14.5 to Kofax PaperPort Professional 14.7

(2) Related to the subject of this video, namely, the ability to convert image-only PDFs to searchable PDFs, it is worthwhile to note that PaperPort, also a product from Nuance (now Kofax), can do this, as discussed in this EE article:

PaperPort - How To Create Searchable PDF Files

End of Update

Video Steps

1. Download and install the trial software



Visit the Nuance website at:

http://www.nuance.com/for-business/document-imaging-and-scanning/power-pdf-converter/index.htm

Click the free trial link, which takes you here:

http://www.nuance.com/for-business/imaging-solutions/document-conversion/power-pdf-converter/free-trial/index.htm

Fill out the short form and submit it.

Download the trial software and install it.

Step1.jpg

2. Run the program and invoke the Watched Folder feature



Run the program by clicking Start>All Programs>Nuance Power PDF Advanced>Power PDF Advanced.

Invoke the Watched Folder feature by clicking the Advanced Processing menu, then the drop-down on the Batch Controls ribbon button, then Watched Folder.

Step2.jpg

3. Configure the Watched Folder settings



Tick the Enable Watched Folder box.

Click the Source button and Browse to the folder that you want as the Watched Folder.

Tick the Include Subfolders box if you want them.

Click the Destination button and Browse to the folder where you want to store the converted PDFs.

Click the File Type drop-down and select Searcheable PDF (Nuance's spelling, not mine).

Fill in whatever other options you want, especially what to do with the finished source files.

Step3.jpg

4. Test the process



Copy one or more image-only PDFs into the Watched Folder.

After a short wait, check the Destination Folder to view the searchable PDFs that have been created.

Check the Source folder and the Move to folder specified in the Finished source files settings to make sure that the source files were handled correctly.

Step4.jpg
That's it - you're done! If you find this video to be helpful, please click the thumbs-up icon below. Thank you for watching, Joe

1
0 Comments
How many times do you come across a file format that your computer doesn't recognize? Read on for a great solution I've found and been using myself, which is a lot more than just a file format viewer. Enjoy...
Provides a one size fits all solution for File Dialog calls for all MS office applications. Module contains function fncGetFilePathTRM() which does all the filter building for you. I've used a similar version of this for over 15 years, 1 module impo…