What software, apart from Adobe Acrobat Pro, will convert a regular PDF to a searchable PDF?

What software, preferably open source, will convert a regular PDF to a searchable PDF?
Asked by: 100questions
 
aadih commented:
Nitro PDF Professional.  (Paid and not Open Source.)
 
Tim Phillips commented:
Here is a website that will do 15 images for free: http://www.onlineocr.net/
 
Joe Winograd, EE MVE 2015 & 2016, Developer, commented:
Lots of software will do this, some open source, many not. One commercial (not open source) product is Power PDF. Here's an EE article on how to do it via a batch command line:
http://www.experts-exchange.com/Web_Development/Document_Imaging/A_13696-Batch-Conversion-of-PDF-and-TIFF-files-via-Command-Line-Interface.html

Here's an EE 5-minute video Micro Tutorial on how to do it via a Watched Folder:
http://www.experts-exchange.com/Web_Development/Document_Imaging/VP_235.html

An excellent option for a free (but not open source) solution is PDF-XChange Editor:
http://www.tracker-software.com/product/pdf-xchange-editor

They also have a PRO (non-free) version, but the free version does everything you need — specifically, OCR.

For other free and/or open source tools, here are some for you to consider and experiment with:

(1) Tesseract OCR Engine, an open source product now maintained by Google:
http://code.google.com/p/tesseract-ocr/

It has numerous add-ons:
http://code.google.com/p/tesseract-ocr/wiki/AddOns

(2) FreeOCR, which uses a compiled version of the Tesseract engine:
http://www.paperfile.net/

(3) GOCR/JOCR, an open source OCR package developed under the GNU General Public License:
http://jocr.sourceforge.net/

(4) OCR Desktop, which is not open source, but is free for personal use (needs to be registered in order to turn off popups and advertising):
http://www.ocrtools.com/fi/prdOCRFree.aspx

(5) SimpleOCR, which is not open source, but is free, with both an end-user version:
http://www.simpleocr.com/

and a royalty-free SDK:
http://www.simpleocr.com/Info.asp#SDK

(6) Boxoft Free OCR (I use several Boxoft free tools):

http://www.boxoft.com/free-ocr/

(7) Google Drive/Docs has an option to perform OCR on uploaded files, but the last time I tried it (a while ago, so it might be better now), the resulting PDF did not hide the text layer, so the file looked ugly.
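
For the Tesseract route in item (1), a minimal sketch of the command-line call, wrapped in Python, might look like the following. It assumes a Tesseract build recent enough to support the `pdf` output config, that the `tesseract` executable is on the PATH, and that each PDF page has already been rendered to an image by some other tool, since Tesseract reads images rather than PDFs. The function names and file names are hypothetical, chosen only for illustration.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, output_base, language="eng"):
    """Build the argument list for: tesseract IMAGE OUTPUT_BASE -l LANG pdf

    The trailing "pdf" is the Tesseract config that requests PDF output
    with a text layer instead of plain text.
    """
    return ["tesseract", str(image_path), str(output_base), "-l", language, "pdf"]


def make_searchable_pdf(image_path, output_base, language="eng"):
    """Run Tesseract on one page image and return the path of the PDF it writes.

    Assumes the tesseract executable is installed and on the PATH.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    command = build_tesseract_command(image_path, output_base, language)
    # check=True raises CalledProcessError if Tesseract exits with an error.
    subprocess.run(command, check=True)
    # Tesseract appends ".pdf" to the output base name itself.
    return Path(f"{output_base}.pdf")
```

Under those assumptions, calling `make_searchable_pdf("page-001.png", "page-001")` should leave the result in `page-001.pdf`. Multi-page documents would need one call per page plus a separate merge step, which is not shown here.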

Regards, Joe
 
Iris vH commented:
You can also try Udocx. It'll OCR & route documents when you scan. www.udocx.com
 
100questions (Author) commented:
Thank you. I have not tried any of these yet; however, hopefully they do.

100questions (Author) commented:
Have not tried this; however, hopefully it works.

Joe Winograd, EE MVE 2015 & 2016, Developer, commented:
> I have not tried any of these yet; however, hopefully they do.
> Have not tried this; however, hopefully it works.

If you haven't tried anything, you should not close the question. When answers are marked as Accepted and Assisted Solutions, the question goes into EE's PAQ archive (Previously Asked Question – the database of all questions with solutions). EE members have an expectation that questions in the PAQ have a solution verified by the asker. In addition, three other experts took the time and effort to provide possible solutions for you. It is unfair to those experts to accept another answer where you haven't even tried the recommended solution. Regards, Joe
 
100questions (Author) commented:
PDF-XChange works; however, you need to be careful: when it opens a PDF, you may not see the whole page.
 
Joe Winograd, EE MVE 2015 & 2016, Developer, commented:
I'm glad that PDF-XChange Editor works for you. To see the whole page, click the View menu, then Zoom > Fit Page. Alternatively, use the keyboard shortcut Ctrl-0, or click the Fit Page icon on the toolbar.
Regards, Joe
 
Joe Winograd, EE MVE 2015 & 2016, Developer, commented:
One other thing. I see that you graded the answer a B. You may not be familiar with EE's grading system, so I recommend that you read this article:
http://support.experts-exchange.com/customer/portal/articles/481419

A B grade is the exception. As the article says, an A grade "should be the default grade awarded unless the answer is deficient." Also note the comment that an "asker should explain why a B grade was awarded." The point is, if an answer is deficient (such as lacking some information to resolve the problem), it's helpful for the Experts to know how/why it is deficient, and to give them a chance to improve the answer (as in this case, where you said the whole page doesn't show, but I was able to explain how to do that). This is a critical feature of the EE community.

In terms of this particular case, I don't think the answer is deficient in any way and, thus, it deserves an A grade, but, of course, that's your call. However, if you choose to leave it a B, then it's your obligation to explain why you awarded a non-A grade. Thanks, Joe
 
100questions (Author) commented:
Then please change this to an A grade.  Thank you.
 
Joe Winograd, EE MVE 2015 & 2016, Developer, commented:
eenookami,
Thank you for explaining grading at EE, reviewing the grade in this case, concluding that there's no reason it shouldn't be an A, and then changing it. I appreciate it!

100questions,
Thank you for agreeing with eenookami and me that it is worthy of an A.

Regards, Joe