Solved

PDF Document Scanner & Text Search Software

Posted on 2011-09-20
7
344 Views
Last Modified: 2012-05-12
Hi I would like to know what the cheapest software available to scan up to 250 pages of a hard copy text book and have it converted to a pdf doc. I need to then be able to electronically search for any text once it has been converted to pdf.
0
Comment
Question by:FrankSasso
  • 5
  • 2
7 Comments
 
LVL 51

Expert Comment

by:Joe Winograd, EE MVE
ID: 36570755
I've been using PaperPort for more than 15 years:

http://www.nuance.com/for-individuals/by-product/paperport/index.htm

The latest version is PP14, which just came out on 2-Aug. The main enhancement is cloud support, which you probably don't need. The new version is fairly expensive, but you can get the previous version, which is 12 (yes, they were superstitious and skipped 13), as a download at Newegg for $39.99:

http://www.newegg.com/Product/Product.aspx?Item=N82E168168677800SF

It can automatically make PDF Searchable Image files, meaning that it automatically invokes built-in OCR to create a layer of text (searchable!) which resides in the PDF file along with the scanned image. You can then search it with the All-in-One Search that is built into PaperPort or with any search engine that can index PDFs with text, such as dtSearch (not free), Google Desktop Search (free), X1 (not free), or Windows Search 4 (free).

Btw, the Newegg download is likely to be 12.0. Do not install that. Instead, read my EE article on how to upgrade to 12.1 (free!):

http://www.experts-exchange.com/Web_Development/Document_Imaging/A_6537-PaperPort-Upgrade-How-to-download-and-install-updated-versions-of-PaperPort-11-and-12.html

Regards, Joe
0
 
LVL 51

Expert Comment

by:Joe Winograd, EE MVE
ID: 36570762
I should have added as a disclaimer that I have no affiliation with this company and no financial interest in it whatsoever. I am simply a happy user/customer. Regards, Joe
0
 

Author Comment

by:FrankSasso
ID: 36570853
Hi Joe, thanks for your information however Im assuming that i need to buy some type of scanner to use this software?
0
Maximize Your Threat Intelligence Reporting

Reporting is one of the most important and least talked about aspects of a world-class threat intelligence program. Here’s how to do it right.

 
LVL 51

Expert Comment

by:Joe Winograd, EE MVE
ID: 36570953
Hi Frank,
Based on your question, I assumed that you already have a scanner and are looking for software that is capable of creating a PDF with searchable text. Is that right? If so, what scanner do you have? If not, then you'll need both hardware (scanner) and software (which often comes bundled with scanners, but isn't always robust, and many times can't create searchable PDFs – hence the need for a third-party package). Regards, Joe
0
 

Author Comment

by:FrankSasso
ID: 36571061
Hi Joe, i have a BROTHER MFC 7340 which allows me to scan docs which is fine when Im scanning a small amt of docs, but because I intend to scan hard copy books which are not loose leaf pages, I may need to look at one of those scanners you can buy that you hold the scanner in your hand and just pass it over the doc, what do you think?
0
 
LVL 51

Expert Comment

by:Joe Winograd, EE MVE
ID: 36571153
Frank,

I have a Brother MFC-3820CN, MFC-7820N, and MFC-9840CDW, so I know exactly what you mean – doing a book on the flatbed is no picnic! I looked into book scanners a while ago. My favorite is the TREVENTUS ScanRobot:

http://www.treventus.com/bookscanner_pageturner.html

But I couldn't afford the $100,000 for it. :)  You must look at the videos for this thing – they will knock your socks off:

http://www.treventus.com/products/scanrobotr-20-mds/videos.html

The five videos total nine minutes – trust me, it's worth it!

Now, back to reality. After my wife decided that the ScanRobot was not the best way to spend our life savings, I looked into scanning services that specialize in books. Here are a couple I found (I'm sure there are plenty others):

http://bookscanning.com/
http://www.blueleaf-book-scanning.com/index.html

Even the TREVENTUS folks have a "Scanservice":

http://www.treventus.com/scanservice.html

I also found a really interesting site called Do-It-Yourself Book Scanning:

http://www.diybookscanner.org/

In the end, my books are still sitting on the shelves, un-scanned, so I can't give you any wisdom on what worked and what didn't.

Of course, it you're willing to destroy the original book (not usually the case), you can remove the binding and then put the pages through an ADF. Great, but only if you don't care about the original book. As far as your idea of passing a hand scanner over each page of the book, I think that may be as painful as using a flatbed.

Regards, Joe
0
 
LVL 51

Accepted Solution

by:
Joe Winograd, EE MVE earned 125 total points
ID: 36574849
Frank,

One other thing. If you really want to give a hand scanner a try, the VuPoint Solutions Magic Wand Portable Scanner (PDS-ST410-VP) looks interesting and is relatively inexpensive:

http://www.amazon.com/VuPoint-Solutions-Portable-Scanner-PDS-ST410-VP/dp/B002R0BFAA

But I'm still having a tough time wrapping my head around this technique for a several hundred page book. Regards, Joe
0

Featured Post

Free Trending Threat Insights Every Day

Enhance your security with threat intelligence from the web. Get trending threat insights on hackers, exploits, and suspicious IP addresses delivered to your inbox with our free Cyber Daily.

Join & Write a Comment

The ability to edit PDF documents can be useful, however it may not be a straight forward process. Many non-technical people don't realise that a PDF document is basically an image rather than a text file, even if it contains nothing but text. If…
In a previous article published here at Experts Exchange, Signature Image with Transparent Background (http://www.experts-exchange.com/Web_Development/Document_Imaging/A_12380-Signature-Image-with-Transparent-Background.html), I explained how to cre…
In this second video of the Xpdf series, we discuss and demonstrate the PDFimages utility, which, in a single command, is able to extract all the images from a PDF file and save each one in a separate image file (PBM, PPM, or JPG). Download and inst…
In this third video of the Xpdf series, we discuss and demonstrate the PDFtoText utility, which converts PDF files into plain text files. Download and install the software.: You may have already downloaded and installed the Xpdf tools while watching…

707 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

15 Experts available now in Live!

Get 1:1 Help Now