Want to protect your cyber security and still get fast solutions? Ask a secure question today.Go Premium

x
?
Solved

PDF Document Scanner & Text Search Software

Posted on 2011-09-20
7
Medium Priority
?
372 Views
Last Modified: 2012-05-12
Hi I would like to know what the cheapest software available to scan up to 250 pages of a hard copy text book and have it converted to a pdf doc. I need to then be able to electronically search for any text once it has been converted to pdf.
0
Comment
Question by:Frank .S
  • 5
  • 2
7 Comments
 
LVL 57

Expert Comment

by:Joe Winograd, EE MVE 2015&2016
ID: 36570755
I've been using PaperPort for more than 15 years:

http://www.nuance.com/for-individuals/by-product/paperport/index.htm

The latest version is PP14, which just came out on 2-Aug. The main enhancement is cloud support, which you probably don't need. The new version is fairly expensive, but you can get the previous version, which is 12 (yes, they were superstitious and skipped 13), as a download at Newegg for $39.99:

http://www.newegg.com/Product/Product.aspx?Item=N82E168168677800SF

It can automatically make PDF Searchable Image files, meaning that it automatically invokes built-in OCR to create a layer of text (searchable!) which resides in the PDF file along with the scanned image. You can then search it with the All-in-One Search that is built into PaperPort or with any search engine that can index PDFs with text, such as dtSearch (not free), Google Desktop Search (free), X1 (not free), or Windows Search 4 (free).

Btw, the Newegg download is likely to be 12.0. Do not install that. Instead, read my EE article on how to upgrade to 12.1 (free!):

http://www.experts-exchange.com/Web_Development/Document_Imaging/A_6537-PaperPort-Upgrade-How-to-download-and-install-updated-versions-of-PaperPort-11-and-12.html

Regards, Joe
0
 
LVL 57

Expert Comment

by:Joe Winograd, EE MVE 2015&2016
ID: 36570762
I should have added as a disclaimer that I have no affiliation with this company and no financial interest in it whatsoever. I am simply a happy user/customer. Regards, Joe
0
 

Author Comment

by:Frank .S
ID: 36570853
Hi Joe, thanks for your information however Im assuming that i need to buy some type of scanner to use this software?
0
Upgrade your Question Security!

Your question, your audience. Choose who sees your identity—and your question—with question security.

 
LVL 57

Expert Comment

by:Joe Winograd, EE MVE 2015&2016
ID: 36570953
Hi Frank,
Based on your question, I assumed that you already have a scanner and are looking for software that is capable of creating a PDF with searchable text. Is that right? If so, what scanner do you have? If not, then you'll need both hardware (scanner) and software (which often comes bundled with scanners, but isn't always robust, and many times can't create searchable PDFs – hence the need for a third-party package). Regards, Joe
0
 

Author Comment

by:Frank .S
ID: 36571061
Hi Joe, i have a BROTHER MFC 7340 which allows me to scan docs which is fine when Im scanning a small amt of docs, but because I intend to scan hard copy books which are not loose leaf pages, I may need to look at one of those scanners you can buy that you hold the scanner in your hand and just pass it over the doc, what do you think?
0
 
LVL 57

Expert Comment

by:Joe Winograd, EE MVE 2015&2016
ID: 36571153
Frank,

I have a Brother MFC-3820CN, MFC-7820N, and MFC-9840CDW, so I know exactly what you mean – doing a book on the flatbed is no picnic! I looked into book scanners a while ago. My favorite is the TREVENTUS ScanRobot:

http://www.treventus.com/bookscanner_pageturner.html

But I couldn't afford the $100,000 for it. :)  You must look at the videos for this thing – they will knock your socks off:

http://www.treventus.com/products/scanrobotr-20-mds/videos.html

The five videos total nine minutes – trust me, it's worth it!

Now, back to reality. After my wife decided that the ScanRobot was not the best way to spend our life savings, I looked into scanning services that specialize in books. Here are a couple I found (I'm sure there are plenty others):

http://bookscanning.com/
http://www.blueleaf-book-scanning.com/index.html

Even the TREVENTUS folks have a "Scanservice":

http://www.treventus.com/scanservice.html

I also found a really interesting site called Do-It-Yourself Book Scanning:

http://www.diybookscanner.org/

In the end, my books are still sitting on the shelves, un-scanned, so I can't give you any wisdom on what worked and what didn't.

Of course, it you're willing to destroy the original book (not usually the case), you can remove the binding and then put the pages through an ADF. Great, but only if you don't care about the original book. As far as your idea of passing a hand scanner over each page of the book, I think that may be as painful as using a flatbed.

Regards, Joe
0
 
LVL 57

Accepted Solution

by:
Joe Winograd, EE MVE 2015&2016 earned 500 total points
ID: 36574849
Frank,

One other thing. If you really want to give a hand scanner a try, the VuPoint Solutions Magic Wand Portable Scanner (PDS-ST410-VP) looks interesting and is relatively inexpensive:

http://www.amazon.com/VuPoint-Solutions-Portable-Scanner-PDS-ST410-VP/dp/B002R0BFAA

But I'm still having a tough time wrapping my head around this technique for a several hundred page book. Regards, Joe
0

Featured Post

Free Tool: Port Scanner

Check which ports are open to the outside world. Helps make sure that your firewall rules are working as intended.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

PaperPort is a popular document imaging/management product from Nuance Communications (http://www.nuance.com/). It is in widespread use by both individuals (http://www.nuance.com/for-individuals/by-product/paperport/index.htm) and businesses (http:/…
The Adobe PDF proprietary file format is recognized as secure and formulated. But these PDF files are also prone to corruption and any external threat like virus attacks, improper storage can hit PDF file integrity.This type of damages can make cruc…
In this video, we show how to convert an image-only PDF file into a PDF Searchable Image file, that is, a file with both the image (typically from scanning) and text, which is created in an automated fashion with Optical Character Recognition (OCR) …
In this fifth video of the Xpdf series, we discuss and demonstrate the PDFdetach utility, which is able to list and, more importantly, extract attachments that are embedded in PDF files. It does this via a command line interface, making it suitable …
Suggested Courses
Course of the Month10 days, 12 hours left to enroll

572 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question