Solved

PDF mega scan and auto rename

Posted on 2012-03-23
9
735 Views
Last Modified: 2012-04-02
I am interested to know if and how I could scan 4,000 completed hard copy forms to PDF (all in one mega-scan) that look like this...

https://docs.google.com/file/d/0B9Ga3bzjO-rUVUVOZnhyanpTU213WWdyQThXTmZRZw/edit

and end up with 4,000 individual PDFs auto renamed by record number (i.e. record number 22 = 22.pdf).
0
Comment
Question by:K_Deutsch
  • 6
  • 3
9 Comments
 
LVL 52

Accepted Solution

by:
Joe Winograd, EE MVE earned 500 total points
ID: 37759008
OK, me again. :)   And IrfanView again. :)

File>Select Scan/TWAIN Source...

Pick your scanner/driver

File>Acquire/Batch scanning

Select the <Multiple images (Batch Mode)> button

Set <Output file name> to blank

Set starting counter to 1

Set increment to 1

Set number of digits to 4

Set <Destination directory> to whatever you want

Set <Save as> to PDF

It should look like this:
IrfanView-multi-image-scanClick the Options button

Click General tab

For the <Preview of PDF during save operation> option, select <not needed>

Make sure <Save all pages from original image is checked> and <Open PDF after saving> is not checked

It should look like this:
IrfanView-preview-not-neededClick OK, OK, and your TWAIN or WIA scanning dialog will appear

Perform the scan

IrfanView will create the 4,000 xxxx.pdf files in the folder you chose. If the scan gets interrupted and you need to restart, you can pick up where you left off by setting the <Starting counter> to whatever you need in the <Acquire/Batch Scanning> screen shown above. When you're done scanning, IrfanView will ask if you want to save the image changes:
IrfanView-save-image-changes-say-NO Say NO! The xxxx.pdf files have already been saved. Regards, Joe
0
 
LVL 52

Expert Comment

by:Joe Winograd, EE MVE
ID: 37759070
Btw, this assumes that the record numbers are in order. It is simply naming the records from <0001.pdf> to <4000.pdf> as the scanning occurs. In other words, this process is not reading the content, i.e., it is not OCR'ing the record number and putting it in the file name. That's a whole different level of difficulty! Regards, Joe
0
 
LVL 52

Expert Comment

by:Joe Winograd, EE MVE
ID: 37759081
My comment above said, "When you're done scanning, IrfanView will ask if you want to save the image changes:". That's not quite true. It will ask you that when you exit IrfanView. In any case, say NO!
0
 

Author Comment

by:K_Deutsch
ID: 37759082
I feel bad because my explanation has been poor and incomplete. What really is happening here is that we are sending out a total of 4,000 "response requested" type forms, each with a unique record number. We of course won't get all 4,000 back. After I scan-in, I am wanting to have an automated process that goes through and recognizes the record number somehow and renames the pdf appropriately.
0
Best Practices: Disaster Recovery Testing

Besides backup, any IT division should have a disaster recovery plan. You will find a few tips below relating to the development of such a plan and to what issues one should pay special attention in the course of backup planning.

 
LVL 52

Expert Comment

by:Joe Winograd, EE MVE
ID: 37759345
Ah, as I indicated, whole different animal! :)   You're going to need a scanning/OCR package capable of OCR'ing that portion of the form and then saving the scanned document with the file name that was OCR'ed. Two excellent OCR packages which can probably do it (but I can't personally attest to it) are ABBYY FineReader and Nuance's OmniPage:

http://www.abbyy.com/
http://www.nuance.com/for-business/by-product/omnipage/index.htm

I'll give it some more thought, but start with those two. This is not going to be as simple as your first question. :)   Regards, Joe
0
 

Author Comment

by:K_Deutsch
ID: 37778655
I have not abandoned this. The software products you mentioned are too large in scope, I think, though I did get a trial of FineReader. I may poke around in that, but beyond that, we are currently using KnowledgeLake Capture. Onsite IT folks say it may be the answer, but I may not have time to wait on them.
0
 
LVL 52

Expert Comment

by:Joe Winograd, EE MVE
ID: 37778929
Two comments:

1. I don't know much about KnowledgeLake Capture, but I do know that it has some type of advanced OCR data extraction that may be able to do what you want with the current form.

2. Unless #1 turns out to be easy, or at least doable, my suggestion is to make a new form for future mailings. (If #1 is not doable, then you'll have to handle manually the forms that have already been mailed.) The new form should have a bar code on it that has the unique document number of each form. KnowledgeLake Capture supports barcode recognition, both as a document separator and in advanced capture. I can't be certain of this, having never used it or seen the manual, but it is very likely that KnowledgeLake Capture's barcode recognition capabilities can do what you need, i.e., recognize the unique number in each barcode and save that page as a separate PDF file, with the file name being the number in the barcode.

Regards, Joe
0
 

Author Closing Comment

by:K_Deutsch
ID: 37799314
The answer I picked as the accepted solution is based on my original question, which was vague. The solutions for my more clarified question are out of reach for what has to be a hit and run project for me or nothing. Thanks, Joe!
0
 
LVL 52

Expert Comment

by:Joe Winograd, EE MVE
ID: 37799380
You're welcome! I hope you can achieve what you want with Knowledge Lake Capture...might be possible. Good luck! Regards, Joe
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

This article provides the solution to a question (http://www.experts-exchange.com/Software/Photos_Graphics/Images_and_Photos/Q_28674207.html) posed here at Experts Exchange. The asker of the question has many JPG images in many folders, and all of t…
In a previous article here at Experts Exchange (http://www.experts-exchange.com/articles/18414/Create-a-PDF-file-with-Contact-Sheets-montage-of-thumbnails-for-all-JPG-files-in-a-folder-and-each-of-its-subfolders-using-an-automated-batch-method.html)…
The goal of the tutorial is to teach the user how to add a water mark to there photo. Once you have a photo you like you have to go into the water mark setting and add a water mark to the image. You can either choose a text watermark or an image…
Sometimes we receive PDF files that are in the wrong orientation. They may be sideways or even upside down. This most commonly happens with scanned or faxed documents. It is possible to rotate the view of these PDFs with the free Adobe Reader produc…

867 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

15 Experts available now in Live!

Get 1:1 Help Now