PDF Extraction to Excel

I have a multiple page scanned PDF document that contains several 1 page invoices.  I need a solution to OCR the document so that the data may be extracted and then select specific fields from the document to export them to a spreadsheet.  The specific fields are repeated on each page.

I've looked at a couple of solutions, but you have to copy each field from all pages to extract the data fields that I want and that takes too much time.
curtconnerAsked:
Who is Participating?
 
jppintoCommented:
Did you tryed PDF2XL? Take a look at my review to this program on my blog here:

http://excel-user.blogspot.com/2010/11/pdf-to-excel.html

jppinto
0
 
curtconnerAuthor Commented:
jppinto:  The OCR piece didn't work very well with the document that I'm scanning.  Loved the features, but the OCR failed.
0
 
InfoStrangerCommented:
Do you have Adobe Acrobat?

My instructions below are for Acrobat 8.0.  To convert picture to text using OCR,
1) open PDF in Acrobat
2) Select Document Menu
3) Select OCR Text Recognition
4) Recognize Text Using OCR...
5) Click OK

You may want to try this first then try it again.  The OCR may not work as well if the document is faded or too crooked.
0
Cloud Class® Course: Microsoft Windows 7 Basic

This introductory course to Windows 7 environment will teach you about working with the Windows operating system. You will learn about basic functions including start menu; the desktop; managing files, folders, and libraries.

 
redmondbCommented:
curtconner,

I've frequently used ABBYY FineReader for tasks such as this. (My version is V8, the current is V10 - http://www.abbyy.com/.)

Initially, you create a template specifying the fields that you want to extract from the invoice (a few minutes work for a typical invoice layout) and set up a job to open, read and export the fields to Excel (another minute's work).

From then on, simply run the job which opens the PDF, OCRs the required fields and exports them to Excel.

Regards,
Brian.
0
 
jyk_ausCommented:
Cortconnor,

Have you considered purchasing the full version of Acrobat Reader?  Amongst other things it has the facility to convert PDF to quite a few formats, Excel included.

See here:
http://www.adobe.com/products/acrobatstandard.html

Best regards
Jacob
0
 
viki2000Commented:
Try this http://www.abbyyusa.com/finereader/
It is programmable with macros, has customizable areas...
0
 
redmondbCommented:
Thanks, curtconner.

Hope it worked out OK in the end.

Regards,
Brian.
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.