• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 467
  • Last Modified:

Can I do this with Adobe Pro

Hi,

I'm trying to set up an improved, digitized and low-cost Accounts Payable solution, and want to know if I can do the following steps with Adobe Pro. I've taken a look at Nuance products and so far, they've fallen slightly short of the mark.
- First step - batch OCR: take a folder of pdf files (origin:print-to-pdf and paper scans) and turn them into searchable pdfs.
- Second step - add form: add a pdf form page as the first new page of each of the files. This form would contain text box, drop box with database feed and a signature box. I want people with Adobe Reader to be able to fill the form, sign the form and save the form.
- Third step - data retrieve: take of folder of filled and signed forms and extract the form data (not the whole pdf text, just the form data) into some sort of workable format (database, csv, excel)

These steps can be achieved through GUI or scripting, as long as they can be mostly automated.

Can this be done? Can this be done without Livecycle? Can it be done by myself, with vb / vba and googling experience?

Thanks for your input,

Thomas
0
nutsch
Asked:
nutsch
  • 3
  • 2
2 Solutions
 
Karl Heinz KremerCommented:
Yes, this can be done without Livecycle. You can do all the forms related stuff with AcroForms (the other forms system built into Acrobat). I don't know how good you are in taking information that you find online to create a working solution, so I wont comment on if you can do this ;)

Which version of Acrobat do you have? Acrobat Pro provides (the name changed with Acrobat X, but the functionality is still available) batch processing, or actions (in Acrobat X). This allows you to process a whole directory full of PDF files. Running OCR on these files during a batch sequence is possible. For files that got printed to PDF, you don't even have to do that - the textual information is already stored in a way that can be extracted (as long as you used a tool that actually puts valid font information into the PDF file, there are some bad programs out there, but if you stick to the big name products, you wont have a problem).

You can also add your forms page as part of the same sequence/action.

The only step that you cannot automate is to save the file as a reader enabled PDF (which would enable somebody with just Reader to fill in the form and save it). This step needs to be done manually

The last step - collecting the data - can be automated again. You can either do this in Acrobat, or in your own VB/VBA program - depending on how complicated the output format will be. Acrobat allows you to create a simple output file. If you want the full gamut of Excel options, you can do this as a VBA program. Take a look at this blog post that I wrote a few years ago, it describes how to read form fields from VBA:
http://www.khk.net/wordpress/2010/09/23/reading-pdf-form-fields-with-vba/

How much experience do you have in automating Acrobat?
0
 
nutschAuthor Commented:
I have no experience in automating Acrobat, but I have done extensive automation in excel with interfaces to SQL, SAP, NAV, etc.
0
 
nutschAuthor Commented:
The only step that you cannot automate is to save the file as a reader enabled PDF (which would enable somebody with just Reader to fill in the form and save it). This step needs to be done manually

Would that be manageable through some sort of sendkeys automation? Are there enough key shortcuts to allow for that?

Thanks
0
 
Karl Heinz KremerCommented:
If you feel comfortable enough in Excel, I would say download the SDK and play around with it. You can find it here: https://www.adobe.com/devnet/acrobat.html

Regarding the sendkeys suggestion - it may be possible. Take  a look at how you would do it manually, and see if you can automate that:
Alt-F <-to open the File menu
a  <- to select the "Save As" menu
x <- to select "Reader Extended PDF"
f <- to select the correct menu item
<return> <- get rid of the popup dialog
here you have to enter the filename - don't know if that can be automated via sendkeys
0
 
nutschAuthor Commented:
Thanks for your input. I'll download adobe x pro, get to work seeing what it can do, and come back to complain if I don't get it to work the way you say it would ;-)

Funny aside, I stumbled on your blog post earlier this week when I tried to automate data retrieval from a pdf, only to release that I didn't have the right license.

Thanks again,

Thomas
0

Featured Post

Free Tool: IP Lookup

Get more info about an IP address or domain name, such as organization, abuse contacts and geolocation.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

  • 3
  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now