Solved

pdf to excel column question

Posted on 2011-09-14
Medium Priority
271 Views
Last Modified: 2012-05-12
Is there a tool I can use to extract text from a PDF and export it to Excel?
I have PDF documents that are created by a client. They always contain the same kind of information, but one week a document might be 3 pages and another week 40 pages, depending on the data they send.
I need to pull information from 3 columns, and the column names are always the same.
Again, the column data might span a few pages or many pages.
Is there a tool that lets me identify a column by name and have it pull that column's data no matter how many pages there are?
Question by:usky1
5 Comments
 
LVL 6

Expert Comment

by:theKashyap
ID: 36538031
Automation might be too cumbersome. There are some APIs available AFAIK, but I have never used them.
Or did you mean manually? Manually, you just have to copy the data from the PDF, paste it into a new Excel workbook, and use the "Text to Columns" feature.
 
LVL 27

Expert Comment

by:Glenn Ray
ID: 36538180
Adobe Acrobat Professional can often convert PDF documents to Word format. In turn, you should be able to convert that file to a CSV or TXT file that can be imported into Excel.
 

Author Comment

by:usky1
ID: 36538810
I wanted to see if anyone knew of a tool where you could define the columns you need from the PDF, save that definition, and reuse it as a template for future exports.
Doing a manual cut and paste is tedious. I don't mind having to run the template manually, but I would like the extraction itself to be automated after that.
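A minimal sketch of the "save a template and reuse it" idea, assuming the PDF has already been converted to CSV by one of the converters discussed in this thread. The column names (`Invoice`, `Amount`), the JSON template format, and the helper functions are all hypothetical and only illustrate the approach; they are not part of any product mentioned here.

```python
import csv
import io
import json


def save_template(path, columns):
    """Store the wanted column names so later exports can reuse them."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump({"columns": columns}, f)


def load_template(path):
    """Read back the list of column names saved by save_template."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)["columns"]


def apply_template(csv_text, columns):
    """Return CSV text that keeps only the template's columns, in template order."""
    reader = csv.DictReader(io.StringIO(csv_text))
    out = io.StringIO()
    # extrasaction="ignore" drops every column that is not in the template.
    writer = csv.DictWriter(
        out, fieldnames=columns, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    for row in reader:
        writer.writerow(row)
    return out.getvalue()


if __name__ == "__main__":
    # Hypothetical converted export; real data would come from the converter.
    converted = (
        "Invoice,Customer,Amount,Notes\n"
        "1001,Acme,250.00,rush\n"
        "1002,Globex,75.50,\n"
    )
    save_template("columns_template.json", ["Invoice", "Amount"])
    print(apply_template(converted, load_template("columns_template.json")))
```

The template is defined once; each week's converted file is then passed through `apply_template`, however many rows it contains.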
 
LVL 6

Accepted Solution

by:theKashyap (earned 1600 total points)
ID: 36544598
I've never used it, but check out: http://www.pdftoexcelonline.com/

Automation: In general, most of these tools (xxx to/from PDF converters) are implemented using standard APIs.
E.g. www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/access.pdf
Also check whether Google Documents provides any APIs.
Finally, check whether PostScript APIs can be used to read/write PDF. If they can, there are many open-source options available, e.g. Ghostscript.
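
As a rough illustration of what a scripted approach could look like: a minimal sketch, assuming the PDF text has already been extracted with its layout preserved (for example with `pdftotext -layout`), that cells are left-aligned under their headers and separated by at least two spaces, and that the header row repeats on each page. The column names and function names are hypothetical, and real documents would likely need extra handling for page footers and wrapped cells.

```python
import csv
import io
import re


def column_spans(line):
    """Map each cell on a line to its (start, end) character span.

    Cells are runs of text separated by two or more spaces.
    """
    cells = [(m.group(), m.start()) for m in re.finditer(r"\S+(?: \S+)*", line)]
    spans = {}
    for i, (name, start) in enumerate(cells):
        end = cells[i + 1][1] if i + 1 < len(cells) else None
        spans[name] = (start, end)
    return spans


def extract_columns(text, wanted):
    """Collect the wanted columns from every page of layout-preserved text."""
    rows = []
    spans = None
    # splitlines() also breaks on form feeds, which pdftotext puts between pages.
    for line in text.splitlines():
        if not line.strip():
            continue
        found = column_spans(line)
        if all(name in found for name in wanted):
            spans = found  # header row; it may repeat on every page
            continue
        if spans is None:
            continue  # skip anything that appears before the first header
        rows.append([line[spans[n][0]:spans[n][1]].strip() for n in wanted])
    return rows


def rows_to_csv(wanted, rows):
    """Render the extracted rows as CSV text that Excel can open."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(wanted)
    writer.writerows(rows)
    return out.getvalue()
```

Because the header is looked up by name on each page, the same call works whether the document is 3 pages or 40; lines that are not table rows (titles, footers) would still need to be filtered out.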
 

Author Comment

by:usky1
ID: 36546744
Thanks for the Adobe API document. I will give you points for that when this is closed.
However, I do not have the resources available to program this. I tried Nitro and the product is great, but there is a bug when working with large Excel files. They are aware of it but will not say when, or if, it will be fixed.


Question has a verified solution.
