Want to protect your cyber security and still get fast solutions? Ask a secure question today.Go Premium

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 243
  • Last Modified:

Extracting text from a PDF document?

I'm building a small program that can extract all the text and metadata from a PDF document.

Speed is important so preferably no OLE automation solutions.

What library or SDK do I need for this! Is this a big task??
0
kbb2
Asked:
kbb2
1 Solution
 
jhanceCommented:
Adobe's Acrobat can do it:

http://www.adobe.com/support/techdocs/1c356.htm

There are free/open source solutions:

http://research.compaq.com/SRC/virtualpaper/pstotext.html

Here is a company with a library you can link into your app:

http://www.totalint.com/products/developer/PDFextractor.asp
0
 
kbb2Author Commented:
Thanks! Just what I needed!
0
 
AxterCommented:
kbb2,
Could you please close this question, by awarding jhnace the points.
0

Featured Post

Concerto's Cloud Advisory Services

Want to avoid the missteps to gaining all the benefits of the cloud? Learn more about the different assessment options from our Cloud Advisory team.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now