Extracting text from a PDF document?

Posted on 2002-04-09
Medium Priority
Last Modified: 2010-04-02
I'm building a small program that can extract all the text and metadata from a PDF document.

Speed is important so preferably no OLE automation solutions.

What library or SDK do I need for this! Is this a big task??
Question by:kbb2
LVL 32

Accepted Solution

jhance earned 400 total points
ID: 6927832
Adobe's Acrobat can do it:


There are free/open source solutions:


Here is a company with a library you can link into your app:


Author Comment

ID: 6927852
Thanks! Just what I needed!
LVL 30

Expert Comment

ID: 6927877
Could you please close this question, by awarding jhnace the points.

Featured Post

The 14th Annual Expert Award Winners

The results are in! Meet the top members of our 2017 Expert Awards. Congratulations to all who qualified!

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

This article shows you how to optimize memory allocations in C++ using placement new. Applicable especially to usecases dealing with creation of large number of objects. A brief on problem: Lets take example problem for simplicity: - I have a G…
Article by: evilrix
Looking for a way to avoid searching through large data sets for data that doesn't exist? A Bloom Filter might be what you need. This data structure is a probabilistic filter that allows you to avoid unnecessary searches when you know the data defin…
The viewer will learn how to clear a vector as well as how to detect empty vectors in C++.
The viewer will be introduced to the technique of using vectors in C++. The video will cover how to define a vector, store values in the vector and retrieve data from the values stored in the vector.

619 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question