Dump PDF to Text ?

Im looking for a free or opensource component to Dump PDF to Text ?

Please do not post unless you have a free or opensource method of accomplishing this.

THanks
Looking_4_AnswersAsked:
Who is Participating?
 
aikimarkConnect With a Mentor Commented:
1. download
ftp://ftp.foolabs.com/pub/xpdf/xpdf-3.02pl5-win32.zip

2. unzip the contents.

3. open a command prompt window and navigate to the unzipped directory.

4. Issue the following command:
PDFTEXT -?

5. read the displayed text

6. play with the different command line switches using your PDF files until you are satisfied with the result.

7. Use the ShellExecute() function in your application.
Refr:
http://delphi.about.com/od/windowsshellapi/a/executeprogram.htm
http://www.tek-tips.com/faqs.cfm?fid=5462

=========
You will probably need to wait/sleep for a moment for the shelled process to finish.
0
 
Looking_4_AnswersAuthor Commented:
Sorry, Delphi 2010
0
 
Looking_4_AnswersAuthor Commented:
Actually, i would be more interested in plain ole code versus a component
0
Free Tool: Path Explorer

An intuitive utility to help find the CSS path to UI elements on a webpage. These paths are used frequently in a variety of front-end development and QA automation tasks.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

 
aikimarkCommented:
@Looking

What criteria do you have for the resulting text file?

How was the PDF created?  -- The possible solutions differ.
0
 
Eddie ShipmanAll-around developerCommented:
http://www.swissdelphicenter.ch/en/showcode.php?id=2169

Here's the German->English translation, via Google Translate, of the comments:

So now I've finally found a solution how to
 the entire text from a PDF file (also possible with multiple pages)
 can read.
 I have been in the same time vorherein for my dirty
 Programming apologize, but I hope you can still what
 to start! The form includes a TMemo, 5 TLabel, TButton 1
 and an open dialogue

 oh yes, you have to insert before or a type library
 opens to the type library import dialog (project
 ) And find the folder when adding Selects Adobe Acrobat.
 There you should find a file named Acrobat.tbl, if not then
 Just look for times.
 Now for the install and then applying unit ready.
 have fun

I, personally, have not tried this.
0
 
Looking_4_AnswersAuthor Commented:
@aikimark:

No Criteria, just dump the entire pdf to text

@EddieShipman:

Thanks, i will give that a try
0
 
Ephraim WangoyaCommented:
0
 
aikimarkCommented:
@Looking

If the PDF only has an image layer, then there is an extra step required to recognize the characters/words in the image and add a text layer to the PDF.  People make this mistake when they convert TIFF images to PDFs and then wonder why they don't find anything when they do an Adobe search for words.

Did you look at the PDFText utility?
http://www.glyphandcog.com/textext.html
0
 
Looking_4_AnswersAuthor Commented:
@aikimark:

Yes i looked at it. It is not free.....at least the portion (commandline utility) that would be useful to me
0
 
aikimarkCommented:
For some reason, I'm only seeing XPDFText at that site.  I'll look for the free PDFText utility.
0
 
Looking_4_AnswersAuthor Commented:
Thanks....and right now., i am looking at these two free options:

http://www.foolabs.com/xpdf

http://mupdf.com 
0
 
aikimarkCommented:
I'm pretty sure that I got my utility from foolabs.  The README documentation states that it is opensource.
http://www.foolabs.com/xpdf/download.html
0
 
Looking_4_AnswersAuthor Commented:
so , which file do i download for windows XP, delphi 2010?

Also, can yiu tell me which file to call and how to pass the params?
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.