PDF parser

Hi, Experts! I want to parse PDF files (each file contains only one A4 page) and extract the text values and image objects. Is there a suitable component for this task?
Alexander_Savenko Asked:
den4b Commented:
Use the freeware package Xpdf: http://www.foolabs.com/xpdf/download.html

It has several command-line tools compiled for Windows:
 * pdftotext.exe  - for extracting text
 * pdfimages.exe  - for extracting images
 * pdfinfo.exe  - for extracting PDF document info (tags such as title and author)
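
As a rough illustration of how these three tools might be driven from a script, here is a minimal Python sketch. It assumes the Xpdf executables are on the PATH; the helper names build_commands and run_available and the file names used with them are made up for this example and are not part of Xpdf. The -layout option of pdftotext is optional and only asks it to keep the physical page layout.

```python
import shutil
import subprocess
from pathlib import Path


def build_commands(pdf_path, out_dir):
    """Build the Xpdf command lines for one PDF (hypothetical helper).

    Assumed usage of the tools:
      pdftotext [options] PDF-file [text-file]
      pdfimages [options] PDF-file image-root
      pdfinfo   [options] PDF-file
    """
    pdf = Path(pdf_path)
    out = Path(out_dir)
    return {
        # Text goes to <out_dir>/<name>.txt
        "text": ["pdftotext", "-layout", str(pdf), str(out / (pdf.stem + ".txt"))],
        # Images are written as <out_dir>/<name>-NNN.<ext>
        "images": ["pdfimages", str(pdf), str(out / pdf.stem)],
        # Document info is printed to standard output
        "info": ["pdfinfo", str(pdf)],
    }


def run_available(commands):
    """Run each command whose tool is installed; return exit codes by name.

    A value of None means the tool was not found on the PATH.
    """
    results = {}
    for name, argv in commands.items():
        if shutil.which(argv[0]) is None:
            results[name] = None
            continue
        done = subprocess.run(argv, capture_output=True, text=True)
        results[name] = done.returncode
    return results
```

Calling run_available(build_commands("page.pdf", "out")) would then run whichever of the three tools are installed, provided the out folder already exists. From a Delphi application the same command lines could be launched as external processes.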
 
Tomas Helgi Johannsson Commented:
Hi!

Here is an example of how to get the text from a PDF file: http://www.swissdelphicenter.ch/torry/showcode.php?id=2169
There are also several components on Torry (www.torry.net); just type "PDF" into the Quick Search.
You could also import the Adobe Reader OCX file to gain access to the functions you need to extract
the text and images (similar to the example above).

Regards,
   Tomas Helgi
 
Eddie Shipman (All-around developer) Commented:
You can take the free .NET iText library (iTextSharp), build it into a DLL or ActiveX control, and use it in your Delphi application.
http://itextsharp.sourceforge.net/
Question has a verified solution.
