Solved

C# Read and Parse a PDF File

Posted on 2011-09-08
Last Modified: 2013-12-17
I need to open and read a PDF file using C#. The idea is to search the PDF for specific keywords and store the page number on which each keyword is found. I then want to open the PDF directly at the page on which the keyword was found.

My PDF does have bookmarks within it. From the documentation for Adobe Acrobat, it appears that you can open a PDF and jump to a bookmark. My questions are therefore:

1> How can I read the PDF initially and search for a keyword?

2> How can I find out whether there is a bookmark on the page containing the keyword and, if there is, recover its name for use when invoking Acrobat?

The PDF is a parts catalogue, and I want to link the parts in the database to the catalogue pages that describe them.
Question by:ChrisMDrew
1 Comment
 

Accepted Solution

by:
Kelaros earned 500 total points
As far as I know, the .NET Framework has no built-in way to parse PDF files from C#, but there are third-party packages that do so.
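Whichever package is used, the keyword search itself can stay independent of it. Here is a minimal sketch, assuming the chosen library can hand back the text of each page as a string. The names `KeywordIndex`, `FindPages` and `pageTexts` are made up for illustration and do not come from any particular PDF library.

```csharp
using System;
using System.Collections.Generic;

static class KeywordIndex
{
    // pageTexts[i] is assumed to hold the extracted text of page i + 1.
    // Producing that text is the job of the third-party PDF library;
    // this method only does the case-insensitive keyword lookup.
    public static Dictionary<string, List<int>> FindPages(
        IList<string> pageTexts, IEnumerable<string> keywords)
    {
        var hits = new Dictionary<string, List<int>>(StringComparer.OrdinalIgnoreCase);
        foreach (string keyword in keywords)
        {
            var pages = new List<int>();
            for (int i = 0; i < pageTexts.Count; i++)
            {
                if (pageTexts[i].IndexOf(keyword, StringComparison.OrdinalIgnoreCase) >= 0)
                {
                    pages.Add(i + 1); // store 1-based page numbers
                }
            }
            hits[keyword] = pages;
        }
        return hits;
    }
}
```

The resulting page numbers could then be stored against each part in the database. Note that text extracted from a PDF may split words or part numbers across lines, so an exact substring match like this may miss some hits.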

Here is a discussion of how to parse a PDF file in C# using third-party software:

http://forums.asp.net/t/1218513.aspx

And here's a C# project that is an actual PDF parser:

http://www.codeproject.com/KB/recipes/mgpdfreader.aspx
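For the second half of the question (opening the PDF at the page that was found), here is a rough sketch based on Adobe's documented PDF open parameters, which accept `/A "page=N"` on the Acrobat or Reader command line. As far as I know, those parameters address page numbers and named destinations (`nameddest=`) rather than bookmark titles, so passing the stored page number is probably the simpler route. The class name `PdfLauncher` and the viewer path are assumptions for illustration only.

```csharp
using System;
using System.Diagnostics;
using System.Globalization;

static class PdfLauncher
{
    // Builds the command-line arguments for Acrobat/Reader:
    //   /A "page=<n>" "<path to pdf>"
    public static string BuildArguments(string pdfPath, int pageNumber)
    {
        if (pageNumber < 1)
        {
            throw new ArgumentOutOfRangeException("pageNumber");
        }
        string page = pageNumber.ToString(CultureInfo.InvariantCulture);
        return "/A \"page=" + page + "\" \"" + pdfPath + "\"";
    }

    // viewerExe is the full path to the installed viewer executable
    // (for example AcroRd32.exe); the actual location varies by
    // machine and version, so it is passed in rather than hard-coded.
    public static void OpenAtPage(string viewerExe, string pdfPath, int pageNumber)
    {
        Process.Start(viewerExe, BuildArguments(pdfPath, pageNumber));
    }
}
```

A call such as `PdfLauncher.OpenAtPage(viewerPath, @"C:\docs\catalogue.pdf", 12)` would then launch the viewer on page 12, provided the installed version honours the open parameters.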