Solved

C# Read and Parse a PDF File

Posted on 2011-09-08
1
600 Views
Last Modified: 2013-12-17
I have a requirement to be able to open and read a PDF file using C#.  The idea is that I will read the PDF looking for specific keywords and storing the page on which the keyword was found.  I then want to be able to open the PDF jumping directly to the page on which I found the keyword.

My PDF does have bookmarks within it - looking at the documentation for Adobe Acrobat it appears that you can open a PDF and jump to a bookmark.  My question therefore is

1> How can I read the PDF initially and search for a keyword.

2> How can I find out if there is a bookmark on the page containing the keyword and if there is recover its name to be used when invoking Acrobat?

The idea is that the PDF is a parts catalogue and I want to link the parts within the database to the page in the catalogue which describes them.
0
Comment
Question by:ChrisMDrew
1 Comment
 
LVL 5

Accepted Solution

by:
Kelaros earned 500 total points
ID: 36502645
As far as I know there is not a native way to parse PDF in C#, but there are third-party packages that do so.

Here are some good resources on how to parse a PDF file in C# using third-party software:

http://forums.asp.net/t/1218513.aspx

And here's a C# project that is an actual PDF parser:

http://www.codeproject.com/KB/recipes/mgpdfreader.aspx
0

Featured Post

3 Use Cases for Connected Systems

Our Dev teams are like yours. They’re continually cranking out code for new features/bugs fixes, testing, deploying, testing some more, responding to production monitoring events and more. It’s complex. So, we thought you’d like to see what’s working for us.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This article describes relatively difficult and non-obvious issues that are likely to arise when creating COM class in Visual Studio and deploying it by professional MSI-authoring tools. It is assumed that the reader is already familiar with the cla…
In my previous two articles we discussed Binary Serialization (http://www.experts-exchange.com/A_4362.html) and XML Serialization (http://www.experts-exchange.com/A_4425.html). In this article we will try to know more about SOAP (Simple Object Acces…
This tutorial demonstrates a quick way of adding group price to multiple Magento products.
You have products, that come in variants and want to set different prices for them? Watch this micro tutorial that describes how to configure prices for Magento super attributes. Assigning simple products to configurable: We assigned simple products…

919 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

13 Experts available now in Live!

Get 1:1 Help Now