Solved

C# Read and Parse a PDF File

Posted on 2011-09-08
1
612 Views
Last Modified: 2013-12-17
I have a requirement to be able to open and read a PDF file using C#.  The idea is that I will read the PDF looking for specific keywords and storing the page on which the keyword was found.  I then want to be able to open the PDF jumping directly to the page on which I found the keyword.

My PDF does have bookmarks within it - looking at the documentation for Adobe Acrobat it appears that you can open a PDF and jump to a bookmark.  My question therefore is

1> How can I read the PDF initially and search for a keyword.

2> How can I find out if there is a bookmark on the page containing the keyword and if there is recover its name to be used when invoking Acrobat?

The idea is that the PDF is a parts catalogue and I want to link the parts within the database to the page in the catalogue which describes them.
0
Comment
Question by:ChrisMDrew
1 Comment
 
LVL 5

Accepted Solution

by:
Kelaros earned 500 total points
ID: 36502645
As far as I know there is not a native way to parse PDF in C#, but there are third-party packages that do so.

Here are some good resources on how to parse a PDF file in C# using third-party software:

http://forums.asp.net/t/1218513.aspx

And here's a C# project that is an actual PDF parser:

http://www.codeproject.com/KB/recipes/mgpdfreader.aspx
0

Featured Post

Master Your Team's Linux and Cloud Stack!

The average business loses $13.5M per year to ineffective training (per 1,000 employees). Keep ahead of the competition and combine in-person quality with online cost and flexibility by training with Linux Academy.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Today I had a very interesting conundrum that had to get solved quickly. Needless to say, it wasn't resolved quickly because when we needed it we were very rushed, but as soon as the conference call was over and I took a step back I saw the correct …
Entity Framework is a powerful tool to help you interact with the DataBase but still doesn't help much when we have a Stored Procedure that returns more than one resultset. The solution takes some of out-of-the-box thinking; read on!
This Micro Tutorial will give you a basic overview how to record your screen with Microsoft Expression Encoder. This program is still free and open for the public to download. This will be demonstrated using Microsoft Expression Encoder 4.
In a recent question (https://www.experts-exchange.com/questions/28997919/Pagination-in-Adobe-Acrobat.html) here at Experts Exchange, a member asked how to add page numbers to a PDF file using Adobe Acrobat XI Pro. This short video Micro Tutorial sh…

773 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question