Solved

C# Read and Parse a PDF File

Posted on 2011-09-08
1
638 Views
Last Modified: 2013-12-17
I have a requirement to be able to open and read a PDF file using C#.  The idea is that I will read the PDF looking for specific keywords and storing the page on which the keyword was found.  I then want to be able to open the PDF jumping directly to the page on which I found the keyword.

My PDF does have bookmarks within it - looking at the documentation for Adobe Acrobat it appears that you can open a PDF and jump to a bookmark.  My question therefore is

1> How can I read the PDF initially and search for a keyword.

2> How can I find out if there is a bookmark on the page containing the keyword and if there is recover its name to be used when invoking Acrobat?

The idea is that the PDF is a parts catalogue and I want to link the parts within the database to the page in the catalogue which describes them.
0
Comment
Question by:ChrisMDrew
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
1 Comment
 
LVL 5

Accepted Solution

by:
Kelaros earned 500 total points
ID: 36502645
As far as I know there is not a native way to parse PDF in C#, but there are third-party packages that do so.

Here are some good resources on how to parse a PDF file in C# using third-party software:

http://forums.asp.net/t/1218513.aspx

And here's a C# project that is an actual PDF parser:

http://www.codeproject.com/KB/recipes/mgpdfreader.aspx
0

Featured Post

Technology Partners: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

This document covers how to connect to SQL Server and browse its contents.  It is meant for those new to Visual Studio and/or working with Microsoft SQL Server.  It is not a guide to building SQL Server database connections in your code.  This is mo…
A long time ago (May 2011), I have written an article showing you how to create a DLL using Visual Studio 2005 to be hosted in SQL Server 2005. That was valid at that time and it is still valid if you are still using these versions. You can still re…
Finds all prime numbers in a range requested and places them in a public primes() array. I've demostrated a template size of 30 (2 * 3 * 5) but larger templates can be built such 210  (2 * 3 * 5 * 7) or 2310  (2 * 3 * 5 * 7 * 11). The larger templa…

739 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question