Want to protect your cyber security and still get fast solutions? Ask a secure question today.Go Premium

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 741
  • Last Modified:

C# Read and Parse a PDF File

I have a requirement to be able to open and read a PDF file using C#.  The idea is that I will read the PDF looking for specific keywords and storing the page on which the keyword was found.  I then want to be able to open the PDF jumping directly to the page on which I found the keyword.

My PDF does have bookmarks within it - looking at the documentation for Adobe Acrobat it appears that you can open a PDF and jump to a bookmark.  My question therefore is

1> How can I read the PDF initially and search for a keyword.

2> How can I find out if there is a bookmark on the page containing the keyword and if there is recover its name to be used when invoking Acrobat?

The idea is that the PDF is a parts catalogue and I want to link the parts within the database to the page in the catalogue which describes them.
0
ChrisMDrew
Asked:
ChrisMDrew
1 Solution
 
KelarosCommented:
As far as I know there is not a native way to parse PDF in C#, but there are third-party packages that do so.

Here are some good resources on how to parse a PDF file in C# using third-party software:

http://forums.asp.net/t/1218513.aspx

And here's a C# project that is an actual PDF parser:

http://www.codeproject.com/KB/recipes/mgpdfreader.aspx
0

Featured Post

Restore individual SQL databases with ease

Veeam Explorer for Microsoft SQL Server delivers an easy-to-use, wizard-driven interface for restoring your databases from a backup. No expert SQL background required. Web interface provides a complete view of all available SQL databases to simplify the recovery of lost database

Tackle projects and never again get stuck behind a technical roadblock.
Join Now