How to Search a PDF image from a BLOB for specific data Using ASP.NET/C#

Posted on 2011-10-31
Last Modified: 2013-12-16
I have written a website that pulls various fields from a database table and displays them on a second page, QueryResults.aspx. The page shows all of the information in the table except the BLOB, which is a PDF image of a receipt. Once users look through the set of results they have pulled back, they are able to view the image they are looking for. My questions are:
1. Would it be feasible to pull back the BLOB of each PDF image and search it for a receipt number? The number can appear in various positions on the page, and it is not stored in the database table that holds the BLOB and the other information displayed on QueryResults.aspx.
2. How would you search for words within a PDF file that exists only in memory after being pulled from a BLOB? (Would you just write it to a temporary location, search it, and then delete it once you are done with the file? What would the syntax be for accomplishing such a task?)

I am new to ASP.NET and apologize in advance if the questions seem a bit off-kilter, but I was asked to complete this task by my boss. Both the BLOB and the other information are stored in an Oracle database.
Question by: thenthorn1010

    Accepted Solution

    You can implement a solution using the iTextSharp library.
    It will let you convert the PDF to text on the fly, in memory, after which a string search is straightforward.
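
As a rough illustration of that suggestion, here is a minimal sketch assuming iTextSharp 5.x and assuming the BLOB has already been read into a byte array (for example from the data reader's column value; how you fetch it depends on your Oracle provider). The class and method names (`ReceiptPdfSearch`, `ExtractText`, `ContainsReceiptNumber`) are hypothetical and chosen only for this example. Note that text extraction only works if the PDF has a text layer; a receipt that is purely a scanned image would need OCR first, which this sketch does not cover.

```csharp
using System;
using System.Text;
using iTextSharp.text.pdf;
using iTextSharp.text.pdf.parser;

// Hypothetical helper class; not part of iTextSharp.
public static class ReceiptPdfSearch
{
    // Extracts the text layer of a PDF held entirely in memory.
    // No temporary file is written: PdfReader accepts a byte array.
    public static string ExtractText(byte[] pdfBytes)
    {
        var text = new StringBuilder();
        var reader = new PdfReader(pdfBytes);
        try
        {
            // iTextSharp page numbers start at 1.
            for (int page = 1; page <= reader.NumberOfPages; page++)
            {
                text.AppendLine(PdfTextExtractor.GetTextFromPage(reader, page));
            }
        }
        finally
        {
            reader.Close();
        }
        return text.ToString();
    }

    // Returns true if the receipt number appears anywhere in the extracted text.
    public static bool ContainsReceiptNumber(byte[] pdfBytes, string receiptNumber)
    {
        return ExtractText(pdfBytes)
            .IndexOf(receiptNumber, StringComparison.OrdinalIgnoreCase) >= 0;
    }
}
```

Calling `ReceiptPdfSearch.ContainsReceiptNumber(blobBytes, "12345")` for each row would answer the first question, though extracting text from every BLOB on each query could be slow for large result sets; caching the extracted text or storing the receipt number in its own column may be worth considering.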

