Solved

Search pdf files in webpage woth c#

Posted on 2006-06-18
10
759 Views
Last Modified: 2008-01-09
Hello experts,

I want to code search engine that will search pdf and aspx files content.
can you give me help on how i can perform this web site.
0
Comment
Question by:helkayal
  • 3
  • 3
10 Comments
 
LVL 30

Expert Comment

by:callrs
ID: 16930259
www.google.com returns results from pdf content
Not sure what you're asking here...
0
 
LVL 16

Accepted Solution

by:
OliWarner earned 250 total points
ID: 16930411
Not sure if live-seaching is going to be the best route, but anyway...

You're going to want to look at this at some point: http://www.codeproject.com/useritems/PDFToText.asp
That's how to read a PDF into .net

What I would do, is extract all the data from all your PDFs every 3/4 days (depending on how often they change) and dumping the text in a database... It should then be quite easy to do a full text search on the database.
0
 
LVL 16

Expert Comment

by:OliWarner
ID: 16930413
Otherwise you can go with the live-search method that does the above on demand... But as noted, highly unrecommended.
0
Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

 
LVL 1

Author Comment

by:helkayal
ID: 16930479
I saw this link before but it use reference to 2 external dlls , and i want to do this without external dlls.
can any one told me how can i do that.
simply how can i search or read pdf files without using any external dlls.
0
 
LVL 16

Expert Comment

by:OliWarner
ID: 16930493
Well the component used (iTextSharp) is open souce.... You've got all the source you need right there.
0
 
LVL 30

Expert Comment

by:callrs
ID: 17492199
Title of the link given by  OliWarner: "Extract text from PDF in C# (100% .NET)"
Question asked: "Search pdf files in webpage woth c#"

OliWarner's last comment: "Well the component used (iTextSharp) is open souce.... You've got all the source you need right there."


RECOMMEND:   Award to OliWarner
0
 
LVL 30

Expert Comment

by:callrs
ID: 17492200
Makes a good PAQ too
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Problem to be resolved in this article Currently, development of website and web application can be done without writing thousands of lines of programming code by hand. Description This can be done through by using a open source framework such …
An enjoyable and seamless user experience can go a long way on an eCommerce site. While a cohesive layout and engaging copy play roles in creating a positive user experience, some sites neglect aspects that seem marginal but in actuality prove very …
This tutorial demonstrates how to identify and create boundary or building outlines in Google Maps. In this example, I outline the boundaries of an enclosed skatepark within a community park.  Login to your Google Account, then  Google for "Google M…
This video teaches users how to migrate an existing Wordpress website to a new domain.

943 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

11 Experts available now in Live!

Get 1:1 Help Now