Want to protect your cyber security and still get fast solutions? Ask a secure question today.Go Premium

x
?
Solved

Search pdf files in webpage woth c#

Posted on 2006-06-18
10
Medium Priority
?
771 Views
Last Modified: 2008-01-09
Hello experts,

I want to code search engine that will search pdf and aspx files content.
can you give me help on how i can perform this web site.
0
Comment
Question by:helkayal
  • 3
  • 3
7 Comments
 
LVL 30

Expert Comment

by:callrs
ID: 16930259
www.google.com returns results from pdf content
Not sure what you're asking here...
0
 
LVL 16

Accepted Solution

by:
OliWarner earned 1000 total points
ID: 16930411
Not sure if live-seaching is going to be the best route, but anyway...

You're going to want to look at this at some point: http://www.codeproject.com/useritems/PDFToText.asp
That's how to read a PDF into .net

What I would do, is extract all the data from all your PDFs every 3/4 days (depending on how often they change) and dumping the text in a database... It should then be quite easy to do a full text search on the database.
0
 
LVL 16

Expert Comment

by:OliWarner
ID: 16930413
Otherwise you can go with the live-search method that does the above on demand... But as noted, highly unrecommended.
0
Industry Leaders: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

 
LVL 1

Author Comment

by:helkayal
ID: 16930479
I saw this link before but it use reference to 2 external dlls , and i want to do this without external dlls.
can any one told me how can i do that.
simply how can i search or read pdf files without using any external dlls.
0
 
LVL 16

Expert Comment

by:OliWarner
ID: 16930493
Well the component used (iTextSharp) is open souce.... You've got all the source you need right there.
0
 
LVL 30

Expert Comment

by:callrs
ID: 17492199
Title of the link given by  OliWarner: "Extract text from PDF in C# (100% .NET)"
Question asked: "Search pdf files in webpage woth c#"

OliWarner's last comment: "Well the component used (iTextSharp) is open souce.... You've got all the source you need right there."


RECOMMEND:   Award to OliWarner
0
 
LVL 30

Expert Comment

by:callrs
ID: 17492200
Makes a good PAQ too
0

Featured Post

Free Tool: Port Scanner

Check which ports are open to the outside world. Helps make sure that your firewall rules are working as intended.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Today, the web development industry is booming, and many people consider it to be their vocation. The question you may be asking yourself is – how do I become a web developer?
How do you create a user-centered user experience on your website? And what are some things you should consider in the process?
Any person in technology especially those working for big companies should at least know about the basics of web accessibility. Believe it or not there are even laws in place that require businesses to provide such means for the disabled and aging p…
Video by: Mark
This lesson goes over how to construct ordered and unordered lists and how to create hyperlinks.
Suggested Courses
Course of the Month13 days, 17 hours left to enroll

581 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question