Solved

Search pdf files in webpage woth c#

Posted on 2006-06-18
10
767 Views
Last Modified: 2008-01-09
Hello experts,

I want to code search engine that will search pdf and aspx files content.
can you give me help on how i can perform this web site.
0
Comment
Question by:helkayal
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 3
  • 3
10 Comments
 
LVL 30

Expert Comment

by:callrs
ID: 16930259
www.google.com returns results from pdf content
Not sure what you're asking here...
0
 
LVL 16

Accepted Solution

by:
OliWarner earned 250 total points
ID: 16930411
Not sure if live-seaching is going to be the best route, but anyway...

You're going to want to look at this at some point: http://www.codeproject.com/useritems/PDFToText.asp
That's how to read a PDF into .net

What I would do, is extract all the data from all your PDFs every 3/4 days (depending on how often they change) and dumping the text in a database... It should then be quite easy to do a full text search on the database.
0
 
LVL 16

Expert Comment

by:OliWarner
ID: 16930413
Otherwise you can go with the live-search method that does the above on demand... But as noted, highly unrecommended.
0
Guide to Performance: Optimization & Monitoring

Nowadays, monitoring is a mixture of tools, systems, and codes—making it a very complex process. And with this complexity, comes variables for failure. Get DZone’s new Guide to Performance to learn how to proactively find these variables and solve them before a disruption occurs.

 
LVL 1

Author Comment

by:helkayal
ID: 16930479
I saw this link before but it use reference to 2 external dlls , and i want to do this without external dlls.
can any one told me how can i do that.
simply how can i search or read pdf files without using any external dlls.
0
 
LVL 16

Expert Comment

by:OliWarner
ID: 16930493
Well the component used (iTextSharp) is open souce.... You've got all the source you need right there.
0
 
LVL 30

Expert Comment

by:callrs
ID: 17492199
Title of the link given by  OliWarner: "Extract text from PDF in C# (100% .NET)"
Question asked: "Search pdf files in webpage woth c#"

OliWarner's last comment: "Well the component used (iTextSharp) is open souce.... You've got all the source you need right there."


RECOMMEND:   Award to OliWarner
0
 
LVL 30

Expert Comment

by:callrs
ID: 17492200
Makes a good PAQ too
0

Featured Post

How Do You Stack Up Against Your Peers?

With today’s modern enterprise so dependent on digital infrastructures, the impact of major incidents has increased dramatically. Grab the report now to gain insight into how your organization ranks against your peers and learn best-in-class strategies to resolve incidents.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Does your audience prefer people in photos or no people? How can you best highlight what you’re selling? What are your competitors doing, and what can you do that is different and unique from them?  Continue reading to learn how to make your images …
Today, the web development industry is booming, and many people consider it to be their vocation. The question you may be asking yourself is – how do I become a web developer?
This tutorial will teach you the core code needed to finalize the addition of a watermark to your image. The viewer will use a small PHP class to learn and create a watermark.
Video by: Mark
This lesson goes over how to construct ordered and unordered lists and how to create hyperlinks.

730 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question