• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 1080
  • Last Modified:

Search text inside PDF documents. ASP.NET or PHP

Dear all,
I would like to build a search engine which be able to search into a folder with PDF files.
I have about 90 PDF files and I will upload 1 new file per month
I can't use Microsoft Indexing Services or Lucence.net as per my hoster limitations

Any other idea?
0
Jorgefa
Asked:
Jorgefa
1 Solution
 
Daniel JungesCommented:
0
 
Ray PaseurCommented:
What is the source of the PDF files?  Do you have MySQL handy?
0
Concerto Cloud for Software Providers & ISVs

Can Concerto Cloud Services help you focus on evolving your application offerings, while delivering the best cloud experience to your customers? From DevOps to revenue models and customer support, the answer is yes!

Learn how Concerto can help you.

 
JorgefaAuthor Commented:
Hi raja, thank for he info but for every update file I will need to run zoom in order to generate de indexes.
I am trying to look for a solution just coding and without (if posiible) third party commercial code

Hy Ray,
  the PDF are files that I generate every month, short books
  And yes, I can use MySQL
Cheers
0
 
Ray PaseurCommented:
There are a couple of options.  You can store the searchable elements of the PDF files in the MySQL data base along with a link to the PDF files.  Then you can use FULLTEXT index and MATCH AGAINST to search in MySQL.  You'll find the /path/to the PDF and return that.

Another option may be to use a hosted service such as ATOMZ or FREEFIND.
http://www.atomz.com/
http://www.freefind.com/

Another option may be to use Zoom
http://www.wrensoft.com/zoom/
0
 
JorgefaAuthor Commented:
Hi Ray,
how can I store the text of the PDF in MySQL?
0
 
Ray PaseurCommented:
You said you generate the PDF files, so I would expect that you would set up a MySQL table with the PDF contents in different columns, as appropriate to your search needs.  Then as you generate the PDF files, you would also send the underlying data to a script that would update the MySQL data base.
0
 
JorgefaAuthor Commented:
my fault!
I am not english speaker, generate is not correct, I received the files and I upload them
0
 
Ray PaseurCommented:
OK, that is different.  But there still is hope.  How big are the files?
0
 
JorgefaAuthor Commented:
Around 5MG as maximum
0
 
Ray PaseurCommented:
Can you send one of them to me please?  A smaller one if possible - I think we have a 5MB limit on the email attachments.  Email it to me at RPaseur [at] NationalPres V ORG
0

Featured Post

Prep for the ITIL® Foundation Certification Exam

December’s Course of the Month is now available! Enroll to learn ITIL® Foundation best practices for delivering IT services effectively and efficiently.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now