Browser-based file search for a Document Imaging System
Posted on 2004-10-22
We have a document imaging system (DIS) in which paper files are converted into searchable PDFs. Currently, users access these files through a mapped drive. Searching within a document works very well, and even searching across documents for content works well. However, finding the file you're looking for in the first place is a pain. Each file is named for its case number, a 1- to 8-character number, and the files are further divided into folders by the year of the case (i.e., all the 2004 cases go into a 2004 folder within the DIS). Scrolling through hundreds of files looking for, say, 35873 is becoming a real chore as this thing grows, more so if you don't know which year the case is from (although we have to break it up that way for archival purposes).
I would like to put a search engine on our intranet site that does nothing more than reach out to that file server and look at the file names of the PDFs. The user should be able to enter the case number they're looking for and press Submit. They then get the result, which should be a link to the file. They click the link and, since it's a PDF, it opens in Adobe Acrobat within the browser.
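To make the request concrete, here is a minimal sketch of the lookup described above, written in Python purely for illustration (the real thing could just as well be ASP). It assumes each PDF is named exactly after its case number, and the function names, the share path, and the base URL are all hypothetical placeholders, not part of any existing product.

```python
import os
from pathlib import Path
from urllib.parse import quote


def find_case_files(root, case_number):
    """Return paths (relative to root) of PDFs whose name is exactly case_number.

    Only file names are inspected; the PDFs themselves are never opened.
    """
    wanted = case_number.strip().lower()
    matches = []
    # os.walk visits root and every subfolder, so the year does not need to be known.
    for folder, _subdirs, files in os.walk(root):
        for name in files:
            stem, ext = os.path.splitext(name)
            if ext.lower() == ".pdf" and stem.lower() == wanted:
                full_path = Path(folder) / name
                matches.append(full_path.relative_to(root).as_posix())
    return sorted(matches)


def to_link(base_url, relative_path):
    """Build a clickable URL for a match (base_url is an assumed web alias for the share)."""
    # quote() escapes spaces and similar characters but leaves "/" alone.
    return base_url.rstrip("/") + "/" + quote(relative_path)


if __name__ == "__main__":
    # Hypothetical locations; substitute the real share and intranet alias.
    dis_root = r"\\fileserver\DIS"
    for rel in find_case_files(dis_root, "35873"):
        print(to_link("http://intranet.example/dis", rel))
```

Walking the whole tree on every request may be slow over a network share as the archive grows; caching the file list would be one way around that, but it is left out here to keep the sketch short.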
The users can't edit these files, so opening them read-only is fine. I'm looking for a largely turnkey solution. I don't mind getting in and editing HTML or ASP, but I don't know enough to write it from scratch. Most of the search engines I've looked at don't return file names, which is all I want. They want to open and parse the page, which would normally be ASP or HTML, and return data from within the file. Acrobat can already do that within the PDFs. I only want file names. Has anybody written, or does anybody know of, a browser-based search engine that can search for a PDF file by name and return its name and a link to it?
I'm going to start this question at 250 points. That would be for a simple search engine that can search through the entire DIS for a file name, and provide that file name as a clickable link to the file. For 500 points, I'd also like a drop-down list that lets the user narrow the search path. Something like this, with each line being a searchable selection:
Search entire DIS (default selection)
Central Records (this would search only the Central Records subfolder)
2004 (this would search only the 2004 folder within Central Records)
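The drop-down described in the list above could be generated from the folder tree itself, so new years appear without anyone editing the page. The following is a rough sketch in Python, for illustration only. It assumes a layout of DIS root, then department folders such as Central Records, then year folders; `list_scopes`, `resolve_scope`, and the label format are hypothetical names chosen for this example.

```python
from pathlib import Path


def list_scopes(root):
    """Return (label, relative_path) pairs for the drop-down, whole DIS first."""
    root = Path(root)
    scopes = [("Search entire DIS", "")]  # default selection
    for dept in sorted(p for p in root.iterdir() if p.is_dir()):
        scopes.append((dept.name, dept.name))
        for year in sorted(p for p in dept.iterdir() if p.is_dir()):
            # Label includes the parent so "2004" is unambiguous across departments.
            scopes.append((f"{dept.name} / {year.name}", f"{dept.name}/{year.name}"))
    return scopes


def resolve_scope(root, relative_path):
    """Turn a submitted drop-down value into the folder to search.

    Rejects values such as "../" that would point outside the DIS root.
    """
    root = Path(root).resolve()
    target = (root / relative_path).resolve()
    if target != root and root not in target.parents:
        raise ValueError("scope is outside the DIS root")
    if not target.is_dir():
        raise ValueError("scope is not a folder")
    return target
```

The folder returned by `resolve_scope` would then be the starting point for the file-name search, so picking a narrower entry simply means a smaller tree to walk.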