Solved

Indexing PDFs WITHOUT using Type="path"

Posted on 2011-03-03
4
288 Views
Last Modified: 2012-05-11
Greetings

For various reasons, I am trying to index some PDFs using Coldfusion's CFIndex with the type="Custom". I am currently using cffile with action="readBinary" to read the PDFs.

Using Coldfusion 9.01

Thanks
0
Comment
Question by:RayBakker
  • 2
  • 2
4 Comments
 
LVL 52

Expert Comment

by:_agx_
ID: 35038306
(not for points ...)

I haven't worked w/verity in ages.  But since no one else has commented ... have you tried using cfpdf action="extracttext" and indexing the pdf text (not the bytes).
0
 

Accepted Solution

by:
RayBakker earned 0 total points
ID: 35058411
aqx

Thanks but here is what I ended up doing on Friday. Because I need to get meta data that was not in the pdfs I did the following:

1. I indexed the pdfs using the type=path
2. When displaying the results, I would read the file containing the meta data and then display the information.

Not as elegent as I would like but functional.

Thanks for your suggestion.
0
 
LVL 52

Expert Comment

by:_agx_
ID: 35064133
Not as elegent as I would like but functional.

If you mean o/s metadata (not the pdf properties) then AFAIK that's as easy as it gets.

Glad you solved it. But don't forget to select your comment as the answer and close the question :)
0
 

Author Closing Comment

by:RayBakker
ID: 35120581
It safisfied the basic requirements. It would have been more efficent if I could have place all the information I needed at indexing instead of at searching.
0

Featured Post

3 Use Cases for Connected Systems

Our Dev teams are like yours. They’re continually cranking out code for new features/bugs fixes, testing, deploying, testing some more, responding to production monitoring events and more. It’s complex. So, we thought you’d like to see what’s working for us.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

I spent nearly three days trying to figure out how incorporate OAuth in Coldfusion for the Eventful API. Hopefully, this article will allow Coldfusion Programmers to buzz through the API when they need to. Basically, what this script does is authori…
Sometimes databases have MILLIONS of records and we need a way to quickly query that table to return the results me need. Sure you could use CFQUERY but it takes too long when there are millions of records. That is why SOLR was invented. Please …
This Micro Tutorial will give you a basic overview how to record your screen with Microsoft Expression Encoder. This program is still free and open for the public to download. This will be demonstrated using Microsoft Expression Encoder 4.
The Email Laundry PDF encryption service allows companies to send confidential encrypted  emails to anybody. The PDF document can also contain attachments that are embedded in the encrypted PDF. The password is randomly generated by The Email Laundr…

803 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question