Solved

Indexing PDFs WITHOUT using Type="path"

Posted on 2011-03-03
4
287 Views
Last Modified: 2012-05-11
Greetings

For various reasons, I am trying to index some PDFs using Coldfusion's CFIndex with the type="Custom". I am currently using cffile with action="readBinary" to read the PDFs.

Using Coldfusion 9.01

Thanks
0
Comment
Question by:RayBakker
  • 2
  • 2
4 Comments
 
LVL 52

Expert Comment

by:_agx_
ID: 35038306
(not for points ...)

I haven't worked w/verity in ages.  But since no one else has commented ... have you tried using cfpdf action="extracttext" and indexing the pdf text (not the bytes).
0
 

Accepted Solution

by:
RayBakker earned 0 total points
ID: 35058411
aqx

Thanks but here is what I ended up doing on Friday. Because I need to get meta data that was not in the pdfs I did the following:

1. I indexed the pdfs using the type=path
2. When displaying the results, I would read the file containing the meta data and then display the information.

Not as elegent as I would like but functional.

Thanks for your suggestion.
0
 
LVL 52

Expert Comment

by:_agx_
ID: 35064133
Not as elegent as I would like but functional.

If you mean o/s metadata (not the pdf properties) then AFAIK that's as easy as it gets.

Glad you solved it. But don't forget to select your comment as the answer and close the question :)
0
 

Author Closing Comment

by:RayBakker
ID: 35120581
It safisfied the basic requirements. It would have been more efficent if I could have place all the information I needed at indexing instead of at searching.
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

PROBLEM: How to add your own buttons to the bottom toolbar with paging info ( result count ). While creating a cfgrid, I ran into an issue where I wanted to embed my own custom buttons where the default ones ( insert / delete / etc… ) are for aes…
Recently while working on a project I got a very annoying cfdocument has no body error message. I had never seen this error before. So I checked the code. The code was pretty simple; it was Just showing me the cfdocumnt tag and inside that tag a …
Internet Business Fax to Email Made Easy - With  eFax Corporate (http://www.enterprise.efax.com), you'll receive a dedicated online fax number, which is used the same way as a typical analog fax number. You'll receive secure faxes in your email, f…
Learn how to create flexible layouts using relative units in CSS.  New relative units added in CSS3 include vw(viewports width), vh(viewports height), vmin(minimum of viewports height and width), and vmax (maximum of viewports height and width).

911 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

23 Experts available now in Live!

Get 1:1 Help Now