Searchable Adobe PDF Documents

Hey guys,

We received about 50 PDF documents and we want to search for text within these documents. I tried doing a search but nothing comes up. These look like scanned PDF's, how can i search for text on all of the documents?
LVL 4
Cobra25Asked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

MacroShadowCommented:
If the pdf's are scanned then you can't search in them without ocr'ing them. They are no more than an image which obviously can't be content searched.

You can Google "PDF OCR" for free or paid solutions, the quality of different ocr products greatly differs.
0
Joe Winograd, Fellow&MVEDeveloperCommented:
If you have Adobe Acrobat (not Adobe Reader), then you already have OCR. Acrobat calls it Recognize Text in Version X (10) and Text Recognition in Version XI (11). You'll find it in the Tools section. Here's what it looks like in Acrobat XI Pro:

Acrobat XI Pro OCR
If you don't have Acrobat, I recommend the excellent (and free!) PDF-XChange Editor:
http://www.tracker-software.com/product/pdf-xchange-editor

They also have a PRO (non-free) version, but I think you'll find that the free version does everything you need — including OCR! Regards, Joe
0
Joe Winograd, Fellow&MVEDeveloperCommented:
One other thought. Doing 50 documents manually would be painful, so you may want to consider a batch processing solution. Here's an EE article that discusses a batch conversion approach using Nuance's Power PDF Advanced:
http://www.experts-exchange.com/Web_Development/Document_Imaging/A_13696-Batch-Conversion-of-PDF-and-TIFF-files-via-Command-Line-Interface.html

It is not free, but is reasonably priced, and as the article shows, they offer a 30-day free trial. Also, if these documents are coming in regularly, you may want to consider a Watched Folder approach, which is available in the same Power PDF Advanced product. Regards, Joe
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
Joe Winograd, Fellow&MVEDeveloperCommented:
Our messages just crossed. Yes, that's exactly what I was talking about in my first post <http:#a40291063> — Adobe Acrobat's Text Recognition. I showed the screenshot from Acrobat XI Professional in that one. Here's the Recognize Text screen from Acrobat X Standard:

Acrobat X Std OCR
Regards, Joe
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Adobe Acrobat

From novice to tech pro — start learning today.

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.