• Status: Solved

Adobe Reader / pdf files / search function

Hi Experts,
I have saved a doc as a PDF file, but whenever I do a text search it comes up with the same message box: "no matches were found".
I have attached a screenshot of the error for your reference.
Please assist.
PDF-doc-text-search-1.PNG
Frank .S
3 Solutions
 
duttcom Commented:
It looks like the PDF document is a scanned image rather than a text-based document that can be searched. As far as Acrobat Reader is concerned, it could be page after page of photos of your last holiday.

To be searchable, you would need to run it through an OCR filter (optical character recognition) to convert the pictures of text to actual text.

If you are willing to attach the PDF here, I can OCR it for you (with Acrobat Pro) and then you can search it.
 
Frank .S (Building Estimator, Author) Commented:
Hi expert, thank you for your information. However, I have many of these docs that all need the same search functionality.
Would there be a free program that I could get that converts a scanned-image PDF doc to one which allows me to search for text?
What I did was choose an existing Word doc and save it as a PDF, then did a text search and it worked! So it appears that when a doc is scanned it's only saved as an image file, but I'm not sure.....
 
duttcom Commented:
What you say is spot on. Word creates nice searchable PDFs because it writes the text itself into the file. A scanner only produces pictures of the pages, so unless your scanning software can be set to OCR when you scan (and such software is not uncommon), the resulting pages will be images rather than text.

Scanners often come with additional software which will OCR for you. ReadIris is one that springs to mind that often comes bundled with scanners. Do you have a scanner and the discs that came with it?

If not, you could try this site - http://www.onlineocr.net/

Or download one of the free programs listed here - http://www.top5freeware.com/ocr-software-for-windows

Don't expect the resulting PDFs to be exactly the same as the originals. PDFs can get quite ugly when you start fiddling with them.
 
Joe Winograd (Fellow & MVE, Developer) Commented:
Hi Frank,
duttcom may be right, but let's find out for sure. When the doc is open in Adobe Reader, what happens when you drag the mouse to highlight? If you see a solid block, it is an image-only file; if you see an I-beam and see text, then there is text in there.

I've attached two extractions from your posting...one is image-only and one was passed through OCR to make a PDF Searchable Image file (both an image and text from the OCR). Try these two and you'll see what I mean when you highlight. Let us know what happens and we'll take it from there.

Regards, Joe
Frank-image-only.pdf
Frank-image-and-text-OCR-searcha.pdf
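
The same check can be approximated without opening Reader. What follows is a rough sketch in Python, not how Adobe Reader itself decides: it assumes that a PDF containing real text must declare a /Font resource, and it looks for that marker in the raw file and inside any Flate-compressed streams. The function names are made up for illustration, and an encrypted or unusually encoded PDF could fool it.

```python
import re
import zlib

# Matches the raw bytes between "stream" and "endstream" in a PDF file.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)


def inflated_streams(pdf_bytes):
    """Yield the contents of every stream that zlib can decompress."""
    for match in STREAM_RE.finditer(pdf_bytes):
        try:
            # decompressobj tolerates the trailing newline before "endstream"
            yield zlib.decompressobj().decompress(match.group(1))
        except zlib.error:
            continue  # not Flate-encoded (for example a JPEG page image)


def has_font_resources(pdf_bytes):
    """Heuristic: True if a /Font marker appears anywhere in the file.

    Assumption: image-only scans carry no fonts, while text PDFs and
    OCR'd "searchable image" PDFs do.
    """
    if b"/Font" in pdf_bytes:
        return True
    return any(b"/Font" in data for data in inflated_streams(pdf_bytes))
```

Reading a file's bytes and passing them to has_font_resources should give False for an image-only scan and True for a searchable PDF, but treat the answer as a hint only; the highlight test in Reader remains the simplest confirmation.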
 
Joe Winograd (Fellow & MVE, Developer) Commented:
Frank,
Our messages just crossed. I'll get back to you in a moment with some OCR recommendations. Regards, Joe
 
duttcom Commented:
If you would like to upgrade your PDF handling options and don't mind a little outlay, I can recommend Nitro Pro - it does what Acrobat Pro does but at a fraction of the price. It's the PDF software I give my users.
 
Joe Winograd (Fellow & MVE, Developer) Commented:
Frank,
For starters, what scanner do you have (make and model) and did it come with bundled imaging/scanning/OCR software? For example, Nuance's PaperPort is very common as software bundled with scanners and it has built-in OCR:
http://nuance.com/for-individuals/by-product/paperport/index.htm

It can scan straight to a PDF Searchable Image file...you do not have to scan first and then OCR later...it invokes OCR as part of a Scanning Profile (if you want it to). Of course, if you already have image-only PDFs, it can run OCR on them, converting them to PDF Searchable Image files. However, if it wasn't bundled with your scanner, it is not a free product.

I know many free OCR products, but I'll wait to hear back from you before making any suggestions. You may already have a piece of software bundled with your scanner that can do the job. Regards, Joe
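
For a whole folder of image-only PDFs, the OCR step can also be scripted. Here is a minimal sketch, assuming the free command-line tool OCRmyPDF (not mentioned elsewhere in this thread) is installed and on the PATH. The function names and folder layout are placeholders, and the output quality depends entirely on the OCR engine behind it.

```python
import subprocess
from pathlib import Path


def plan_ocr_jobs(source_dir, output_dir):
    """Return one (input, output) pair per PDF found directly in source_dir."""
    source_dir, output_dir = Path(source_dir), Path(output_dir)
    return [(pdf, output_dir / pdf.name) for pdf in sorted(source_dir.glob("*.pdf"))]


def ocr_command(input_pdf, output_pdf):
    """Build the OCRmyPDF command line for one file."""
    # --skip-text leaves pages that already contain text untouched
    return ["ocrmypdf", "--skip-text", str(input_pdf), str(output_pdf)]


def run_batch(source_dir, output_dir):
    """OCR every PDF in source_dir, writing searchable copies to output_dir."""
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    for input_pdf, output_pdf in plan_ocr_jobs(source_dir, output_dir):
        # check=True stops the batch at the first file that fails
        subprocess.run(ocr_command(input_pdf, output_pdf), check=True)
```

Calling run_batch with a scans folder and an empty output folder keeps the originals untouched, which is worth doing given that OCR'd PDFs rarely come out identical to the source.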
 
Joe Winograd (Fellow & MVE, Developer) Commented:
The question is: Why does a text search in a PDF file come up with no matches? The answer is: Because there's no text in the PDF file — it is an image-only PDF.

duttcom got the right answer in post https:#a39276742 and it deserves to be the Accepted Solution, imo.

The following other posts all provided correct and helpful information and deserve to be Assisted Solutions, imo:

https:#a39276790 (duttcom)
https:#a39276793 (joewinograd)
https:#a39276802 (duttcom)
https:#a39276809 (joewinograd)

Regards, Joe
Question has a verified solution.
