Link to home
Create AccountLog in
Software

Software

--

Questions

--

Followers

Top Experts

Avatar of mikecox_
mikecox_πŸ‡ΊπŸ‡Έ

make a PDF file searchable
How can I convert an unsearchable PDF file into one that is?

Zero AI Policy

We believe in human intelligence. Our moderation policy strictly prohibits the use of LLM content in our Q&A threads.


Avatar of JohnJohnπŸ‡¨πŸ‡¦

Rescan it and then during the scan, select the scan option to make the document searchable. I do this and it works.

Once the PDF has been scanned as an image only, it cannot be converted to searchable. So re-scan it.

SOLUTION
Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

Link to home
membership
Log in or create a free account to see answer.
Signing up is free and takes 30 seconds. No credit card required.
Create Account

Avatar of JohnJohnπŸ‡¨πŸ‡¦

I tried on my simple minded scanner and that did not seem to work. Good luck if your scanner supports what Joe says and works.

Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

> Good luck if your scanner supports what Joe says and works.

John,
It has nothing whatsoever to do with the scanner. It is OCR software that runs on the file after it has already been scanned. There are many, many software packages that can OCR existing (already scanned-in) files, such as ABBYY FineReader, PaperPort, Power PDF, OmniPage, and the list goes on-and-on. As I recollect, you have Adobe Acrobat. Try this. Open an unsearchable/image-only PDF in it that your "simple minded scanner" created, then select Tools, then Text Recognition or Recognize Text, depending on which version of Acrobat you have. It will create text with its OCR process right in the PDF, which you'll be able to search, as well as copy/paste into Notepad, Word, etc. This is completely unrelated to scanners/scanning (although, of course, lots of scanning software, such as the products mentioned above, can also OCR at scan time). Regards, Joe

Reward 1Reward 2Reward 3Reward 4Reward 5Reward 6

EARN REWARDS FOR ASKING, ANSWERING, AND MORE.

Earn free swag for participating on the platform.


Avatar of JohnJohnπŸ‡¨πŸ‡¦

I just scan to Adobe PDF and do not have a bunch of tools . I had a client with Abby Fine reader. Β Cheaper and faster just to re-scan (for me at any rate).

So I was posting from a very simple minded approach. I am sure you are correct, but I only use very simple minded approaches.

Avatar of JohnJohnπŸ‡¨πŸ‡¦

I have Adobe open. No Recognize Text. I think that must be Adobe Pro.

Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

Our posts just crossed...but you do have Acrobat...right? Not just Reader...but full Acrobat? If so, try what I suggested above on an image-only PDF β€” Tools>Text Recognition (or Recognize Text).

Free T-shirt

Get a FREE t-shirt when you ask your first question.

We believe in human intelligence. Our moderation policy strictly prohibits the use of LLM content in our Q&A threads.


Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

Our posts crossed again. It doesn't have to be Acrobat Pro. It can be Acrobat Standard. But it cannot be Adobe Reader.

Avatar of JohnJohnπŸ‡¨πŸ‡¦

I am trying, but no such thing in regular Adobe Acrobat.

Avatar of JohnJohnπŸ‡¨πŸ‡¦

OK, it is well hidden under Enhanced Scans. I will try it later.

I curse the day Microsoft "categorized" things and all vendors followed like lemmings. Nothing can be found anymore.

Reward 1Reward 2Reward 3Reward 4Reward 5Reward 6

EARN REWARDS FOR ASKING, ANSWERING, AND MORE.

Earn free swag for participating on the platform.


Avatar of JohnJohnπŸ‡¨πŸ‡¦

Mike - I am done now, over my head, and so over to Joe.

Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

Maybe these screenshots will help:

Acrobat X Standard
User generated image
Acrobat XI Pro
User generated image

Avatar of mikecox_mikecox_πŸ‡ΊπŸ‡Έ

ASKER

This is a rather large PDF file; it's the CC&R's of my condo association and it's the document our attorney provided. Β I have the OCR software but I can't image having to print, then scan all the pages from the PDF file into it. Β I was hoping that there was a program that would simply convert the file into a digital document that is searchable. It seems to me that I should be able to load that file into my OCR program and let it make the conversion.

Free T-shirt

Get a FREE t-shirt when you ask your first question.

We believe in human intelligence. Our moderation policy strictly prohibits the use of LLM content in our Q&A threads.


Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

> It seems to me that I should be able to load that file into my OCR program and let it make the conversion.

Yes, you should be able to do that. If you can't, the problem is with your OCR software. What OCR software do you have? If you can't get it to work with your OCR software, then read my earlier post β€” it explains exactly how to do what you want with free software. Regards, Joe

SOLUTION
Link to home
membership
Log in or create a free account to see answer.
Signing up is free and takes 30 seconds. No credit card required.

ASKER CERTIFIED SOLUTION
Avatar of mikecox_mikecox_πŸ‡ΊπŸ‡Έ

ASKER

Link to home
membership
Log in or create a free account to see answer.
Signing up is free and takes 30 seconds. No credit card required.

Avatar of mikecox_mikecox_πŸ‡ΊπŸ‡Έ

ASKER

I don't know if this is Kosher but since I think I had the best answer I'm selecting that as the best one. Β Joe's was the next best. Β I thank you all for your effort.

Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

> As suggested I tried to highlight some text and cannot

We knew from your first post that you would not be able to highlight text and copy/paste it, because you said it is "an unsearchable PDF file", meaning there's no text in it to search...or highlight! So it was clear from your opening comment that it is an image-only (probably scanned-in) PDF.

>Β the .doc file he scanned

Scanning a DOC file to PDF is unnecessary. In most versions of word, you can Save As to a PDF file. And if that's not available, there are many free PDF print drivers out there, such as Bullzip, CutePDF Writer, and doPDF. And a big advantage of these methods (Save As and Print to a PDF print driver from Word) is that they create a PDF Normal file, which has the text that may be copied/pasted/searched.

>Β pay a subscription fee

Yes, true for Acrobat, but that was why I gave you the link to my 5-minute EE video Micro Tutorial, How to OCR pages in a PDF with free software:
https://www.experts-exchange.com/videos/1618/

>Β Finally, as I suggested above, it is possible to load a PDF file into an OCR program.

Yes, as I mentioned in my first post.

>Β I don't know if this is Kosher but since I think I had the best answer I'm selecting that as the best one. Joe's was the next best. I thank you all for your effort.

Yes, it's Kosher to select your own post. Here's a member article that discusses it:
https://www.experts-exchange.com/articles/27139

And here's an EE support article that discusses it:
http://support.experts-exchange.com/customer/portal/articles/626862

Regards, Joe

Reward 1Reward 2Reward 3Reward 4Reward 5Reward 6

EARN REWARDS FOR ASKING, ANSWERING, AND MORE.

Earn free swag for participating on the platform.


Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

> asked by couldn't I just load the entire PDF file into it, but the question didn't appear to get noticed

Mike, that question did get noticed, and I replied with this (in post #a41945230):
>It seems to me that I should be able to load that file into my OCR program and let it make the conversion.


Yes, you should be able to do that. If you can't, the problem is with your OCR software. What OCR software do you have? If you can't get it to work with your OCR software, then read my earlier post β€” it explains exactly how to do what you want with free software. Regards, Joe

Avatar of mikecox_mikecox_πŸ‡ΊπŸ‡Έ

ASKER

I have an OCR program and in a f/u quested asked by couldn't I just load the entire PDF file into it, but the question didn't appear to get noticed, so I tried it and it worked.

Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

Mike,
Glad to hear that you tried it and it worked. As I mentioned earlier, your question did get noticed, and I replied in post #a41945230. In any case, great news that it's working! Regards, Joe

Free T-shirt

Get a FREE t-shirt when you ask your first question.

We believe in human intelligence. Our moderation policy strictly prohibits the use of LLM content in our Q&A threads.


Avatar of mikecox_mikecox_πŸ‡ΊπŸ‡Έ

ASKER

Thanks for the f/uj comments, I appreciate them and your efforts to help resolve this issue for me.

Avatar of Joe WinogradJoe WinogradπŸ‡ΊπŸ‡Έ

You're welcome, Mike β€” happy to help. I'm really glad to hear that the issue is resolved.
Software

Software

--

Questions

--

Followers

Top Experts

Software is any set of instructions that directs a computer to perform specific tasks or operations. Computer software consists of programs, libraries and related non-executable data (such as documentation). Computer software is non-tangible, contrasted with computer hardware, which is the physical component of computers. Software written in a machine language is known as "machine code". However, in practice, software is usually written in high-level programming languages than machine language. High-level languages are translated into machine language using a compiler or interpreter or a combination of the two.