Cannot copy from pdf and paste to notepad

Posted on 2005-05-16
Medium Priority
Last Modified: 2013-12-03

I'm viewing a PDF with Adobe Reader 7.0. I select some text, copy it to the clipboard, and then paste it into Notepad or Word, but what appears is little boxes instead of the original characters.
How can I correct this?

Thank you
Question by:Kokas79

Accepted Solution

Karl Heinz Kremer earned 320 total points
ID: 14017047
The PDF file does not contain all the information needed to extract the text. The problem is that a character in a PDF file may carry no information about which "real" character it corresponds to. Some PDF generators do a pretty bad job when they embed fonts into PDF files. They use a proprietary encoding (e.g. 1 is A, 2 is B, 3 is C, ...) both in the embedded font and when they place glyphs on the page. Without a table that implements the reverse mapping (e.g. character code 1 is 'A'), you cannot extract text from such a file.

There is nothing you can do (besides complaining to whoever created the PDF file, and to the author of the software that created it).
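
To make the encoding problem concrete, here is a minimal sketch in Python. It is a toy model, not how Adobe Reader or any PDF library actually works: the function name extract_text, the page_codes list, and the reverse_table dictionary are all made up for illustration. In a real PDF the reverse table is typically supplied as a ToUnicode CMap attached to the font; the dictionary below only stands in for that idea.

```python
def extract_text(glyph_codes, to_unicode=None):
    """Turn the glyph codes placed on a page into text, as copy/paste would.

    glyph_codes: codes the (hypothetical) generator wrote on the page.
    to_unicode:  optional reverse table mapping code -> real character.
    """
    if to_unicode is None:
        # No reverse table: all we can hand over are the raw codes.
        # Codes like 1, 2, 3 are control characters, which editors
        # tend to display as little boxes.
        return "".join(chr(code) for code in glyph_codes)
    # With a reverse table, each code maps back to its real character;
    # unknown codes become the Unicode replacement character.
    return "".join(to_unicode.get(code, "\ufffd") for code in glyph_codes)


# Hypothetical proprietary encoding from the answer: 1 is A, 2 is B, 3 is C.
page_codes = [3, 1, 2]
reverse_table = {1: "A", 2: "B", 3: "C"}

with_table = extract_text(page_codes, reverse_table)
without_table = extract_text(page_codes)

print(repr(with_table))     # 'CAB'
print(repr(without_table))  # '\x03\x01\x02'
```

The page still looks right on screen in both cases, because the embedded font draws the correct glyph shape for each code. Only the extraction step differs, which is why the text displays fine but pastes as boxes.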

Expert Comment

ID: 22415441
I should like to know which free PDF generator might be satisfactory.  PrimoPDF is the culprit in my case.


