Cannot copy from PDF and paste to Notepad

Posted on 2005-05-16
Last Modified: 2013-12-03

I'm viewing a PDF with Adobe Reader 7.0. I select some text, copy it to the clipboard and then paste it into Notepad or Word, but what appears are little boxes instead of the original characters.
How can I correct this?

Thank you
Question by: Kokas79

    Accepted Solution

    The PDF file does not contain all the information needed to extract the text. A character in a PDF file does not necessarily carry any information about which "real" character it represents. Some PDF generators do a poor job when they embed fonts into PDF files: they use a proprietary encoding (e.g. 1 is A, 2 is B, 3 is C, ...) both in the embedded font and when they place glyphs on the page. Without a table that maps each code back to its character (e.g. character code 1 is 'A'), you cannot extract text from such a file.

    There is nothing you can do, besides complaining to whoever created the PDF file and to the author of the software that generated it.
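
    The encoding problem described above can be sketched in a few lines of Python. This is only a toy model, not a PDF parser: the names PRIVATE_ENCODING, encode_page_text and extract_text are hypothetical, and the fallback of treating each code as a raw character code is an assumption about how a viewer might behave when no reverse table is present. In real PDFs the reverse table is usually supplied as a ToUnicode mapping attached to the font.

```python
# Toy model of a font with a private encoding (hypothetical, not a real PDF parser).
# The generator's private scheme from the answer: 1 is A, 2 is B, 3 is C.
PRIVATE_ENCODING = {1: "A", 2: "B", 3: "C"}


def encode_page_text(text):
    """Return the codes a generator would place on the page for `text`."""
    char_to_code = {ch: code for code, ch in PRIVATE_ENCODING.items()}
    return [char_to_code[ch] for ch in text]


def extract_text(codes, code_to_char=None):
    """Return what a viewer could put on the clipboard.

    With a code-to-character table the original text is recovered.
    Without one, this sketch assumes the viewer falls back to treating
    each code as a raw character code, which yields control characters
    that an editor typically shows as little boxes.
    """
    if code_to_char is None:
        return "".join(chr(code) for code in codes)
    return "".join(code_to_char[code] for code in codes)


page_codes = encode_page_text("CAB")
with_table = extract_text(page_codes, PRIVATE_ENCODING)
without_table = extract_text(page_codes)

print(page_codes)           # the codes stored on the page
print(repr(with_table))     # text recovered using the reverse table
print(repr(without_table))  # raw codes interpreted as characters
```

    With the table, the copied text matches what was typeset. Without it, the clipboard receives only the private codes, which is consistent with the boxes seen when pasting.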

    Expert Comment

    I would like to know which free PDF generator might be satisfactory. PrimoPDF is the culprit in my case.
