• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 641
  • Last Modified:

Mass Change PDF Documents

Bit of a strange one but I wonder if anyone has done this before, We have a lot of PDF documents that contain a value in the Author field in Document Properties, now I need to strip this out of all the PDF's so ideally it's blank.  Has anyone done this before or can anyone recommend a way to do this on mass?

Any Help most welcome...

  • 2
1 Solution
Giovanni HewardCommented:
Yes, try pdftk

pdftk in.pdf dump_data output report.txt

Open in new window

Find and remove hidden content

Use the Remove Hidden Information feature to find and remove content from a document that you don’t want, such as hidden text, metadata, comments, and attachments. When you remove items, additional items are automatically removed from the document. Items that are removed include digital signatures, document information added by third-party plug-ins and applications, and special features that enable Adobe Reader users to review, sign, and fill PDF documents.
To examine every PDF for hidden content before you close it or send it in e-mail, specify that option in the Documents preferences using the Preferences dialog box.

    Choose Tools > Protection > Remove Hidden Information. If you don’t see the Protection panel, see the instructions for adding panels at Task panes.

    If items are found, they are listed in the Remove Hidden Information panel with a selected check box beside each item.
    Make sure that the check boxes are selected only for the items that you want to remove from the document. (See Remove Hidden Information options.)
    Click Remove to delete selected items from the file, and click OK.
    Choose File > Save, and specify a filename and location. If you don’t want to overwrite the original file, save the file to a different name, location, or both.

The selected content is permanently removed when you save the file. If you close the file without saving it, repeat this process, making sure to save the file.
Remove Hidden Information options

    Metadata includes information about the document and its contents, such as the author’s name, keywords, and copyright information, used by search utilities. To view metadata, choose File > Properties.
Giovanni HewardCommented:
To update on an automated (mass) scale, try the following:

1. Create text file containing replacement metadata

InfoKey: Author
InfoValue: Anonymous

2. Run pdftk in the following fashion

pdftk "input.pdf" update_info data.txt output "output.pdf"

3. Automate on a large scale, use this batch script as a template

for /f "delims=" %%f in ('dir *.pdf /s/a/b') do (
     pdftk "%%f" update_info data.txt output "%%f"

Open in new window

To delete the author metadata completely, your data.txt file should look like this:

InfoKey: Author
Jamie786Author Commented:
Perfect exactly what I needed.... had to tweek the batch command but it seems to work perfectly well many thanks
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

Featured Post

Free Tool: Path Explorer

An intuitive utility to help find the CSS path to UI elements on a webpage. These paths are used frequently in a variety of front-end development and QA automation tasks.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now