Extract multiple PDF files from one PDF file

Bill
Bill used Ask the Experts™
on
We have an external application that creates a PDF file for each customer delivery which contains 2 - 14 PDF delivery documents within that PDF file. Is there a way to open the PDF file in code and extract the PDF documents within that file?
Comment
Watch Question

Do more with

Expert Office
EXPERT OFFICE® is a registered trademark of EXPERTS EXCHANGE®
Fractional CTO
Distinguished Expert 2018
Commented:
I'd look at poppler tools + pdftohtml + libreoffice --headless + related tools.

Likely best to start with Ubuntu Bionic as your platform for this, as tools are plentiful in this runtime environment.

Fairly simple to do in most cases + no 2x PDFs are created the same, especially aggregate PDFs (many PDF files merged into one file).

The easiest way to do this is to capture the individual PDF files before they're merged into the aggregate PDF.

Splitting aggregate PDFs rarely creates individual which are 100% same as originals.
BillBusiness Systems Analyst

Author

Commented:
Thanks David

Do more with

Expert Office
Submit tech questions to Ask the Experts™ at any time to receive solutions, advice, and new ideas from leading industry professionals.

Start 7-Day Free Trial