Patrick O'Dea asked:

Merging 10,000 PDFs into ONE searchable file

This is a one-off exercise:

I have 10,000 PDFs in about 500 folders.
Each PDF is named after a person and contains a photo of that person.

For example, johnsmith.pdf has a picture of John Smith!

I want to start at a top-level folder and drill down into all lower folders looking for PDFs.

REQUIREMENT: One large PDF containing all the smaller ones.

Also, I need to be able to search the PDF and find the photo of "John Smith" or whoever.
Document Imaging, Adobe Acrobat, PDF

8/22/2022 - Mon
Joe Winograd

The approach documented in post #a42049716 will work well. In fact, I have working subroutines for all the components described in that post. It would be a matter of combining them into a single program and, of course, testing it thoroughly. That is not trivial, especially making sure the program handles error conditions while processing 10,000 PDFs in 500 folders. This is why I mentioned a Gig in case the asker does not have the expertise to write the program. But it is very doable using the roadmap in my post.
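
For readers who cannot see post #a42049716, here is a minimal Python sketch of one possible way to do this kind of merge. It is not the roadmap from that post. It assumes the third-party pypdf library is installed, and the function names collect_pdfs and merge_pdfs are made up for illustration. It walks the folder tree, appends each PDF to one output file, and adds a bookmark named after the file (for example "johnsmith"), so a person can be located through the bookmarks. It also assumes the pages are only photos, in which case a text search of the page content would not find the name.

```python
import os


def collect_pdfs(root):
    """Return (title, path) pairs for every PDF under root, sorted by title."""
    found = []
    for folder, _subdirs, files in os.walk(root):
        for name in files:
            stem, ext = os.path.splitext(name)
            if ext.lower() == ".pdf":
                # The file name without extension becomes the bookmark title.
                found.append((stem, os.path.join(folder, name)))
    return sorted(found, key=lambda item: (item[0].lower(), item[1]))


def merge_pdfs(root, output_path):
    """Merge every PDF under root into output_path, one bookmark per file.

    Returns a list of (path, error message) for files that could not be added.
    """
    from pypdf import PdfWriter  # third-party, assumed installed: pip install pypdf

    writer = PdfWriter()
    failures = []
    for title, path in collect_pdfs(root):
        try:
            writer.append(path, outline_item=title)
        except Exception as exc:  # damaged or encrypted file: record it, keep going
            failures.append((path, str(exc)))
    with open(output_path, "wb") as out:
        writer.write(out)
    return failures
```

A call such as merge_pdfs("C:/photos", "C:/merged/all.pdf") (both paths hypothetical) would return the list of files that were skipped, which is worth reviewing after a run over 10,000 files. Keep the output file outside the folder tree being scanned so a rerun does not pick up the previous result.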