Merging 10,000 PDFs into ONE searchable file

Last Modified: 2017-04-03
This is a one-off exercise:

I have 10,000 PDFs in about 500 folders.
Each PDF is named after a person and contains a photo of that person.

Example:
johnsmith.pdf has a picture of John Smith.

I want to start at a high-level folder and drill down into all lower folders looking for PDFs.

REQUIREMENT: One large PDF containing all the smaller ones.

Also, I need to be able to search the PDF and find the photo of "John Smith" or whoever.

Joe Winograd, Developer

Commented:
The approach documented in post #a42049716 will work well. In fact, I have working subroutines for all of the components described in that post. What remains is combining them into a single program and, of course, testing it thoroughly. That is not trivial, especially making sure the program handles error conditions while processing 10,000 PDFs in 500 folders. This is why I mentioned a Gig, in case the asker does not have the expertise to write the program. But it is very doable using the roadmap in my post.
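
For readers who want a starting point: the roadmap post referenced above is not reproduced in this thread, so what follows is not that roadmap. It is only a minimal Python sketch of one possible way to do the job, assuming the third-party pypdf library is installed. The function names `find_pdfs` and `merge_pdfs` are made up for illustration. The sketch walks the folder tree, appends each PDF to one output file, adds a bookmark named after the source file (so johnsmith.pdf becomes a "johnsmith" bookmark), and collects the files that fail instead of stopping on them.

```python
import os
from pathlib import Path


def find_pdfs(root):
    """Return every .pdf path under root (recursively), sorted by file name."""
    found = []
    for folder, _subfolders, files in os.walk(root):
        for name in files:
            if name.lower().endswith(".pdf"):
                found.append(Path(folder) / name)
    # Sort by the name without extension so bookmarks come out alphabetically.
    return sorted(found, key=lambda p: (p.stem.lower(), str(p)))


def merge_pdfs(root, output_path):
    """Merge all PDFs under root into output_path, one bookmark per source file.

    Returns a list of (path, error) pairs for files that could not be merged.
    """
    # Assumption: pypdf is installed (pip install pypdf). It is imported here
    # so that find_pdfs can be tried on its own without the library.
    from pypdf import PdfWriter

    output_path = Path(output_path).resolve()
    writer = PdfWriter()
    failures = []
    for pdf in find_pdfs(root):
        if pdf.resolve() == output_path:
            continue  # do not merge an earlier output file into itself
        try:
            # outline_item adds a bookmark pointing at the first appended page.
            writer.append(str(pdf), outline_item=pdf.stem)
        except Exception as exc:  # damaged or encrypted file: record and go on
            failures.append((pdf, exc))
    with open(output_path, "wb") as out:
        writer.write(out)
    return failures
```

A call such as `merge_pdfs("C:/photos", "C:/all_people.pdf")` (both paths hypothetical) would produce the combined file and return the list of problem files for review. Note that bookmarks are found through the viewer's bookmark panel, not through text search. Because the pages are photos, making Ctrl+F find "John Smith" would also require putting the name on each page as text, which this sketch does not do.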