The approach documented in post #a42049716 will work well. In fact, I have working subroutines of all the components described in the post. It would be a matter of combining them into a single program and, of course, testing it thoroughly. It would not be trivial, especially to make sure that it is able to handle error conditions when processing 10,000 PDFs in 500 folders. This is why I mentioned a Gig if the asker does not have the expertise to write the program. But it is very doable using the roadmap in my post.