PDF File Parsing - Page Setup Retrieval (and Manipulation)
Does anyone know of a way, using VB.Net, to open and parse a large library of PDF files (thousands of them) programmatically and extract each file's current page settings, primarily the paper/page size?
We have a large number of files that were generated by CAD software. Some have been reformatted to A4 paper size and some have not (some are set to print on paper 65" square!). When these oversized files are sent to the printer (an HP printer in this case), it cannot cope with the paper size: it errors the job and blocks the print queue.
What I would like to do is run through the entire library programmatically, extract the page size of each file, and generate an Excel spreadsheet that lists the document name and its page size setting. (The spreadsheet part is easy once I have the details from the PDF!)
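To make the goal concrete, here is a rough sketch of the kind of scan described above. It is written in Python purely for illustration (the logic should port to VB.Net), and the names `media_boxes`, `is_a4` and `write_report` are made up for this example. It assumes the page size appears as a plain `/MediaBox [llx lly urx ury]` entry in uncompressed form, measured in the default PDF unit of 1/72 inch. That assumption will not hold for every file: the entry can sit inside compressed object streams, be inherited from a parent node, or be modified by rotation, so a proper PDF library is likely needed for a reliable answer.

```python
import csv
import re
from pathlib import Path

POINTS_PER_INCH = 72.0      # default PDF user-space unit is 1/72 inch
A4_POINTS = (595.0, 842.0)  # 210 mm x 297 mm, rounded to whole points

# Assumption: the box is written as four plain numbers, not indirect references.
NUMBER = rb"(-?(?:\d+\.?\d*|\.\d+))"
MEDIABOX_RE = re.compile(
    rb"/MediaBox\s*\[\s*" + rb"\s+".join([NUMBER] * 4) + rb"\s*\]"
)


def media_boxes(pdf_bytes):
    """Return (width, height) in points for every /MediaBox in the raw bytes."""
    sizes = []
    for match in MEDIABOX_RE.finditer(pdf_bytes):
        llx, lly, urx, ury = (float(v.decode("ascii")) for v in match.groups())
        sizes.append((abs(urx - llx), abs(ury - lly)))
    return sizes


def is_a4(width, height, tolerance=2.0):
    """True if the size is A4 (either orientation) within `tolerance` points."""
    short, long_ = sorted((width, height))
    return (abs(short - A4_POINTS[0]) <= tolerance
            and abs(long_ - A4_POINTS[1]) <= tolerance)


def write_report(folder, report_path):
    """Write one CSV row per /MediaBox found in each *.pdf under `folder`."""
    with open(report_path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["file", "width_in", "height_in", "is_a4"])
        for pdf in sorted(Path(folder).rglob("*.pdf")):
            for width, height in media_boxes(pdf.read_bytes()):
                writer.writerow([
                    pdf.name,
                    round(width / POINTS_PER_INCH, 2),
                    round(height / POINTS_PER_INCH, 2),
                    is_a4(width, height),
                ])
```

A CSV file opens directly in Excel, so something like `write_report("C:/drawings", "page_sizes.csv")` (both paths are placeholders) would give the document name and page size in a spreadsheet. A 65" square page would show up as 4680 x 4680 points, i.e. 65.0 x 65.0 inches.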
Ideally I would also like to be able to change the page size to A4, but this last step is not critical.
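For the resizing step, the arithmetic at least is simple, whatever tool ends up applying it. The sketch below (Python for illustration only; `fit_to_a4` is a hypothetical helper, not part of any library) works out the uniform scale factor that would shrink a page onto A4 while keeping its aspect ratio. It assumes sizes are given in points and that the A4 orientation should follow the orientation of the original page. It does not touch the PDF itself.

```python
A4_PORTRAIT = (595.0, 842.0)  # A4 in points (1/72 inch), rounded


def fit_to_a4(width, height):
    """Return (scale, target_width, target_height) for fitting a page on A4.

    The scale is uniform, so the drawing keeps its proportions. Landscape
    pages are matched to landscape A4, everything else to portrait A4.
    """
    if width <= 0 or height <= 0:
        raise ValueError("page dimensions must be positive")
    if width > height:
        target_w, target_h = A4_PORTRAIT[1], A4_PORTRAIT[0]  # landscape
    else:
        target_w, target_h = A4_PORTRAIT                     # portrait
    scale = min(target_w / width, target_h / height)
    return scale, target_w, target_h
```

For the 65" square case (4680 x 4680 points) the limiting dimension is the A4 width, so the scale works out to 595 / 4680, roughly 0.127.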