I have a very large set of assorted PDF files. They contain searchable text, but the text is filled with errors. If I save a page from the PDF as an image, and use my OCR software on it, I get a much better result. So, I would like to re-OCR all of the files — but first I need to "flatten" all of the text objects in the PDFs, since none of my OCR tools will overwrite any existing text.
I do have Acrobat X and XI Pro, and I've tried using a batch action to strip the text and rerun OCR, but anytime the program encounters an error, it interrupts the process with a dialog box. I searched for a way to prevent this, but there does not appear to be one.
So, the way I see it, I need one of three things:
1. A way to force Acrobat to skip over errors in batch actions and process the remaining files. I could swear you used to be able to do this.
2. A batch OCR tool, free or paid, that will remove and replace all existing text objects.
3. A tool to batch-flatten all text objects in a large set of PDFs (so I can then run them through OCR). I've found software that looks related, but everything seems to either delete text, which I don't want; or else it flattens form fields, images, etc. but does not mention text.
Any of the three of these would solve my problem. I'm open to other suggestions too, of course -- any advice is greatly appreciated!