Results from Acrobat Paper Capture cause Word to freeze
Posted on 2009-04-21
Acrobat 9 Standard
I've scanned a handful of documents (printed emails, FWIW) to a 10-page PDF. Fairly clean results, all text, no graphics, a bit of formatting to deal with, but nothing that OCR shouldn't be able to handle.
Using Acrobat 9 Standard, I ran OCR Text Recognition from the Document menu.
Trouble is, when I try to extract the results, other applications completely melt down.
Select Text Tool > copy all text to clipboard > paste into new Outlook message > freeze
File > Export > Word Document > open the resulting Word doc > freeze
Have tried sending the resulting Word doc to other computers and they get the same thing. Word starts behaving much the same way as when it tries to open a heavily formatted/poorly formatted/corrupt document.
I can, however, use the Select Text tool to copy out of Acrobat and paste into Notepad, then copy/paste out of there into wherever I want it, but it's tedious. I've also had limited success copying one sentence or one small paragraph at a time directly into Word or Outlook.
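For illustration, here is a rough Python sketch of what the Notepad round trip appears to accomplish: reducing the copied text to plain printable characters. It assumes the freeze comes from formatting or stray control characters that Notepad discards, which is only a guess. The function name `to_plain_text` is made up for this example.

```python
import unicodedata


def to_plain_text(raw: str) -> str:
    """Return raw with control/format characters removed (hypothetical helper).

    Assumption: the problem text contains invisible characters that a
    plain-text editor such as Notepad would not carry over.
    """
    # Normalize Windows and old Mac line endings to "\n".
    text = raw.replace("\r\n", "\n").replace("\r", "\n")

    kept = []
    for ch in text:
        if ch in "\n\t":
            # Keep ordinary line breaks and tabs.
            kept.append(ch)
        elif unicodedata.category(ch).startswith("C"):
            # Drop control, format, private-use, and unassigned characters
            # (for example NUL, form feed, soft hyphen).
            continue
        else:
            kept.append(ch)
    return "".join(kept)
```

Running the text through something like this before pasting would only help if the guess above is right. It does nothing about whatever formatting the Word export itself produces.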
I understand that:
1. OCR is more art than science
2. going from PDF to Word is akin to unscrambling an egg
3. there are products other than Adobe's that handle it with more finesse
...but is there a way to export OCR'd text from Adobe in one step without Office freaking out? Maybe some settings that I'm missing?