How to extract text from an .XPS file using VBA
Posted on 2010-09-23
I am connected to Corporate via VPN. I cannot install ANY software. I am trying to "screen scrape" from a web page. I can copy-paste, but I need to save up to 5 screens for each of 2,000 customers. I can print to a file, but ONLY to a .XPS file. I am further constrained in that I can only use Internet Explorer 7 (drat, no Greasemonkey).
If I rename the file from .XPS to .ZIP, I can see all of the internal files and folders. I've tried to manually strip the text from the "Glyphs" nodes I found in one of the XML files, but the result was less than satisfactory: sentences came back as individual words, and all of the text was duplicated.
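For reference, here is a rough sketch of how the manual stripping could be done in VBA instead. It reads one page part that has already been extracted from the renamed .ZIP and collects the UnicodeString attribute of each Glyphs element. Several things here are assumptions: that MSXML 6.0 can be created through CreateObject on this machine, that the page parts are the .fpage files inside the package, and the names ExtractPageText and the sample path, which are hypothetical and used only for illustration.

```vba
' Sketch: collect the UnicodeString attribute of every <Glyphs> element
' in one XPS page part (a .fpage file already extracted from the renamed .ZIP).
' Assumption: MSXML 6.0 is installed. ExtractPageText is a hypothetical
' helper name, not part of any library.
Option Explicit

Public Function ExtractPageText(ByVal fpagePath As String) As String
    Dim doc As Object          ' MSXML2.DOMDocument60, late bound
    Dim glyphNodes As Object   ' node list of <Glyphs> elements
    Dim node As Object         ' one <Glyphs> element
    Dim textRun As Variant     ' attribute value, or Null if absent
    Dim result As String

    Set doc = CreateObject("MSXML2.DOMDocument.6.0")
    doc.async = False
    doc.validateOnParse = False

    If Not doc.Load(fpagePath) Then
        Err.Raise vbObjectError + 1, "ExtractPageText", _
                  "Could not parse " & fpagePath & ": " & doc.parseError.reason
    End If

    ' Matches on the tag name as written in the file,
    ' so no namespace setup is needed for this lookup.
    Set glyphNodes = doc.getElementsByTagName("Glyphs")

    For Each node In glyphNodes
        textRun = node.getAttribute("UnicodeString")
        If Not IsNull(textRun) Then
            ' Join runs with a space, in document order.
            If Len(textRun) > 0 Then result = result & textRun & " "
        End If
    Next node

    ExtractPageText = Trim$(result)
End Function
```

It could be called from the Immediate window with something like Debug.Print ExtractPageText("C:\Temp\xps\Documents\1\Pages\1.fpage"), where the path is only a guess at the layout. Because each Glyphs element tends to hold a short run of text, this reproduces the word-by-word output described above, and it does nothing about the duplicated text.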
I am looking for a way to automate this copy-paste nightmare. I can use Office 2007 (either Access or Excel). If I knew how, I would attach to the Internet Explorer process via the Windows API, but alas, my knowledge in this area is limited.
I am a seasoned veteran of the I.T. world (meaning I remember Xenix). You can throw technical stuff at me and I will not recoil from it.
I will do ANYTHING to overcome this dilemma, within the restrictions placed upon me by Corporate. And no, They will not grant me authority to do anything. As far as They are concerned, I do not exist.
I am out of the office until 6:00 PM this evening, so please do not think I am ignoring your valued responses.