eng40490
asked on
pdf->ascii convert??
convert an *entire* adobe pdf file into ascii
any software? any technique? in C or any other language.
i know there are some perl modules that parse the informational headers (which are already in ascii).
any software? any technique? in C or any other language.
i know there are some perl modules that parse the informational headers (which are already in ascii).
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
ASKER
i tried all the email solutions. they all failed in the same place -- they converted 'Offer' to 'oxxxxer' whre xxxx is some funny character.
seems like that's the state of the art in pdf->txt conversion.
seems like that's the state of the art in pdf->txt conversion.
Hmm!!
Even the adobe site gave you this problem??
did it do this for all chars or some??
let me know
Even the adobe site gave you this problem??
did it do this for all chars or some??
let me know
ASKER
Global Oering
the O,cial List of the SES
i found 5 instances of the first line and 1 instance of the 2nd. all other occurances of 'F' are converted.
i used pdf2txt@adobe.com, pdf2html@adobe, pdf2txt@sun.trace.wisc.edu .
i used 2 pdf documents.
all suffered from the same problem.
the O,cial List of the SES
i found 5 instances of the first line and 1 instance of the 2nd. all other occurances of 'F' are converted.
i used pdf2txt@adobe.com, pdf2html@adobe, pdf2txt@sun.trace.wisc.edu
i used 2 pdf documents.
all suffered from the same problem.
i would suggest that you post this problem to adobe and bring this to their notice. Who knows, this might be a known bug, due to version compability problem, or something!!
Rgds
Rgds
ASKER
actually only 1 pdf docs sufferred from the problem. i just sent another for conversion and it's ok. so looks like the first pdf document has something unusual.
was the problematic PDF file created using a different ver. of acrobat than the one which worked fine?
does the problematic PDF file have sp./international characters in it??
pl. let me know.
does the problematic PDF file have sp./international characters in it??
pl. let me know.
ASKER
can't ask the author/creator of the doc. no special or international character in the problematic *words*. i did not read every word of the doc so can't say about the entire doc.
sorry, cant think of anything else that might cause the problem. :-(
if so, pl let me know of the solution you used?
Thanks