PDF to HTML or XML
Posted on 2002-07-25
I've read here somewhere, that by converting PDF to postscript, GSView could concert to HTML? But how? I cannot find it.
I want to convert some PDF's to html in some way. It dosen't matter if i have to convert to XML and program the xml to HTML.
Can i read a pdf i som PDF script language, and write a parser, tha extracts what i need?
How does i read the PDF? Is there a DOM?