can any one tell me logic or techniques to convert pdf to word using php.

Posted on 2008-11-06
Last Modified: 2013-12-13
i need to convert pdf into word document which shoud be embed in web page.

can you suggest me how i can implement that using php

example site
Question by:psarun85
    LVL 44

    Expert Comment

    by:Karl Heinz Kremer
    You cannot do that with PHP. There are a few applictions available that attempt to do that conversion (e.g. from ABBYY and Nuance). You would have to parse a PDF file and then convert it's parts to objects that can be used in a Word file and then re-create the layout. For anything that is an image, you would have to run OCR to find out if you can extract text, determine the font and font size and recreate that text object. Then you have to decide if you can get rid of the image, or if there is more information in the image than what you just extracted as text... All in all, this is not a trivial task, and PHP is not an environment you want to implement that.

    Author Comment

    if it is not possible in php, then can you tell me that in java or

    is it possible by any technologies
    LVL 44

    Accepted Solution

    Any language will cause you problems, because it's not a trivial task by any means. Do you know and understand the PDF format completely (_ALL_ about 1000 pages of the PDF spec)? Do you also have a solid understanding of the Word format? If you cannot answer both questions with a YES!!!, this is not a task you can or should undertake. If you really need to convert from PDF to Word use a 3rd party toolkit that does the conversion for you, and just add the user interface. One option is - I don't have any first hand experience with that package, but I've used other PDF products from BCL.

    Write Comment

    Please enter a first name

    Please enter a last name

    We will never share this with anyone.

    Featured Post

    How your wiki can always stay up-to-date

    Quip doubles as a “living” wiki and a project management tool that evolves with your organization. As you finish projects in Quip, the work remains, easily accessible to all team members, new and old.
    - Increase transparency
    - Onboard new hires faster
    - Access from mobile/offline

    *Adobe Acrobat 9 was used for this article.  Particular steps may vary depending on software versions. Adobe Acrobat has many, many variables that my be utilized to customize your forms for clarity and ease of use. The Form Editing Tool will be y…
    These days socially coordinated efforts have turned into a critical requirement for enterprises.
    In this video, we show how to convert an image-only PDF file into a PDF Searchable Image file, that is, a file with both the image (typically from scanning) and text, which is created in an automated fashion with Optical Character Recognition (OCR) …
    In this fifth video of the Xpdf series, we discuss and demonstrate the PDFdetach utility, which is able to list and, more importantly, extract attachments that are embedded in PDF files. It does this via a command line interface, making it suitable …

    759 members asked questions and received personalized solutions in the past 7 days.

    Join the community of 500,000 technology professionals and ask your questions.

    Join & Ask a Question

    Need Help in Real-Time?

    Connect with top rated Experts

    13 Experts available now in Live!

    Get 1:1 Help Now