[Last Call] Learn how to a build a cloud-first strategyRegister Now


can any one tell me logic or techniques to convert pdf to word using php.

Posted on 2008-11-06
Medium Priority
Last Modified: 2013-12-13
i need to convert pdf into word document which shoud be embed in web page.

can you suggest me how i can implement that using php

example site
Question by:psarun85
  • 2
LVL 44

Expert Comment

by:Karl Heinz Kremer
ID: 22897786
You cannot do that with PHP. There are a few applictions available that attempt to do that conversion (e.g. from ABBYY and Nuance). You would have to parse a PDF file and then convert it's parts to objects that can be used in a Word file and then re-create the layout. For anything that is an image, you would have to run OCR to find out if you can extract text, determine the font and font size and recreate that text object. Then you have to decide if you can get rid of the image, or if there is more information in the image than what you just extracted as text... All in all, this is not a trivial task, and PHP is not an environment you want to implement that.

Author Comment

ID: 22901996
if it is not possible in php, then can you tell me that in java or asp.net.

is it possible by any technologies
LVL 44

Accepted Solution

Karl Heinz Kremer earned 2000 total points
ID: 22903566
Any language will cause you problems, because it's not a trivial task by any means. Do you know and understand the PDF format completely (_ALL_ about 1000 pages of the PDF spec)? Do you also have a solid understanding of the Word format? If you cannot answer both questions with a YES!!!, this is not a task you can or should undertake. If you really need to convert from PDF to Word use a 3rd party toolkit that does the conversion for you, and just add the user interface. One option is http://www.pdfonline.com/easyconverter/sdk/ - I don't have any first hand experience with that package, but I've used other PDF products from BCL.

Featured Post

Free Tool: Port Scanner

Check which ports are open to the outside world. Helps make sure that your firewall rules are working as intended.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

The Adobe PDF proprietary file format is recognized as secure and formulated. But these PDF files are also prone to corruption and any external threat like virus attacks, improper storage can hit PDF file integrity.This type of damages can make cruc…
The title says it all. Writing any type of PHP Application or API code that provides high throughput, while under a heavy load, seems to be an arcane art form (Black Magic). This article aims to provide some general guidelines for producing this typ…
Sometimes we receive PDF files that are in the wrong orientation. They may be sideways or even upside down. This most commonly happens with scanned or faxed documents. It is possible to rotate the view of these PDFs with the free Adobe Reader produc…
We often encounter PDF files that are pure images, that is, they do not have text characters, but instead contain only raster graphics. The most common causes of this are document scanning software and faxing software/services that create image-only…
Suggested Courses
Course of the Month18 days, 2 hours left to enroll

830 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question