Go Premium for a chance to win a PS4. Enter to Win

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 412
  • Last Modified:

Changing Word Docs and PDFs to html

I am talking to some people who write  Word docs and publish them as PDF files all the time as attachments to blog posts. They were talking about wanting to do this - the letters/files etc. should be converted to html format so they are readable on the site. SOme of these documents are really long, like over 50 pages.
My first reaction is that this is a BAD idea. Sure, you can save a Word doc as html, but in my experience the html is bloated and crappy.
I know there are program that converts Word to html, but how are they? DO they work? Certainly the CSS on the site won't work if the correct classes and IDs are not applied.
Also, what is the matter with PDFs? Everyone has Adobe Reader.
0
nanharbison
Asked:
nanharbison
4 Solutions
 
Dan CraciunIT ConsultantCommented:
Using Word's save to HTML feature is usually a bad idea, but why aren't they just pasting from Word into the blog's editor?
If the blog uses something like TinyMCE (the default on Wordpress) then they can learn pretty quickly how to do basic formatting, if needed.

HTH,
Dan
0
 
GaryCommented:
My reaction would be stick to PDF, easy to maintain a single format throughout, easy to read offline and small in size (assuming not lots of images etc).
But maybe first try using Google Docs (export to HTML) or a number of the online free convertors and see how the .doc exports.  Will depend a lot on the makeup of the doc.
0
 
Dave BaldwinFixer of ProblemsCommented:
Word and PDF can display things that HTML pretty much can't.  Some formatting, fonts, and layered presentations are difficult if not impossible to duplicate easily in HTML.  In addition, Word and PDF are oriented towards paper and not the screen.
0
Concerto's Cloud Advisory Services

Want to avoid the missteps to gaining all the benefits of the cloud? Learn more about the different assessment options from our Cloud Advisory team.

 
BillDLCommented:
I agree with the others on this one.  PDF (Portable Document Format) is being used exactly as was originally intended, which is sharing of documents in a common format that should be displayed to all readers in a consistent layout.  It is no surprise that most user manuals come as PDF files for this reason.  A Blog (Web Log) is really intended for short entries, not the equivalent of 50 pages in a PDF document.  I would not particularly want to have to scroll down through 50 pages worth of content on a single blog entry.

To say that "Everyone has Adobe Reader" is probably not completely accurate, however.  Malformed PDF files have been used to spread viruses because of the default behaviour of opening inside the browser window via the browser plugin.  This can be a System Administrator's nightmare.  My preference is to always prompt for a "Save" or "Open With Acrobat Reader", but not everybody knows how to configure Acrobat Reader and disable all the potentially risky functionality like allowing JavaScript and allowing executables to be opened from links.  In addition, Google Chrome, which is in use on many smart phones these days, uses its own internal PDF Viewer within the browser.  Saving to the hard drive and then opening is a nuisance to some impatient or uninformed people who prefer to just click a link and have the document open right in the browser, but rendering it (especially with a long document full of images) can be slow and jerky, leading to complaints.

What this comes down to is common sense.  If somebody has a lengthy document that doesn't contain a bunch of cross-links to other resources, and has embedded images that are large enough to see without having a link to pop up a larger image, then PDF is the logical choice over MS Word or posting a long HTML blog entry.

There will always be people who will decide to use something just because it is there, and because recent versions of Word allow you to Save As PDF, they will use this even for short documents that could have just as easily been posted and formatted in the blog's editor.  These people will never be changed and they will always be there ... walking amongst us.
0
 
nanharbisonAuthor Commented:
Thanks for all this feedback! Very helpful.
0
 
BillDLCommented:
Thank you nanharbison
0

Featured Post

Technology Partners: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Tackle projects and never again get stuck behind a technical roadblock.
Join Now