Solved

Convert Word 2003 to clean HTML code without the bloat

Posted on 2004-04-05
Last Modified: 2011-10-03
Hi

I have a doc I edit in Word 2003, and when I convert it to HTML using the Word 2003 Save As dialog, it saves a massive HTML file full of all sorts of nasty crap.

Can anyone recommend a good, free converter (or method) that actually writes clean HTML rather than pages of bloated code filled with irrelevant and unwanted tags?

cheers

JamesDS
Question by:JamesDS
12 Comments
 
LVL 19

Assisted Solution

by:webwoman
webwoman earned 50 total points
ID: 10757244
I don't know of one, but there IS something you can do.

Write a macro that adds valid HTML code. It will take some time to set up, but once you do it, you never get crappy HTML code again.

Or you can save as a text file, and add code yourself.
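The "save as text and add the code yourself" route can be scripted. Here is a minimal sketch in Python, assuming the document was saved from Word as plain text with blank lines between paragraphs; the function name `text_to_html` and the page skeleton are illustrative choices, not anything Word provides.

```python
import html
import re


def text_to_html(text: str, title: str = "Untitled") -> str:
    """Wrap plain text in a minimal HTML page, one <p> per paragraph.

    Assumption: paragraphs are separated by one or more blank lines.
    """
    # Normalise Windows line endings, then split on blank lines.
    blocks = re.split(r"\n\s*\n", text.replace("\r\n", "\n"))
    paragraphs = []
    for block in blocks:
        # Collapse internal line breaks and runs of spaces.
        collapsed = " ".join(block.split())
        if collapsed:
            # Escape &, < and > so the text cannot break the markup.
            paragraphs.append("<p>" + html.escape(collapsed) + "</p>")
    return (
        "<!DOCTYPE html>\n"
        "<html>\n<head>\n<title>" + html.escape(title) + "</title>\n</head>\n"
        "<body>\n" + "\n".join(paragraphs) + "\n</body>\n</html>\n"
    )
```

This keeps only paragraph structure; headings, lists, and tables from the original document would still need to be marked up by hand.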
 
LVL 10

Expert Comment

by:stu215
ID: 10758025
Ponder: Macromedia offers a free trial of their software. I don't know which functions the trial lets you use, but Dreamweaver has an option to clean up code that works very nicely, and it has a built-in utility specifically for cleaning up Word HTML, since it's so nasty....

Free 30 Day Trial : http://www.macromedia.com/cfusion/tdrc/index.cfm?product=dreamweaver

~Stu :-)
 
LVL 10

Expert Comment

by:stu215
ID: 10758065
-- If you're looking for a more permanent solution, you might try looking for a Perl module or something similar in one of the large Perl archives (open source/free stuff; by now I'm sure someone has made a module to do it):

CPAN - http://www.cpan.org/

~Stu :-)
 
LVL 10

Assisted Solution

by:stu215
stu215 earned 100 total points
ID: 10758163
Some Clean up utilities [ PERL ]:

Pretty HTML: http://webdesign.about.com/library/weekly/aa012003a.htm

TIDY : http://sourceforge.net/projects/tidy
*** I've heard of other people using this one, and it's supposed to work pretty well, though I haven't tried it myself. It does have good documentation, though...

--------------------------------------------------
Another free script archive you can search:

http://sourceforge.net

~Stu :-)
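For anyone trying the TIDY suggestion above, here is a rough sketch of driving it from Python. It assumes a `tidy` executable is installed and on the PATH, and that the build supports the `word-2000`, `clean`, `bare`, and `drop-proprietary-attributes` options; check `tidy -help-config` on your copy before relying on any of them. The helper names `build_tidy_command` and `run_tidy` are made up for this example.

```python
import shutil
import subprocess
from typing import List


def build_tidy_command(src: str, dst: str) -> List[str]:
    """Build a tidy command line aimed at Word-generated HTML."""
    return [
        "tidy",
        "-q",                                     # suppress non-essential output
        "--word-2000", "yes",                     # strip Word-specific surplus markup
        "--clean", "yes",                         # replace presentational tags with CSS
        "--bare", "yes",                          # drop Microsoft-specific HTML
        "--drop-proprietary-attributes", "yes",   # remove vendor attributes
        "-o", dst,                                # write the result here
        src,
    ]


def run_tidy(src: str, dst: str) -> int:
    """Run tidy and return its exit code (0 ok, 1 warnings, 2 errors)."""
    if shutil.which("tidy") is None:
        raise FileNotFoundError("tidy executable not found on PATH")
    return subprocess.run(build_tidy_command(src, dst)).returncode
```

A call such as `run_tidy("report.htm", "report-clean.htm")` would leave the original file untouched and write the cleaned copy alongside it.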
 
LVL 31

Expert Comment

by:seanpowell
ID: 10759569
 
LVL 16

Author Comment

by:JamesDS
ID: 10763478
Seanpowell

That is exactly what I wanted, but the Microsoft solution only installs on the older version of Word.

I would rather have a solution integrated into Word, as the process is already a pain when I have to do a number of files.

Cheers

JamesDS
 
LVL 31

Accepted Solution

by:seanpowell
seanpowell earned 100 total points
ID: 10763743
I should have checked that, but I didn't see it in the online documentation :-(

Word 2003 now includes a built-in version - the problem is that I don't know if it's as good as it needs to be.
Try this:

1. Open a normal Word doc that you've previously saved as an HTML file.
2. On the File menu, click Save As.
3. Enter a new name for the file in the File name box.
4. In the Save as type box, click Web Page, Filtered.
5. Click Save.
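If there are a number of files to do, the steps above could in principle be automated through Word's COM interface. This is only a sketch under several assumptions: it needs Windows with Word installed plus the third-party `pywin32` package, it saves each `.htm` next to its `.doc`, and the helper names `plan_conversions` and `convert_all` are invented here. The value 10 is Word's `wdFormatFilteredHTML` constant, which corresponds to "Web Page, Filtered".

```python
from pathlib import Path

WD_FORMAT_FILTERED_HTML = 10  # Word's wdFormatFilteredHTML constant


def plan_conversions(folder):
    """Pair each .doc file in folder with the .htm path it would be saved to."""
    return [(p, p.with_suffix(".htm")) for p in sorted(Path(folder).glob("*.doc"))]


def convert_all(folder):
    """Save every .doc in folder as filtered HTML using Word itself."""
    # Assumption: pywin32 is installed and Word is available on this machine.
    import win32com.client

    word = win32com.client.Dispatch("Word.Application")
    word.Visible = False
    try:
        for src, dst in plan_conversions(folder):
            # Word expects absolute paths.
            doc = word.Documents.Open(str(src.resolve()))
            doc.SaveAs(str(dst.resolve()), WD_FORMAT_FILTERED_HTML)
            doc.Close(False)  # close without prompting to save changes
    finally:
        word.Quit()
```

The output is the same markup the Save As dialog produces, so this saves clicks rather than bytes.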
 
LVL 16

Author Comment

by:JamesDS
ID: 10763997
Seanpowell

This is what I use now, but the file it creates is about twice the size it needs to be.
Thanks for the help anyway.
Cheers

JamesDS
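One way to shrink filtered output further is a post-processing pass over the saved file. The sketch below is a crude regex-based illustration, not a robust HTML cleaner: it assumes the leftover bulk is mostly `<style>` blocks, `class=Mso...` attributes, inline `style` attributes, and `<span>` wrappers, and it deliberately throws all of those away. The name `slim_word_html` is made up for this example.

```python
import re

# Patterns for the markup this sketch assumes is unwanted.
_STYLE_BLOCK = re.compile(r"<style\b.*?</style>", re.I | re.S)
_CLASS_MSO = re.compile(r"""\s+class=(?:"Mso[^"]*"|'Mso[^']*'|Mso[^\s>]*)""", re.I)
_STYLE_ATTR = re.compile(r"""\s+style=(?:"[^"]*"|'[^']*')""", re.I)
_SPAN_TAG = re.compile(r"</?span\b[^>]*>", re.I)


def slim_word_html(markup: str) -> str:
    """Strip Word-specific styling from filtered HTML, keeping the text."""
    markup = _STYLE_BLOCK.sub("", markup)   # embedded stylesheets
    markup = _CLASS_MSO.sub("", markup)     # class=MsoNormal and friends
    markup = _STYLE_ATTR.sub("", markup)    # quoted inline style attributes
    markup = _SPAN_TAG.sub("", markup)      # unwrap every span, keep its content
    return markup
```

Because every span and inline style is removed, any formatting that depended on them (font sizes, colours, language tags) is lost, so it is worth comparing the result against the original in a browser.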
 
LVL 31

Expert Comment

by:seanpowell
ID: 10764171
No problem JamesDS - I would think that the Office Filter should work on 2003 - I would try it at least. You can always uninstall it.
 
LVL 16

Author Comment

by:JamesDS
ID: 10764315
I did; it has a right strop-on and complains it can't find Office.

In a word - arse!

Can we say that here?

Thanks again, I'll split the points among all who helped.

Cheers

JamesDS
 
LVL 31

Expert Comment

by:seanpowell
ID: 10764360
Yes you can say that - but only if you happen to be discussing MS at the time. I think.
Maybe not...

We'll let it go this time :-)
Question has a verified solution.