Solved

Convert Word 2003 to clean HTML code without the bloat

Posted on 2004-04-05
12
1,749 Views
Last Modified: 2011-10-03
Hi

I have a doc I edit in Word 2003 and when I convert it to HTML using the Word 2003 Save as dialog it saves a massive HTML file with all sorts of nasty crap in it as well.

Can anyone recommend a good, free convertor (or method) that actually writes clean HTML and not pages of bloatcode, filled with irrelevant and unwanted additional tags.

cheers

JamesDS
0
Comment
Question by:JamesDS
  • 5
  • 3
  • 3
  • +1
12 Comments
 
LVL 19

Assisted Solution

by:webwoman
webwoman earned 50 total points
Comment Utility
I don't know of one, but there IS something you can do.

Write a macro that adds valid HTML code. It will take some time to set up, but once you do it, you never get crappy HTML code again.

Or you can save as a text file, and add code yourself.
0
 
LVL 10

Expert Comment

by:stu215
Comment Utility
Ponder, Macromedia offers a free trial of their software & i don't know what functions they let you use, but Dreamweaver has an option to cleanup code that works very nicely, and they have a utility built in to specifically clean up Word HTML as its so nasty....

Free 30 Day Trial : http://www.macromedia.com/cfusion/tdrc/index.cfm?product=dreamweaver

~Stu :-)
0
 
LVL 10

Expert Comment

by:stu215
Comment Utility
-- If your looking for a more permanent solution, you may try looking for a perl module or something of the sort from one of the large perl archives (open source/free stuff -- by now im sure someone has made a module to do it) :

CPAN - http://www.cpan.org/

~Stu :-)
0
 
LVL 10

Assisted Solution

by:stu215
stu215 earned 100 total points
Comment Utility
Some Clean up utilities [ PERL ]:

Pretty HTML: http://webdesign.about.com/library/weekly/aa012003a.htm

TIDY : http://sourceforge.net/projects/tidy
*** Ive heard other people using this one, and its sposed to work pretty well, not that i've tried it myself.  It does have good documentation though...

--------------------------------------------------
Another Free Script Archive u can search :

http://sourceforge.net

~Stu :-)
0
 
LVL 31

Expert Comment

by:seanpowell
Comment Utility
0
 
LVL 31

Expert Comment

by:seanpowell
Comment Utility
0
What Security Threats Are You Missing?

Enhance your security with threat intelligence from the web. Get trending threat insights on hackers, exploits, and suspicious IP addresses delivered to your inbox with our free Cyber Daily.

 
LVL 16

Author Comment

by:JamesDS
Comment Utility
Seanpowell

That is exactly what I wanted, but the Microsoft solution only installs on the older version of word.

I would rather the solution integrated into Word as the process is already a pain if I have to do a number of files.

Cheers

JamesDS
0
 
LVL 31

Accepted Solution

by:
seanpowell earned 100 total points
Comment Utility
I should have checked that, but I didn't see it in the online documentation :-(

Word 2003 now contains an integral version - the problem is that I don't know if it's as good as it needs to be.
Try this:

1. Open a normal word doc that you've previosuly saved as an html file
2. On the File menu, and click Save As.
3. Enter a new name for the file in the File name box.
4. In the Save as type box, click Web Page, Filtered.
5. Click Save.
0
 
LVL 16

Author Comment

by:JamesDS
Comment Utility
Seanpowell

This is what I use now, but the file it creates is about twice the size it needs to be
Thanks for the help anyway
Cheers

JamesDS
0
 
LVL 31

Expert Comment

by:seanpowell
Comment Utility
No problem JamesDS - I would think that the Office Filter should work on 2003 - I would try it at least. You can always uninstall it.
0
 
LVL 16

Author Comment

by:JamesDS
Comment Utility
i did, it has a right strop-on and complains it can't find office

in a word - arse!

Can we say that here?

Thank again, I'll split the points to all who helped

Cheers

JamesDS
0
 
LVL 31

Expert Comment

by:seanpowell
Comment Utility
Yes you can say that - but only if you happen to be discussing MS at the time. I think.
Maybe not...

We'll let it go this time :-)
0

Featured Post

Why You Should Analyze Threat Actor TTPs

After years of analyzing threat actor behavior, it’s become clear that at any given time there are specific tactics, techniques, and procedures (TTPs) that are particularly prevalent. By analyzing and understanding these TTPs, you can dramatically enhance your security program.

Join & Write a Comment

Suggested Solutions

Showing your events from Google Calendar in Google Maps Why? I travel all week and I thought it would be ideal if staff in office knew where I was based on my calendar. (OK real reason: my son wanted to see where I would be working, and I thoug…
Building a website can seem like a daunting task to the uninitiated but it really only requires knowledge of two basic languages: HTML and CSS.
In this tutorial viewers will learn how to define a gradient in CSS. Create a new HTML document with an internal stylesheet.: Create a div in CSS and name it Gradient. Define the background as "linear-gradient(to right, #ee3668, black)". Ensure you …
In this tutorial viewers will learn how to embed videos in a webpage using HTML5. Ensure your DOCTYPE declaration is set to HTML5: "<!DOCTYPE html>": Use the <video> tag to insert a video. Define the src as the URL of your video; this is similar to …

744 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

8 Experts available now in Live!

Get 1:1 Help Now