[Okta Webinar] Learn how to a build a cloud-first strategyRegister Now

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 1803
  • Last Modified:

Convert Word 2003 to clean HTML code without the bloat

Hi

I have a doc I edit in Word 2003 and when I convert it to HTML using the Word 2003 Save as dialog it saves a massive HTML file with all sorts of nasty crap in it as well.

Can anyone recommend a good, free convertor (or method) that actually writes clean HTML and not pages of bloatcode, filled with irrelevant and unwanted additional tags.

cheers

JamesDS
0
JamesDS
Asked:
JamesDS
  • 5
  • 3
  • 3
  • +1
3 Solutions
 
webwomanCommented:
I don't know of one, but there IS something you can do.

Write a macro that adds valid HTML code. It will take some time to set up, but once you do it, you never get crappy HTML code again.

Or you can save as a text file, and add code yourself.
0
 
stu215Commented:
Ponder, Macromedia offers a free trial of their software & i don't know what functions they let you use, but Dreamweaver has an option to cleanup code that works very nicely, and they have a utility built in to specifically clean up Word HTML as its so nasty....

Free 30 Day Trial : http://www.macromedia.com/cfusion/tdrc/index.cfm?product=dreamweaver

~Stu :-)
0
 
stu215Commented:
-- If your looking for a more permanent solution, you may try looking for a perl module or something of the sort from one of the large perl archives (open source/free stuff -- by now im sure someone has made a module to do it) :

CPAN - http://www.cpan.org/

~Stu :-)
0
Independent Software Vendors: We Want Your Opinion

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

 
stu215Commented:
Some Clean up utilities [ PERL ]:

Pretty HTML: http://webdesign.about.com/library/weekly/aa012003a.htm

TIDY : http://sourceforge.net/projects/tidy
*** Ive heard other people using this one, and its sposed to work pretty well, not that i've tried it myself.  It does have good documentation though...

--------------------------------------------------
Another Free Script Archive u can search :

http://sourceforge.net

~Stu :-)
0
 
seanpowellCommented:
0
 
JamesDSAuthor Commented:
Seanpowell

That is exactly what I wanted, but the Microsoft solution only installs on the older version of word.

I would rather the solution integrated into Word as the process is already a pain if I have to do a number of files.

Cheers

JamesDS
0
 
seanpowellCommented:
I should have checked that, but I didn't see it in the online documentation :-(

Word 2003 now contains an integral version - the problem is that I don't know if it's as good as it needs to be.
Try this:

1. Open a normal word doc that you've previosuly saved as an html file
2. On the File menu, and click Save As.
3. Enter a new name for the file in the File name box.
4. In the Save as type box, click Web Page, Filtered.
5. Click Save.
0
 
JamesDSAuthor Commented:
Seanpowell

This is what I use now, but the file it creates is about twice the size it needs to be
Thanks for the help anyway
Cheers

JamesDS
0
 
seanpowellCommented:
No problem JamesDS - I would think that the Office Filter should work on 2003 - I would try it at least. You can always uninstall it.
0
 
JamesDSAuthor Commented:
i did, it has a right strop-on and complains it can't find office

in a word - arse!

Can we say that here?

Thank again, I'll split the points to all who helped

Cheers

JamesDS
0
 
seanpowellCommented:
Yes you can say that - but only if you happen to be discussing MS at the time. I think.
Maybe not...

We'll let it go this time :-)
0

Featured Post

Free Tool: Subnet Calculator

The subnet calculator helps you design networks by taking an IP address and network mask and returning information such as network, broadcast address, and host range.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

  • 5
  • 3
  • 3
  • +1
Tackle projects and never again get stuck behind a technical roadblock.
Join Now