cacklebunny

asked on

Export MS Word HTML into valid, XHTML table?

We have employees in another department who post data to our website via an online form we'd set up for them.  The form automatically generates basic HTML (paragraph tags, bold tags, italics, etc.).

The employees often want to post long tables that they've created in MS Word.  Unfortunately, if we ask them to save these as HTML files, MS Word generates some pretty hideous code full of unnecessary, Microsoft-specific tags.  The resulting code is usually so large that it crashes our online form, since a form submission can only pass a limited number of bytes at once (no joke, these tables are often quite huge).

Is there any tool out there that can parse MS Word-generated table data and recreate the table in tight, valid XHTML?  We're trying to avoid the time needed to program our own robust MS Word HTML parser, since this is such a low-priority item (though not so low that the subject doesn't come up once every year).
ASKER CERTIFIED SOLUTION
seanpowell

It won't be perfect, but it will be 90% better...
webwoman

Develop a macro, put it in all the normal.dot templates, and show them how to use it. Not only should you be able to format the table, you can convert the quotes, set the bold/italic/etc., and even add styles to the tags.

Then YOU control what gets generated, not MS.
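
A minimal sketch of that "you control the output" idea, assuming the cleanup runs as a Python script on the HTML that Word saved rather than as a Word macro (nobody in this thread posted code, so the names `WordTableCleaner` and `clean_word_table` and the whitelist below are purely illustrative). It keeps only table markup and a few inline tags, drops Word's `class`/`style`/`mso-*` attributes, and leaves the cell text in place. It does not repair unclosed tags, so the result is tighter but not guaranteed to validate as XHTML.

```python
import re
from html import escape
from html.parser import HTMLParser

# Tags kept in the output; anything else (o:p, span, font, ...) is dropped
# but the text inside it is preserved.
KEEP = {"table", "thead", "tbody", "tr", "th", "td",
        "b", "i", "strong", "em", "p", "br"}
# Attributes kept; Word's class, style, width, etc. are discarded.
KEEP_ATTRS = {"colspan", "rowspan"}
# Elements written in self-closing form and never given an end tag.
VOID = {"br"}


class WordTableCleaner(HTMLParser):
    """Hypothetical helper: collects a stripped-down copy of every table."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.depth = 0  # greater than 0 while inside a <table>

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.depth += 1
        if self.depth == 0 or tag not in KEEP:
            return
        kept = "".join(
            f' {name}="{escape(value or "", quote=True)}"'
            for name, value in attrs
            if name in KEEP_ATTRS
        )
        self.out.append(f"<{tag}{kept} />" if tag in VOID else f"<{tag}{kept}>")

    def handle_endtag(self, tag):
        if self.depth and tag in KEEP and tag not in VOID:
            self.out.append(f"</{tag}>")
        if tag == "table" and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            # Collapse runs of whitespace and re-escape &, <, > for XHTML.
            self.out.append(escape(re.sub(r"\s+", " ", data), quote=False))


def clean_word_table(html: str) -> str:
    parser = WordTableCleaner()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)


word_html = (
    '<table class=MsoTableGrid border=1 style="border-collapse:collapse">'
    '<tr style="mso-yfti-irow:0"><td width=100 colspan=2 style="padding:0in">'
    '<p class=MsoNormal><span style="font-family:Arial">A &amp; B'
    '<o:p></o:p></span></p></td></tr></table>'
)
print(clean_word_table(word_html))
# <table><tr><td colspan="2"><p>A &amp; B</p></td></tr></table>
```

Whitelisting (keep only what you name) rather than blacklisting Word's tags means new Microsoft-specific markup gets dropped by default, which also keeps the posted form data small.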
cacklebunny

ASKER

Thanks, Sean --at last something that works! :)