Link to home
Start Free TrialLog in
Avatar of Lev Seltzer
Lev SeltzerFlag for Israel

asked on

Create a publication in Word from a HTML data

I have an MSSQL database (which I can easily export to Access) that has 2000 questions and answers. The data was entered via a browser and contains simple HTML codes, such as <p>, <b> and <em>. I want to merge this data into a publication in MS Word, with the HTML codes converted to the appropriate display. Is there a way to do this?
Avatar of Kim Walker
Kim Walker
Flag of United States of America image

View the page in your browser then save page as...web page complete. Launch MS Word and open the web page you just saved.

MS Word is capable of reading and writing HTML but it is somewhat limited. You may have to make some adjustments after opening the page in MS Word.
Avatar of Lev Seltzer

ASKER

I have 2000 questions in HTML code. I can't open and close each one.
Are they each on a separate page? Can't you view multiple questions on a page?

Do you have them on a local computer or are they on the internet? Do you have access to the site source files?

Re-reading the question I wonder if I understand what you're saying.
The data was entered via a browser and contains simple HTML codes, such as <p>, <b> and <em>.
Can you view the questions in a browser?
I can easily view the questions - even all 2000 questions - in a browser. They appear in a table format. I want to merge this data into word in a different format.
You should be able to reformat after you import into Word.
Good. How can I reformat?
I don't mind running a macro after the merge, thereby converting all <b> to bold, <I> to italic, etc. I just don't know how to do it.
When you view a page in your browser with all the questions on one page. Aren't the questions already formatted? You said they have HTML tags in them. Or are you seeing the HTML tags with the questions?
The questions are perfect in the browser.
But I do not want to take them from there.
I want to connect to the database directly and I want to retain the simple formatting rules, which I would lose if I connected to (or converted from) the data visible in the browser.

I think I am going no where here, and will try to create a different question.
You shouldn't lose anything. Word recognizes all the HTML mark up. In fact you can create a web page in Word but I wouldn't recommend it. If you view the page in your browser, save it as a complete .html file. You can open it in Word and most of the formatting would be retained. There might be some minor tweaks required, but they should be minor.
When merging a datafile whos entries contain formatting to a new document, MS Word will NOT include the formatting. it will be lost.
I haven't been talking about a datafile or new document.
The questions are perfect in the browser.
You have said you can view a web page with all the questions perfectly in the browser. If you save that web page document as a .html or .htm file to your local hard disk, you can open it in Word and the formatting you saw in the web page document will be retained to a certain degree. There are some complex properties that will not be retained but certainly any text formatting properties will be retained. I do this frequently to preview what an HTML email will look like in Outlook.

Can you provide a link to the web page that shows the questions? Or at least some of the questions?

Other than this, I am not aware of a method in Word of retrieving the formatted questions directly from the database. I would not be surprised, though, if there was a way in MSSQL to export the data to a Word document. But I doubt if it would render the HTML codes in the data.
You are correct that I could view the data in a browser and then save the table in MS Word. It would be saved perfectly.

In fact, I have been doing that for a few years.

But now, I want to radically format the data and use it to publish a book. I do not want a table any longer. In a normal case, MS Word "mailmerge" would be a perfect solution. But here, I just end up with HTML tags such as <b> in the end product. I am now investigating, via a different ticket, if there is a way to write a MACRO to convert <b> (etc.) to BOLD and remove the tags. So far, no luck.
ASKER CERTIFIED SOLUTION
Avatar of Kim Walker
Kim Walker
Flag of United States of America image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
I have hired a programmer to write a macro to convert the text. Thank you.