?
Solved

Convert HTML to PDF

Posted on 2006-11-30
14
Medium Priority
?
537 Views
Last Modified: 2013-12-02
i am converting several HTML pages to PDF, but when i do the PDF conversion has page breaks in places that i don't want them.  is there anyway to add a tag to the HTML doc that a PDF converter will recognize as a page break and insert breaks at the specific point?

i am using Adobe Acrobat Professional 7.0 for the conversion.
0
Comment
Question by:scottspivey
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 4
  • 4
  • 4
14 Comments
 
LVL 26

Expert Comment

by:EddieShipman
ID: 18046980
You may be able to use the CSS identifiers page-break-before or page-break-after to do this. I have not tired it, though.
0
 
LVL 1

Author Comment

by:scottspivey
ID: 18046995
EddieShipman ,

i will give it a shot and see what happens.

scott
0
 
LVL 10

Expert Comment

by:Dennis Maeder
ID: 18047902
A 3rd-party freeware converter html2ps (coupled with ps2pd) uses a comment tag to hint a page break. Perhaps this is also used in the adobe converter?

<!--NewPage-->

From http://user.it.uu.se/~jan/html2psug.html
doc-sep
A string of HTML code that will be inserted between the documents when more than one are converted. The default is "<!--NewPage-->", which will cause a page break.

D
0
Industry Leaders: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

 
LVL 1

Author Comment

by:scottspivey
ID: 18047980
EddieShipman,

>>CSS identifiers page-break-before or page-break-after

they did not work.

scott
0
 
LVL 1

Author Comment

by:scottspivey
ID: 18047986
dennis_maeder,

i am looking at that now and will see what happens.

scott
0
 
LVL 10

Expert Comment

by:Dennis Maeder
ID: 18047994
I should add from html2ps:

A few extensions to HTML 4.01 are recognized by html2ps:

A page break can be forced by including any of these in a document:
   <HR class=PAGE-BREAK>
   
   <?page-break>
   
   <!--NewPage-->

and also correct ps2pd above to ps2pdf.

The <HR> method must assume CSS which is a mechanism to actuate page-break-after.
e.g.
<HR style="page-break-after:always ">
or
<HR style="page-break-after:auto">

D


D
0
 
LVL 26

Expert Comment

by:EddieShipman
ID: 18048750
I doubt that Adobe would follow the same conventions.
0
 
LVL 10

Expert Comment

by:Dennis Maeder
ID: 18050906
Eddie,

I have no idea whether adobe is even html aware.
I suspect IE takes html, renders it to postscript and the Adobe plugin takes over from there.

However this should work if the renderer honors CSS.

<HR style="page-break-after:always">

D
0
 
LVL 26

Expert Comment

by:EddieShipman
ID: 18053808
But he wants to convert it to PDF, not PS.
0
 
LVL 1

Author Comment

by:scottspivey
ID: 18053836
dennis_maeder,

Eddie is correct, i do want to convert to PDF not PS, but i wonder if i will have to go through a two step process to do this.  i have found that if i pull the txt file into msword, replace the <PAGE> tags with <PAGE> ^m, adobe regognizes those as page breaks and puts the break where i want it.  but i would rather not have to go through MSWord to get my results.

scott
0
 
LVL 26

Accepted Solution

by:
EddieShipman earned 1000 total points
ID: 18054612
You can do that by automating MSWord. If you need help doing that, post another question. I'm sure someone
will help you out.
0
 
LVL 10

Assisted Solution

by:Dennis Maeder
Dennis Maeder earned 1000 total points
ID: 18055971
Scott,

Free HTML to PDF conversion on the fly happens at  e.g.
http://www.rustyparts.com/pdf.php
which uses html2ps underthe hood and will honor the tags I presented above.

Its also worth taking a look at
http://www.easysw.com/htmldoc/pdf-o-matic.php
which supports <!-- NEW PAGE -->
and many other similar tags

D
0

Featured Post

Nothing ever in the clear!

This technical paper will help you implement VMware’s VM encryption as well as implement Veeam encryption which together will achieve the nothing ever in the clear goal. If a bad guy steals VMs, backups or traffic they get nothing.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

When it comes to write a Context Sensitive Help (an online help that is obtained from a specific point in state of software to provide help with that state) ,  first we need to make the file that contains all topics, which are given exclusive IDs. …
Originally, this post was published on Monitis Blog, you can check it here . In business circles, we sometimes hear that today is the “age of the customer.” And so it is. Thanks to the enormous advances over the past few years in consumer techno…
In this fifth video of the Xpdf series, we discuss and demonstrate the PDFdetach utility, which is able to list and, more importantly, extract attachments that are embedded in PDF files. It does this via a command line interface, making it suitable …
In an interesting question (https://www.experts-exchange.com/questions/29008360/) here at Experts Exchange, a member asked how to split a single image into multiple images. The primary usage for this is to place many photographs on a flatbed scanner…

649 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question