parsing HTML code
Posted on 2004-08-26
hi - has anyone had experience in taking HTML code - searching for all possible graphics paths and changing those paths then resaving the HTML code-?
I am creating a "packager" that the "graphic guys and gals" can create a html page using graphics from several sources (ie paths) across our network
When the HTML file is put in "the packager" it would seach though HTML code and find all external references -copy those files locally and strip the paths off within the HTML code - In other words the HTML would run properly if all the graphic files were in the same directory as the HTML code ( which is what I want)
The program then makes a cab file of this HTML file and all graphic files
* what is this used for?? well we send all types of graphic formats to be displayed on remote advertising machines - but these are all single files (mostly .swf s) - we need to keep this methodology ( 1 file ) so thats why the "packaging")
Ive got the interface done - they can drag and drop files and pick from file chooser
Ive got the cab file maker done
I can 'prolly seach the HTML text for obvious "scr=" and "BACKGROUND=" and " .jpg" and " .gif" ect and copy then strip out the paths but just wondered if anyone has done this kind of thing before ??
(OR IS THERE A WAY THAT A WEB PAGE CAN BE PACKAGED WITH ALL GRAPHICS CONTAINED WITH IN ?) obviously im not a web page programmer :)
thanks in advance