  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 141

HTML processing

Hi,
I would like to be able to do the following:
Download websites, remove all external links from them (including banner links), and then browse the result offline in a "viewer". The viewer can be any ordinary browser; it doesn't have to be specially developed. I would also like any Flash and Java components to stay intact after the links are removed.
So I would basically need a Perl script that runs through the HTML pages, looks for the external links, and removes them.
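
As a rough illustration of what "external" could mean here, the sketch below treats a link as external when it resolves to a different host than the page it appears on. It is written in Python rather than Perl, uses only the standard library, and the function name `is_external` and the example URLs are assumptions made for illustration, not part of the original question.

```python
from urllib.parse import urljoin, urlparse


def is_external(href, page_url):
    """Return True if href resolves to a different host than page_url."""
    # Resolve relative links (e.g. "page2.html") against the page's own URL.
    target = urlparse(urljoin(page_url, href))
    # Assumption: non-HTTP schemes such as mailto: are not counted as external.
    if target.scheme not in ("http", "https"):
        return False
    return target.netloc.lower() != urlparse(page_url).netloc.lower()
```

With a page at `http://example.com/index.html`, a relative link such as `page2.html` would be kept, while a link to a different host would be flagged for removal.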
Asked by: psimation
1 Solution
 
DABOMB Commented:
So you want the links removed but Flash and Java to stay. How about the images? If you get rid of the banners, it will also kill the images 99% of the time.

--Dabomb
 
psimation (Author) Commented:
Banners aren't really a problem. If you just get rid of the links, the banner's image should still be intact, shouldn't it? It just won't link anywhere, right?
 
DABOMB Commented:
The banner is still loaded by an <IMG SRC> tag, Flash by <EMBED SRC>, and links are <A HREF>. The link just wraps the banner image, so removing the <A HREF> leaves the image in place.
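
The tag distinction above can be sketched in code: drop only the <A HREF> wrapper when it points to another host, and pass <IMG> and <EMBED> tags through untouched. This is a minimal Python sketch (not Perl) using the standard library's `html.parser`; the class name `ExternalLinkStripper`, the function `strip_external_links`, and the same-host test for "external" are assumptions for illustration, and real-world pages with broken markup may need more care.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class ExternalLinkStripper(HTMLParser):
    """Re-emit HTML, omitting <a> tags whose href leaves the page's host."""

    def __init__(self, page_url):
        # Keep entities such as &amp; as they are instead of decoding them.
        super().__init__(convert_charrefs=False)
        self.page_url = page_url
        self.host = urlparse(page_url).netloc.lower()
        self.out = []
        self.dropped = []  # one flag per open <a>: was its start tag dropped?

    def _is_external(self, href):
        target = urlparse(urljoin(self.page_url, href))
        return (target.scheme in ("http", "https")
                and target.netloc.lower() != self.host)

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            drop = href is not None and self._is_external(href)
            self.dropped.append(drop)
            if drop:
                return  # skip the opening tag, keep whatever it wraps
        # Everything else (img, embed, applet, ...) is copied verbatim.
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        # Skip the </a> that closes a dropped opening tag.
        if tag == "a" and self.dropped and self.dropped.pop():
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")


def strip_external_links(html, page_url):
    parser = ExternalLinkStripper(page_url)
    parser.feed(html)
    parser.close()
    return "".join(parser.out)
```

Because only the anchor tags are omitted, a banner image that was wrapped in an external link stays on the page but is no longer clickable, and links within the same site keep working for offline browsing.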
 
psimation (Author) Commented:
OK, I'm going to accept DABOMB's suggestion, but for the record, and for the PAQs, it did not solve my problem. I'm giving up on this.