Solved

HTML processing

Posted on 2002-04-18
Last Modified: 2010-03-05
Hi,
I would like to be able to do the following:
Download websites, remove all external links from each site (including banner links), and then use a "viewer" browser to browse the website offline. The "viewer" can be any browser; it doesn't have to be a specially developed one. I would also like any Flash and Java components to stay intact after the links are removed.
So I would basically need a Perl script that runs through the HTML pages, looks for the external links, and removes them.
Question by:psimation
4 Comments
 
LVL 3

Expert Comment

by:DABOMB
So you want the links removed but Flash and Java to stay. What about the images? If you get rid of the banners, that will also kill the images 99% of the time.

--Dabomb
 
LVL 17

Author Comment

by:psimation
Banners aren't really a problem. If you just get rid of the links, the banner's image should still be intact, shouldn't it? It just won't link anywhere, right?
 
LVL 3

Accepted Solution

by:
DABOMB earned 50 total points
The banner image is still loaded by an <IMG SRC> tag, Flash by <EMBED SRC> (often wrapped in an <OBJECT> tag), and links are <A HREF>. The link just wraps around the banner image, so removing the <A> tag leaves the image in place.
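A minimal sketch of that idea follows. The question asked for Perl; this uses Python's standard library instead, purely for illustration. The names `is_external`, `ExternalLinkStripper`, and `strip_external_links` are hypothetical, and the sketch assumes that "external" means an href whose host differs from the site's own host (an exact match, so `example.com` and `www.example.com` would count as different). Only the `<A>` start and end tags are dropped; `<IMG>`, `<EMBED>`, and everything else is written back out.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


def is_external(href, site_host):
    """Assumption: a link is external if it names a host other than site_host."""
    host = urlparse(href).netloc.lower()
    return bool(host) and host != site_host.lower()


class ExternalLinkStripper(HTMLParser):
    """Re-emit HTML, dropping only <a> tags whose href points off-site.

    Whatever the dropped <a> wrapped (text, <img>, <embed>, ...) is kept.
    Start tags are copied verbatim; end tags are re-emitted in lowercase.
    """

    def __init__(self, site_host):
        # convert_charrefs=False so entities such as &amp; pass through as-is
        super().__init__(convert_charrefs=False)
        self.site_host = site_host
        self.out = []
        self.dropped = []  # one bool per open <a>: was its start tag dropped?

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href") or ""
            drop = is_external(href, self.site_host)
            self.dropped.append(drop)
            if drop:
                return
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are copied unchanged
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        # Drop </a> only if the matching <a> was dropped
        if tag == "a" and self.dropped and self.dropped.pop():
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")


def strip_external_links(html, site_host):
    parser = ExternalLinkStripper(site_host)
    parser.feed(html)
    parser.close()
    return "".join(parser.out)


if __name__ == "__main__":
    # Placeholder hosts and file names, not taken from the thread
    page = ('<A HREF="http://ads.example.net/click"><IMG SRC="banner.gif"></A>'
            '<a href="page2.html">next</a><EMBED SRC="intro.swf">')
    print(strip_external_links(page, "www.example.com"))
```

Run over each downloaded page with the site's own host name, this would leave relative links and same-host links working for offline browsing. It does not touch links created by JavaScript or hard-coded inside a Flash movie, which would need separate handling.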
 
LVL 17

Author Comment

by:psimation
OK, I'm going to accept DABOMB's suggestion, but for the record and for the PAQs, it did not solve my problem; I'm giving up on this.