gateguard

Web crawlers shutting down my site

I have a Tomcat site running on a Windows Server 2008 R2 machine.

My site is no longer responsive.

Running netstat -a yields the results in the attached file.

How do I block all those connections from jamming port 80?

Thanks.
netstata2.txt
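
Before blocking anything, it can help to see which remote hosts hold the most port 80 connections. Below is a minimal sketch that tallies them from netstat output. It assumes the usual four-column TCP layout (protocol, local address, foreign address, state). The function name and the sample addresses are made up for illustration and are not taken from the attached file.

```python
from collections import Counter


def count_port80_peers(netstat_text):
    """Tally remote hosts that have a TCP connection to local port 80."""
    peers = Counter()
    for line in netstat_text.splitlines():
        fields = line.split()
        # Assumed columns: Proto, Local Address, Foreign Address, State
        if len(fields) < 4 or fields[0].upper() != "TCP":
            continue
        local, foreign, state = fields[1], fields[2], fields[3]
        local_port = local.rsplit(":", 1)[-1]
        # Without -n, netstat may print the service name "http" instead of 80.
        if local_port not in ("80", "http"):
            continue
        # Skip the listening socket itself; it is not a client connection.
        if state == "LISTENING":
            continue
        remote_host = foreign.rsplit(":", 1)[0]
        peers[remote_host] += 1
    return peers


# Hypothetical sample output using documentation-only IP ranges.
sample = """\
  TCP    10.0.0.5:80     203.0.113.7:50123    ESTABLISHED
  TCP    10.0.0.5:80     203.0.113.7:50124    ESTABLISHED
  TCP    10.0.0.5:80     198.51.100.9:41000   TIME_WAIT
  TCP    10.0.0.5:3389   192.0.2.44:60001     ESTABLISHED
  TCP    0.0.0.0:80      0.0.0.0:0            LISTENING
"""

for host, n in count_port80_peers(sample).most_common():
    print(host, n)
```

Hosts at the top of the tally are the first candidates to check against known crawler address ranges before deciding what to block.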
SOLUTION
Guy Lidbetter
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
gateguard

ASKER

Thanks, Guy.  I'm going to try these solutions.

I do have a question about the Wikipedia article, though.

The article talks about keeping crawlers out of the site or out of specific folders. What I don't understand is why the crawlers are establishing all those port 80 connections, which is effectively shutting down the site. Is that their intent?

Thanks again.
ASKER CERTIFIED SOLUTION
SOLUTION
Lucas Bishop
Where do I put a robots.txt file? I don't have one right now.
It goes in the root folder of your web site.
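
As a rough illustration, the sketch below shows a small robots.txt and checks how a compliant crawler would read it, using Python's standard urllib.robotparser. The rules, the /private/ path, and the "ExampleBot" name are assumptions made up for the example, not settings from this thread. Crawl-delay is a non-standard directive that some crawlers ignore, and robots.txt only restrains crawlers that choose to obey it. On a default Tomcat layout the site root is typically webapps/ROOT, but that depends on how the site is deployed.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents: ask all crawlers to wait 10 seconds
# between requests and to stay out of /private/.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 10
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# A compliant crawler asks these questions before fetching a URL.
blocked = parser.can_fetch("ExampleBot", "/private/report.html")
allowed = parser.can_fetch("ExampleBot", "/index.html")
delay = parser.crawl_delay("ExampleBot")

print(blocked, allowed, delay)
```

Saving the same three rule lines as a plain text file named robots.txt in the site root is all a well-behaved crawler needs. Badly behaved ones have to be blocked at the firewall or server level instead.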
Hi gateguard,

The Wikipedia page linked in my very first post explains everything about the robots.txt file.

Regards

Guy