Web crawlers and bots tutorial

Can anyone point me to a good tutorial on how to write web bots and crawlers in java.

Who is Participating?
pellepConnect With a Mentor Commented:
I'm not sure how 'smart' of a crawler you aim to develop, but i think you may be in over your head on this one. I have a friend that works for an ISP who wanted to develop their own search-engine. They had a team of 10 developers working for nearly 4 months before they cancelled the project as beeing too large and costly. All the engines out there (google, webcrawler etc) use their own 'intelligence' and they are not at all keen on letting anyone else in on how they do it. If you have a vague concept of what a link crawler or meta crawler does, you pretty much have your tutorial there. The basic idea is pretty simple, but the implementations can be infinitly complex. I may be wrong, of course. Someone might come up with 'the comprehensive guide to developing search-engines', but I doubt it.
Actually there are a few books out there I like that could give you a good intro. Programming Spiders, Bots, and Aggregators in Java by Jeff Heaton, is one.
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.