Solved

Web crawlers and bots tutorial

Posted on 2002-03-20
2
1,628 Views
Last Modified: 2012-08-13
Can anyone point me to a good tutorial on how to write web bots and crawlers in java.

Thanks.
0
Comment
Question by:cvcarson
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
2 Comments
 
LVL 4

Accepted Solution

by:
pellep earned 100 total points
ID: 6884228
I'm not sure how 'smart' of a crawler you aim to develop, but i think you may be in over your head on this one. I have a friend that works for an ISP who wanted to develop their own search-engine. They had a team of 10 developers working for nearly 4 months before they cancelled the project as beeing too large and costly. All the engines out there (google, webcrawler etc) use their own 'intelligence' and they are not at all keen on letting anyone else in on how they do it. If you have a vague concept of what a link crawler or meta crawler does, you pretty much have your tutorial there. The basic idea is pretty simple, but the implementations can be infinitly complex. I may be wrong, of course. Someone might come up with 'the comprehensive guide to developing search-engines', but I doubt it.
0
 

Expert Comment

by:polkadot
ID: 11982945
Actually there are a few books out there I like that could give you a good intro. Programming Spiders, Bots, and Aggregators in Java by Jeff Heaton, is one.
0

Featured Post

PeopleSoft Has Never Been Easier

PeopleSoft Adoption Made Smooth & Simple!

On-The-Job Training Is made Intuitive & Easy With WalkMe's On-Screen Guidance Tool.  Claim Your Free WalkMe Account Now

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Java contains several comparison operators (e.g., <, <=, >, >=, ==, !=) that allow you to compare primitive values. However, these operators cannot be used to compare the contents of objects. Interface Comparable is used to allow objects of a cl…
Introduction Java can be integrated with native programs using an interface called JNI(Java Native Interface). Native programs are programs which can directly run on the processor. JNI is simply a naming and calling convention so that the JVM (Java…
Viewers learn about the third conditional statement “else if” and use it in an example program. Then additional information about conditional statements is provided, covering the topic thoroughly. Viewers learn about the third conditional statement …
This tutorial covers a practical example of lazy loading technique and early loading technique in a Singleton Design Pattern.

717 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question