[Okta Webinar] Learn how to a build a cloud-first strategyRegister Now

x
?
Solved

Nutch error: crawled already exists

Posted on 2008-11-20
6
Medium Priority
?
761 Views
Last Modified: 2013-12-09
I am using the command below in nutch.
$ bin/nutch crawl urls -dir crawled -depth 3 -threads 4

Then get the erroras below.
Exception in thread "main" java.lang.RuntimeException: crawled already exists.
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:85)

May you give any suggestion?
0
Comment
Question by:turbot_yu
  • 3
  • 3
6 Comments
 
LVL 86

Accepted Solution

by:
CEHJ earned 2000 total points
ID: 23003290
Try
$ rm -Rf crawled
$ $ bin/nutch crawl urls -dir crawled -depth 3 -threads 4

Open in new window

0
 

Author Comment

by:turbot_yu
ID: 23003334
Great, start working,

What does the magic word means. Thanks
0
 
LVL 86

Expert Comment

by:CEHJ
ID: 23003545
nutch, whatever it is, apparently expects to create its own directory to put its results in. The first command empties the directory and deletes it
0
Concerto Cloud for Software Providers & ISVs

Can Concerto Cloud Services help you focus on evolving your application offerings, while delivering the best cloud experience to your customers? From DevOps to revenue models and customer support, the answer is yes!

Learn how Concerto can help you.

 

Author Comment

by:turbot_yu
ID: 23003572
Thanks  a lot, I am quite new to nutch, is there any good and simple tutorial materials, thanks.
0
 
LVL 86

Expert Comment

by:CEHJ
ID: 23003619
Never heard of it sorry ;-)
0
 

Author Comment

by:turbot_yu
ID: 23003634
Then any web, how to learn it ya.
0

Featured Post

Important Lessons on Recovering from Petya

In their most recent webinar, Skyport Systems explores ways to isolate and protect critical databases to keep the core of your company safe from harm.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Ready to get certified? Check out some courses that help you prepare for third-party exams.
The first step to building an amazing About page is to figure out what you want the page to say about your company. You then must grab the attention of the reader, boast a bit, tell a story and let others brag about you. With a little bit of thought…
This theoretical tutorial explains exceptions, reasons for exceptions, different categories of exception and exception hierarchy.
This Micro Tutorial will demonstrate how to add subdomains to your content reports. This can be very importing in having a site with multiple subdomains.
Suggested Courses
Course of the Month19 days, 3 hours left to enroll

834 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question