Solved

Configure nutch 1.6 in windows

Posted on 2013-06-05
5
788 Views
Last Modified: 2013-06-08
I have downloaded the nutch "apache-nutch-1.6" from the apache site and extracted in the folder as mentioned in the site.

Now when i execute the nutch command in the command prompt it shows "nutch is not a recognized internal or external command"

Please help me where should i correct to remove th error.

Secondly how to search contents inside  a static website deployed on a server?
Is there any good tutorial for the same?
0
Comment
Question by:Rocking
  • 3
  • 2
5 Comments
 
LVL 26

Expert Comment

by:mrcoffee365
Comment Utility
You need to read the nutch documentation.  There is an O'Reilly book on nutch, too, which would probably help you a great deal.  Downloading software does not install it or make it run.

Nutch will not search anything unless it's running and it has indexed a body of text/documents.
0
 

Author Comment

by:Rocking
Comment Utility
I couldn;t find the O'Reilly book on nutch.
can u pls name the book.
0
 
LVL 26

Accepted Solution

by:
mrcoffee365 earned 500 total points
Comment Utility
You're right -- I typed too quickly.  I was thinking of the Lucene books, which usually mention nutch.  There are books on Lucene, not just from O'Reilly.  And there's the apache tutorial for nutch:
http://nutch.apache.org/tutorial.html
http://wiki.apache.org/nutch/

If you search amazon.com for nutch, it will list some lucene and data mining books which mention nutch.
0
 

Author Comment

by:Rocking
Comment Utility
The tutorial mentioned i have already gone through. thanks btw.
Regarding the problem
"nutch is not a recognized internal or external command"" i have solved and crawled is done successfully
  bin/nutch crawl urls -dir crawl -depth 3 -topN 5.

Now i want to integrate it in tomcat web application.
I have a search button in my website and i need is when user typed in something a new page should appear and displays the results (which have been crawled before) in jsp file created by me.
What are the steps for the above?

I had gone through some websites and come to know that we need a nutch war file prior to be deployed in tomcat which was included in nutch version prior to 1.3.
Above version we need to create the same.
0
 
LVL 26

Expert Comment

by:mrcoffee365
Comment Utility
Glad to help.

Open a new question for your new problem.  Good luck!
0

Featured Post

6 Surprising Benefits of Threat Intelligence

All sorts of threat intelligence is available on the web. Intelligence you can learn from, and use to anticipate and prepare for future attacks.

Join & Write a Comment

Suggested Solutions

[Part 4 of a 6 part series called SEO Basics: 5 SEO Secrets for Creating Content that Drives Traffic (http://www.experts-exchange.com/Web_Development/Internet_Marketing/Search_Engine_Optimization_SEO/A_8369-SEO-Basics-5-SEO-Secrets-for-Creating-Cont…
This code takes an Excel list of URL’s and adds a header titled “URL List”. It then searches through all URL’s in column “A”, looking for duplicates. When a duplicate is found, it is moved to the top of the list. The duplicate URL’s are then highlig…
This tutorial walks through the best practices in adding a local business to Google Maps including how to properly search for duplicates, marker placement, and inputing business details. Login to your Google Account, then search for "Google Mapmaker…
This Micro Tutorial will demonstrate how to add subdomains to your content reports. This can be very importing in having a site with multiple subdomains.

743 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

15 Experts available now in Live!

Get 1:1 Help Now