Go Premium for a chance to win a PS4. Enter to Win

x
?
Solved

Error with Apache nutch installation on windows 7

Posted on 2014-01-21
4
Medium Priority
?
1,147 Views
Last Modified: 2014-05-12
Hello All,

I have installed apache nutch 2.1 in windows7 and am using CYGWIN.

I have the following environment variables set:

JAVA_HOME : C:\Java\jdk1.7.0_07
NUTCH_HOME:  C:\apache-nutch-2.1-src\apache-nutch-2.1
NUTCH_JAVA_HOME: C:\Java\jdk1.7.0_07

When I execute the command, "./bin/nutch crawl urls -depth 3 -topN 5" I get the error below:

Error: Could not find or load main class org.apache.nutch.crawl.Crawler

Is there a permissions error here? Should I set my environment variables differently? Suggestions please?


Bewlow are some of the commands I have executed:

$ set | grep 'HOME'
ANT_HOME='C:\Program Files\ant'
HOME=/home/prasankr
HOMEDRIVE=C:
HOMEPATH='\Users\prasankr'
JAVA_HOME='C:\Java\jdk1.7.0_07'
NUTCH_HOME='C:\apache-nutch-2.1-src\apache-nutch-2.1'
NUTCH_JAVA_HOME='C:\Java\jdk1.7.0_07'


 
$ find ${NUTCH_HOME} -type f -name '*nutch*.jar'
C:\apache-nutch-2.1-src\apache-nutch-2.1/build/apache-nutch-2.1.jar

Executed:
./bin/nutch crawl urls -depth 3 -topN 5 2>&1 | tee nutch.log


Output:
alling nutch job
cygpath: can't convert empty path
after calling nutch job

before nutch conf
C:\Java\jdk1.6.0_45\bin;C:\Users\prasankr\Downloads\vertica-jdk5-6.0.1-0.jar
nutch conf dir
/cygdrive/c/apache-nutch-2.1-src/apache-nutch-2.1/src/conf
/cygdrive/c/apache-nutch-2.1-src/apache-nutch-2.1/src/conf:C:\Java\jdk1.7.0_07/lib/tools.jar
checking cygwin
nutch opts
-Dhadoop.log.dir=C:\apache-nutch-2.1-src\apache-nutch-2.1\src\logs -Dhadoop.log.file=hadoop.log
nutch opts after
-Dhadoop.log.dir=C:\apache-nutch-2.1-src\apache-nutch-2.1\src\logs -Dhadoop.log.file=hadoop.log
executing call
C:\Java\jdk1.7.0_07/bin/java -Xmx1000m -Djavax.xml.parsers.DocumentBuilderFactory=com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderFactoryImpl -Dhadoop.log.dir=C:\apache-nutch-2.1-src\apache-nutch-2.1\src\logs -Dhadoop.log.file=hadoop.log -classpath C:\apache-nutch-2.1-src\apache-nutch-2.1\src\conf;C;C:\Java\jdk1.7.0_07\lib\tools.jar;C:\apache-nutch-2.1-src\apache-nutch-2.1\src\lib\*.jar
Class
org.apache.nutch.crawl.Crawler
Error: Could not find or load main class org.apache.nutch.crawl.Crawler


Thanks,
Prasanna
0
Comment
Question by:pkrish80
  • 2
  • 2
4 Comments
 

Author Comment

by:pkrish80
ID: 39799006
Also, attached the nutch file.
nutch.txt
0
 
LVL 27

Accepted Solution

by:
mrcoffee365 earned 1500 total points
ID: 39808707
This line:
Error: Could not find or load main class org.apache.nutch.crawl.Crawler

indicates that the classpath for your executable is incorrect.  Try checking your classpath again and the content of the jars in your classpath.
0
 

Author Comment

by:pkrish80
ID: 39883052
I checked the classpath and still have issues but will look into reinstalling nutch.
0
 
LVL 27

Expert Comment

by:mrcoffee365
ID: 39883619
There's no other interpretation of that exception.  Many things can cause it, and it can be hard for new users to track down.  Reinstalling nutch might fix it -- maybe there's an environment variable which wasn't set correctly.
0

Featured Post

What does it mean to be "Always On"?

Is your cloud always on? With an Always On cloud you won't have to worry about downtime for maintenance or software application code updates, ensuring that your bottom line isn't affected.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Basic understanding on "OO- Object Orientation" is needed for designing a logical solution to solve a problem. Basic OOAD is a prerequisite for a coder to ensure that they follow the basic design of OO. This would help developers to understand the b…
We live in a world of interfaces like the one in the title picture. VBA also allows to use interfaces which offers a lot of possibilities. This article describes how to use interfaces in VBA and how to work around their bugs.
This tutorial covers a practical example of lazy loading technique and early loading technique in a Singleton Design Pattern.
This video teaches viewers about errors in exception handling.
Suggested Courses

916 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question