Error with Apache nutch installation on windows 7

Hello All,

I have installed apache nutch 2.1 in windows7 and am using CYGWIN.

I have the following environment variables set:

JAVA_HOME : C:\Java\jdk1.7.0_07
NUTCH_HOME:  C:\apache-nutch-2.1-src\apache-nutch-2.1
NUTCH_JAVA_HOME: C:\Java\jdk1.7.0_07

When I execute the command, "./bin/nutch crawl urls -depth 3 -topN 5" I get the error below:

Error: Could not find or load main class org.apache.nutch.crawl.Crawler

Is there a permissions error here? Should I set my environment variables differently? Suggestions please?


Bewlow are some of the commands I have executed:

$ set | grep 'HOME'
ANT_HOME='C:\Program Files\ant'
HOME=/home/prasankr
HOMEDRIVE=C:
HOMEPATH='\Users\prasankr'
JAVA_HOME='C:\Java\jdk1.7.0_07'
NUTCH_HOME='C:\apache-nutch-2.1-src\apache-nutch-2.1'
NUTCH_JAVA_HOME='C:\Java\jdk1.7.0_07'


 
$ find ${NUTCH_HOME} -type f -name '*nutch*.jar'
C:\apache-nutch-2.1-src\apache-nutch-2.1/build/apache-nutch-2.1.jar

Executed:
./bin/nutch crawl urls -depth 3 -topN 5 2>&1 | tee nutch.log


Output:
alling nutch job
cygpath: can't convert empty path
after calling nutch job

before nutch conf
C:\Java\jdk1.6.0_45\bin;C:\Users\prasankr\Downloads\vertica-jdk5-6.0.1-0.jar
nutch conf dir
/cygdrive/c/apache-nutch-2.1-src/apache-nutch-2.1/src/conf
/cygdrive/c/apache-nutch-2.1-src/apache-nutch-2.1/src/conf:C:\Java\jdk1.7.0_07/lib/tools.jar
checking cygwin
nutch opts
-Dhadoop.log.dir=C:\apache-nutch-2.1-src\apache-nutch-2.1\src\logs -Dhadoop.log.file=hadoop.log
nutch opts after
-Dhadoop.log.dir=C:\apache-nutch-2.1-src\apache-nutch-2.1\src\logs -Dhadoop.log.file=hadoop.log
executing call
C:\Java\jdk1.7.0_07/bin/java -Xmx1000m -Djavax.xml.parsers.DocumentBuilderFactory=com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderFactoryImpl -Dhadoop.log.dir=C:\apache-nutch-2.1-src\apache-nutch-2.1\src\logs -Dhadoop.log.file=hadoop.log -classpath C:\apache-nutch-2.1-src\apache-nutch-2.1\src\conf;C;C:\Java\jdk1.7.0_07\lib\tools.jar;C:\apache-nutch-2.1-src\apache-nutch-2.1\src\lib\*.jar
Class
org.apache.nutch.crawl.Crawler
Error: Could not find or load main class org.apache.nutch.crawl.Crawler


Thanks,
Prasanna
pkrish80Asked:
Who is Participating?
 
mrcoffee365Connect With a Mentor Commented:
This line:
Error: Could not find or load main class org.apache.nutch.crawl.Crawler

indicates that the classpath for your executable is incorrect.  Try checking your classpath again and the content of the jars in your classpath.
0
 
pkrish80Author Commented:
Also, attached the nutch file.
nutch.txt
0
 
pkrish80Author Commented:
I checked the classpath and still have issues but will look into reinstalling nutch.
0
 
mrcoffee365Commented:
There's no other interpretation of that exception.  Many things can cause it, and it can be hard for new users to track down.  Reinstalling nutch might fix it -- maybe there's an environment variable which wasn't set correctly.
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.