Apache nutch 1.15 installing and running issues

无人久伴 提交于 2019-12-11 17:56:10

问题


I am trying to run Apache Nutch 1.15 (local) on Windows 10, I have followed the same steps as mentioned on https://wiki.apache.org/nutch/NutchTutorial and https://wiki.apache.org/nutch/NutchHadoopSingleNodeTutorial. When I try to inject the urls using this command on cygwin : bin/nutch inject crawl/crawldb urls i get this error:

Injector: java.io.IOException: (null) entry in command string: null chmod 
0644 C:\Users\INFO\Desktop\apache-nutch1.15\runtime\local\crawl\crawldb\.locked

when put %HADOOP_HOME% on system path (solution proposed on Apache Nutch error: Injector: java.io.IOException: (null) entry in command string: null chmod 0644)

i get a new error :

Exception in thread "main" java.lang.UnsatisfiedLinkError: org.apache.hadoop.io.nativeio.NativeIO$Windows.access0(Ljava/lang/String;I)Z

by the way as HADOOP_HOME i've tried both hadoop-2.8.0 and winutil folder but same problem.

please help.


回答1:


This is a known issue in nutch. Please find the JIRA and the fix below. If you apply the changes from GIT in your local bin/nutch file, everything works fine again. This will be included as a fix when nutch 1.16 gets released

JIRA: https://issues.apache.org/jira/browse/NUTCH-2639?page=com.atlassian.jira.plugin.system.issuetabpanels%3Aall-tabpanel

FIX: https://github.com/apache/nutch/pull/378/commits/7e4502089ecebd194c75719485b6fce1a65797e9



来源:https://stackoverflow.com/questions/55895737/apache-nutch-1-15-installing-and-running-issues

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!