Nutch 2.2.1 doesnt continue after Injector job

删除回忆录丶 提交于 2019-12-29 09:29:43

问题


I am learning nutch and trying to carawl as per this tutorial .I am working on an ubuntu machinewith bash shell. But when I run the script, the execution happens, but nothing happens after ,

InjectorJob: starting at 2014-03-23 09:28:50
InjectorJob: Injecting urlDir: urls/seed.txt

I have waited for hours, I tried running the same with sudo. The same issue occurs. I have tried with default urls given in the tutorial as well. What can be the probable errors?


回答1:


What was missing was I didnt add Proxy and port details in the nutch-site.xml, as I was accessing through proxy. setting up the same for Ant or JVM is not enough



来源:https://stackoverflow.com/questions/22586950/nutch-2-2-1-doesnt-continue-after-injector-job

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!