nutch crawling stops after injector.

拈花ヽ惹草 提交于 2019-12-25 06:21:04

问题


here is my cygwin screen looks...

cygpath: can't convert empty path
Injector: starting at 2014-05-15 16:57:50
Injector: crawlDb: -dir/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Patch for HADOOP-7682: Instantiating workaround file system
Injector: total number of urls rejected by filters: 1
Injector: total number of urls injected after normalization and filtering: 0
Injector: Merging injected urls into crawl db.
Injector: overwrite: false
Injector: update: false
Injector: finished at 2014-05-15 16:57:52, elapsed: 00:00:02

回答1:


Total number of urls injected is 0. That is nothing to crawl.

Injector: total number of urls rejected by filters: 1
Injector: total number of urls injected after normalization and filtering: 0


来源:https://stackoverflow.com/questions/23676969/nutch-crawling-stops-after-injector

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!