Using Nutch crawler with Solr

Submitted by 亡梦爱人 on 2019-11-28 16:53:06

Question


Can I integrate the Apache Nutch crawler with the Solr index server?

Edit:

One of our devs came up with a solution based on these posts:

  1. Running Nutch and Solr
  2. Update for Running Nutch and Solr

Answer

Yes


Answer 1:


If you're willing to upgrade to Nutch 1.0, you can use the solrindex command, as described in this article by Lucid Imagination: http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/.
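As a rough illustration of what a solrindex invocation looks like, here is a minimal Python sketch that only assembles the command line. The helper name `build_solrindex_command`, the Solr URL, and the `crawl/...` paths are assumptions for illustration, and the argument order (Solr URL, crawldb, linkdb, then segments) is how I recall the Nutch 1.0 usage; run `bin/nutch solrindex` with no arguments to confirm it for your version.

```python
import posixpath


def build_solrindex_command(solr_url, crawl_dir, segments, nutch_bin="bin/nutch"):
    """Assemble the argv list for a Nutch 1.x `solrindex` run.

    Hypothetical helper: the argument order below is an assumption based on
    the Nutch 1.0 usage message, so check it against your installation.
    """
    if not segments:
        raise ValueError("at least one segment directory is required")
    return [
        nutch_bin,
        "solrindex",
        solr_url,                              # base URL of the Solr server
        posixpath.join(crawl_dir, "crawldb"),  # crawl database
        posixpath.join(crawl_dir, "linkdb"),   # link database
        *segments,                             # one or more segment directories
    ]


if __name__ == "__main__":
    # Placeholder URL and paths; substitute your own crawl output.
    cmd = build_solrindex_command(
        "http://127.0.0.1:8983/solr/",
        "crawl",
        ["crawl/segments/20090309000000"],
    )
    print(" ".join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` from the Nutch installation directory once the crawl has produced the crawldb, linkdb, and segments.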




Answer 2:


It's still an open issue. If you're feeling adventurous, you could try applying those patches yourself, although it doesn't look simple.




Answer 3:


Nutch 2.x is designed to use Solr by default. You can follow the steps in http://wiki.apache.org/nutch/Nutch2Tutorial, or the more detailed instructions in the book "Web Crawling and Data Mining with Apache Nutch".



Source: https://stackoverflow.com/questions/211411/using-nutch-crawler-with-solr
