Using Nutch crawler with Solr

前端 未结 3 1901
-上瘾入骨i
-上瘾入骨i 2020-12-23 15:09

Am I able to integrate Apache Nutch crawler with the Solr Index server?

Edit:

One of our devs came up with a solution from these posts

相关标签:
3条回答
  • 2020-12-23 15:50

    If you're willing to upgrade to nutch 1.0 you can use the solrindex as described in this article by Lucid Imagination: http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/.

    0 讨论(0)
  • 2020-12-23 15:52

    It's still an open issue. If you're feeling adventurous you could try applying those patches yourself, although it looks like it's not so simple

    0 讨论(0)
  • 2020-12-23 16:16

    nutch 2.x is designed to use solr as default. You can follow the steps in http://wiki.apache.org/nutch/Nutch2Tutorial, or a better instruction in the book "Web Crawling and Data Mining with Apache Nutch".

    0 讨论(0)
提交回复
热议问题