问题
Am I able to integrate Apache Nutch crawler with the Solr Index server?
Edit:
One of our devs came up with a solution from these posts
- Running Nutch and Solr
- Update for Running Nutch and Solr
Answer
Yes
回答1:
If you're willing to upgrade to nutch 1.0 you can use the solrindex as described in this article by Lucid Imagination: http://www.lucidimagination.com/blog/2009/03/09/nutch-solr/.
回答2:
It's still an open issue. If you're feeling adventurous you could try applying those patches yourself, although it looks like it's not so simple
回答3:
nutch 2.x is designed to use solr as default. You can follow the steps in http://wiki.apache.org/nutch/Nutch2Tutorial, or a better instruction in the book "Web Crawling and Data Mining with Apache Nutch".
来源:https://stackoverflow.com/questions/211411/using-nutch-crawler-with-solr