Does Solr do web crawling?

Happy的楠姐 2020-12-08 08:09

I am interested in web crawling and have been looking at Solr.

Does Solr do web crawling itself, or what are the steps to set up web crawling?

8 answers
  • 2020-12-08 09:05

    Solr does not, in and of itself, have a web crawling feature.

    Apache Nutch is the de facto crawler (and then some) for Solr.

  • 2020-12-08 09:06

    I agree with the other posts here: use Apache Nutch.

    bin/nutch crawl urls -solr http://localhost:8983/solr/ -depth 3 -topN 5

    Note that your Solr version has to match the version of Nutch you use, because older versions of Solr store their indices in a different format.

    The Nutch tutorial: http://wiki.apache.org/nutch/NutchTutorial
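
    The crawl command above takes "urls" as the directory holding the seed list, so that directory has to exist before the crawl starts. A minimal sketch of preparing it, assuming a file name of seed.txt (an arbitrary choice here) and http://nutch.apache.org/ as a placeholder start page, neither of which comes from the answer itself. As far as I know, -depth 3 limits the crawl to three rounds of link following and -topN 5 to the top five URLs per round.

```shell
# Create the seed directory that is passed to "bin/nutch crawl" as "urls".
mkdir -p urls

# One start URL per line. The URL below is only a placeholder (assumption);
# replace it with the site you actually want to crawl.
printf '%s\n' 'http://nutch.apache.org/' > urls/seed.txt

# Show the seed list that the crawl would start from.
cat urls/seed.txt
```

    Run this from the Nutch directory so that the relative "urls" path in the crawl command resolves to the directory created here.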
