What is a good web search and web crawling engine for Java?

Posted by 有些话、适合烂在心里 on 2019-12-06 07:39:05

Question


I am working on an application that needs an integrated search engine, and the engine should also handle crawling. Please suggest a good Java-based search engine.

Thank you in advance.


Answer 1:


Apache Nutch, an open-source web crawler built on Lucene, should satisfy your needs.




Answer 2:


In the past I worked with Terrier, a search engine written in Java:

Terrier is a highly flexible, efficient, effective, and robust search engine, readily deployable on large-scale collections of documents. Terrier implements state-of-the-art indexing and retrieval functionalities. Terrier provides an ideal platform for the rapid development of large-scale retrieval applications.
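
To make "indexing and retrieval" a bit more concrete, here is a toy inverted index in plain Java. This is only a sketch of the general idea and is not the API of Terrier, Lucene, or Nutch; the class and method names (`TinyIndex`, `add`, `search`) are made up for illustration, and a real engine adds ranking, stemming, on-disk storage, and much more.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical toy class; not part of Terrier, Lucene, or Nutch.
public class TinyIndex {
    // term -> sorted set of IDs of the documents containing that term
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    // Indexing: split the text into lowercase terms and record the doc ID under each term.
    public void add(int docId, String text) {
        for (String term : text.toLowerCase().split("\\W+")) {
            if (!term.isEmpty()) {
                postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
            }
        }
    }

    // Retrieval: return the IDs of documents that contain every query term (boolean AND).
    public List<Integer> search(String query) {
        Set<Integer> result = null;
        for (String term : query.toLowerCase().split("\\W+")) {
            if (term.isEmpty()) {
                continue;
            }
            Set<Integer> docs = postings.getOrDefault(term, new TreeSet<>());
            if (result == null) {
                result = new TreeSet<>(docs); // copy so the index itself is not modified
            } else {
                result.retainAll(docs);       // intersect with the next term's postings
            }
        }
        return result == null ? new ArrayList<>() : new ArrayList<>(result);
    }

    public static void main(String[] args) {
        TinyIndex index = new TinyIndex();
        index.add(1, "Nutch is a web crawler");
        index.add(2, "Lucene is a search library");
        index.add(3, "Terrier is a search engine written in Java");
        System.out.println(index.search("search"));
        System.out.println(index.search("search engine"));
    }
}
```

In this sketch a crawler would simply be whatever component fetches pages and calls `add` with their text; the libraries named in this thread handle both steps far more robustly.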




Answer 3:


I've spent the last two years developing our own high-performance search engine in C. For Java, I highly recommend Apache Lucene, as Ajay mentioned above; it's the best in terms of speed, relevancy, and features.



Source: https://stackoverflow.com/questions/1350884/what-is-a-good-web-search-and-web-crawling-engine-for-java