How to classify documents indexed with lucene

前端 未结 3 1444
醉梦人生
醉梦人生 2021-02-10 02:40

I have classified a set of documents with Lucene (fields: content, category). Each document has it\'s own category, but some of them are labeled as uncategorized. Is there any w

3条回答
  •  不要未来只要你来
    2021-02-10 03:14

    As of Lucene 5.2.1, you can use indexed documents to classify new documents. Out of the box, Lucene offers a naive Bayes classifier, a k-Nearest Neighbor classifier (based on the MoreLikeThis class) and a Perceptron based classifier.

    The drawback is that all of these classes are marked with experimental warnings and documented with links to Wikipedia.

提交回复
热议问题