Stemming - code examples or open source projects?

房东的猫 提交于 2019-11-30 04:04:21

Snowball stemmer (C & Java) I've used it's Python binding, PyStemmer

Check out the nltk toolkit written in python. It has a very functional stemmer.

Another option for stemming would be WordNet, along with one of its APIs. Some basic information on stemming and lemmatization, including a description of the Porter stemming algorithm, can be found online in Introduction to Information Retrieval.

Lucene has a stemmer in, I believe (and IIRC it lets you use your own one if you want).

EDIT: Just checked, and Lucence refers to the Snowball site which is an open source stemming library as far as I can tell.

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!