SnowballStemmer for Russian words list

Deadly 提交于 2019-12-06 04:41:41

问题


I do know how to perform SnowballStemmer on a single word (in my case, on russian one). Doing the next things:

from nltk.stem.snowball import SnowballStemmer 

stemmer = SnowballStemmer("russian") 
stemmer.stem("Василий")
'Васил'

How can I do the following if I have a list of words like ['Василий', 'Геннадий', 'Виталий']?

My approach using for loop seems to be not working :(

l=[stemmer.stem(word) for word in l]

回答1:


Your variable l is not pre-defined, causing the name error. See my last two lines for fix.

>>> from nltk.stem.snowball import SnowballStemmer
>>> stemmer = SnowballStemmer("russian") 
>>> my_words = ['Василий', 'Геннадий', 'Виталий']
>>> l=[stemmer.stem(word) for word in l]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
NameError: name 'l' is not defined
>>> l=[stemmer.stem(word) for word in my_words]
>>> l
['васил', 'геннад', 'витал']


来源:https://stackoverflow.com/questions/45696028/snowballstemmer-for-russian-words-list

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!