What is the difference between lemmatization and stemming?

无人共我 2020-12-07 08:25

When should I use each?

Also, does NLTK's lemmatization depend on parts of speech? Wouldn't it be more accurate if it did?

9 answers
  •  时光说笑
    2020-12-07 08:50

    The two differ in two main ways:

    1. A stemmer returns the stem of a word, which need not be identical to the word's morphological root. It is usually sufficient that related words map to the same stem, even if the stem is not itself a valid root. A lemmatiser, by contrast, returns the dictionary form of a word, which must be a valid word.

    2. In lemmatisation, the part of speech of the word should be determined first, because the normalisation rules differ from one part of speech to another. A stemmer operates on a single word without knowledge of its context, so it cannot discriminate between words that have different meanings depending on part of speech.
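
    To make both points concrete, here is a minimal sketch in plain Python. It is a toy, not how any real library works: the suffix rules, the lemma table, and the names `toy_stem` and `toy_lemmatize` are all made up for illustration. In NLTK you would reach for `PorterStemmer` and `WordNetLemmatizer` instead; the latter's `lemmatize` method takes a `pos` argument that defaults to noun, so its output does depend on the part of speech you pass in.

```python
# Toy illustration only: real stemmers and lemmatizers use far richer
# rules and dictionaries than the hypothetical ones below.

# Suffix rules tried in order; the first match wins.
SUFFIX_RULES = [("ies", "i"), ("ing", ""), ("ed", ""), ("s", "")]


def toy_stem(word):
    """Strip a known suffix with no dictionary and no context."""
    word = word.lower()
    for suffix, replacement in SUFFIX_RULES:
        # Only strip if at least two characters of the word remain.
        if word.endswith(suffix) and len(word) - len(suffix) >= 2:
            return word[: len(word) - len(suffix)] + replacement
    return word


# Tiny hand-written lemma table keyed by (word, part of speech).
LEMMAS = {
    ("studies", "n"): "study",
    ("studies", "v"): "study",
    ("meeting", "n"): "meeting",
    ("meeting", "v"): "meet",
    ("better", "a"): "good",
}


def toy_lemmatize(word, pos="n"):
    """Return the dictionary form for (word, pos); fall back to the word."""
    word = word.lower()
    return LEMMAS.get((word, pos), word)
```

    With these rules, `toy_stem` maps both "studies" and "studied" to "studi", which is not a word but still groups the related forms together. `toy_lemmatize("meeting", "n")` keeps "meeting" while `toy_lemmatize("meeting", "v")` gives "meet", which is the part-of-speech dependence described in point 2.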

    Reference: http://textminingonline.com/dive-into-nltk-part-iv-stemming-and-lemmatization
