How to tokenize a Malayalam word?

后端 未结 6 1083
抹茶落季
抹茶落季 2020-12-15 06:27
ഇതുഒരുസ്ടലംമാണ്  

itu oru stalam anu

This is a Unicode string meaning this is a place

import nltk
nltk.w         


        
6条回答
  •  隐瞒了意图╮
    2020-12-15 07:11

    maybe the Viterbi algorithm could help?

    This answer to another SO question (and the other high-vote answer) could help: https://stackoverflow.com/a/481773/583834

提交回复
热议问题