How to do a Python split() on languages (like Chinese) that don't use whitespace as word separator?

后端 未结 9 2502
梦如初夏
梦如初夏 2020-12-03 03:25

I want to split a sentence into a list of words.

For English and European languages this is easy, just use split()

>>> \"This is a sentence.         


        
9条回答
  •  慢半拍i
    慢半拍i (楼主)
    2020-12-03 04:11

    Ok I figured it out.

    What I need can be accomplished by simply using list():

    >>> list(u"这是一个句子")
    [u'\u8fd9', u'\u662f', u'\u4e00', u'\u4e2a', u'\u53e5', u'\u5b50']
    

    Thanks for all your inputs.

提交回复
热议问题