Stanford NLP Tagger via NLTK - tag_sents splits everything into chars

Submitted by 谁都会走 on 2019-12-02 02:42:26

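The tag_sents function takes a list of lists of strings: one inner list of tokens per sentence. word_tokenize returns a flat list of tokens, so a single tokenized sentence has to be wrapped in an outer list; otherwise each token is treated as a sentence and split into characters.

tagger.tag_sents([word_tokenize("The quick brown fox jumps over the lazy dog.")])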
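To see why a flat token list ends up split into characters, here is a minimal sketch that does not need the Stanford tagger at all. Note that toy_tag_sents is a hypothetical stand-in, not the NLTK implementation: it only mimics the input shape that tag_sents expects (an iterable of sentences, each an iterable of tokens) and attaches a dummy tag "X". str.split is likewise used as a stand-in for word_tokenize.

```python
def toy_tag_sents(sentences):
    """Hypothetical stand-in for tagger.tag_sents.

    Treats every element of `sentences` as one sentence and every
    item inside that element as one token, tagging each with "X".
    """
    return [[(token, "X") for token in sentence] for sentence in sentences]


# Stand-in for word_tokenize: a flat list of token strings.
tokens = "The quick brown fox".split()

# Flat list: each token string is taken as a "sentence", and iterating
# over a string yields its characters, so every word is split apart.
flat_result = toy_tag_sents(tokens)

# List of lists: the outer list holds one sentence, the inner list its tokens.
nested_result = toy_tag_sents([tokens])

print(flat_result[0])    # tags for 'T', 'h', 'e'
print(nested_result[0])  # tags for 'The', 'quick', 'brown', 'fox'
```

The only difference between the two calls is the extra pair of brackets, which is what turns a list of tokens into a list of sentences.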

Here's a useful idiom:

 tagger.tag_sents(word_tokenize(sent) for sent in sent_tokenize(text))

where text is a string.

Another variation of what alvas said, which worked for me: tagger.tag_sents([[text]]).
