Stanford NLP Tagger via NLTK - tag_sents splits everything into chars
Question

I'm hoping someone has experience with this, as I can't find any comments online besides a bug report from 2015 regarding the NERtagger, which is probably the same issue. Anyway, I'm trying to batch-process text to get around the poorly performing base tagger. From what I understand, tag_sents should help.

    from nltk.tag.stanford import StanfordPOSTagger
    from nltk import word_tokenize
    import nltk

    stanford_model = 'stanford-postagger/models/english-bidirectional-distsim.tagger'
    stanford_jar =