Use of PunktSentenceTokenizer in NLTK

不思量自难忘° 2020-12-07 16:55

I am learning natural language processing with NLTK. I came across code that uses PunktSentenceTokenizer, but I cannot understand what it is actually used for in the given code.

4 answers

情歌与酒 (original poster)

2020-12-07 17:17

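import nltk
from nltk.tokenize import PunktSentenceTokenizer

# Note: word_tokenize and pos_tag need NLTK data installed via nltk.download().
# train_text is assumed to be a string of raw text defined earlier.

def process_content(corpus):
    # Split the raw text into sentences with an untrained Punkt tokenizer.
    tokenized = PunktSentenceTokenizer().tokenize(corpus)

    try:
        for sent in tokenized:
            words = nltk.word_tokenize(sent)   # sentence -> word tokens
            tagged = nltk.pos_tag(words)       # word tokens -> (word, POS tag) pairs
            print(tagged)
    except Exception as e:
        print(str(e))

process_content(train_text)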

Even without being trained on any other text data, it seems to work just as if it were pre-trained.
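One way to probe that observation is a small sketch like the one below. It only uses PunktSentenceTokenizer and PunktParameters from nltk.tokenize.punkt, so it should not need any downloaded NLTK data. The sample sentence and the hand-added abbreviation "mr" are made up for illustration; they are assumptions of this sketch, not something learned from a corpus or taken from the code above.

```python
from nltk.tokenize.punkt import PunktParameters, PunktSentenceTokenizer

# Made-up sample text containing an abbreviation (illustration only).
sample = "Mr. Smith went home. He slept."

# No training text: the tokenizer starts from empty parameters, so it
# knows no abbreviations and is likely to break right after "Mr.".
untrained = PunktSentenceTokenizer()
untrained_sents = untrained.tokenize(sample)

# Hand-supplied parameters (an assumption of this sketch): tell Punkt
# that "mr" is an abbreviation instead of learning it from training text.
params = PunktParameters()
params.abbrev_types.add("mr")
informed = PunktSentenceTokenizer(params)
informed_sents = informed.tokenize(sample)
# informed_sents -> ['Mr. Smith went home.', 'He slept.']

print(untrained_sents)
print(informed_sents)
```

On plain text without tricky abbreviations the two tokenizers tend to agree, which may be why the untrained instance looks pre-trained. The difference should show up around tokens such as "Mr." that only learned or supplied parameters can mark as abbreviations.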
