POS tagging in German

后端 未结 5 431
予麋鹿
予麋鹿 2020-12-12 21:28

I am using NLTK to extract nouns from a text-string starting with the following command:

tagged_text = nltk.pos_tag(nltk.Text(nltk.word_tokenize(some_string)         


        
5条回答
  •  轮回少年
    2020-12-12 22:05

    Natural language software does its magic by leveraging corpora and the statistics they provide. You'll need to tell nltk about some German corpus to help it tokenize German correctly. I believe the EUROPARL corpus might help get you going.

    See nltk.corpus.europarl_raw and this answer for example configuration.

    Also, consider tagging this question with "nlp".

提交回复
热议问题