How can I differentiate between a person's name and other names that are derived from verbs [closed]

岁酱吖の 提交于 2019-12-08 14:07:02

问题


How can I extract person names from the text?

I have applied some NLP toolkit for this, specifically I used the Stanford NER toolkit to extract names from text. With that, I can extract person names from the text, but when I want the program to extract words like 'programmer', 'lecturer' or 'engineer', the libraries couldn't extract those. Is there any way to extract these from the text?


回答1:


Since "Programmer, lecturer, and engineer" are not named-entities, you may have to maintain a list of those words. I think you can obtain them from word derivation relationships in Wordnet, like "sing" (verb) and "singer" or "lecture" (verb) and "lecturer" (noun).

A SuperSense tagger may also be used as NER, I think it can tag those words you mentioned as "noun.person" which is what you need. ArkRef (Java) is a coreference tool that uses it (through a Java port of supersense tagger, bundled), and there's an online demo there, so you can check if your target words are tagged in square brackets.



来源:https://stackoverflow.com/questions/9561370/how-can-i-differentiate-between-a-persons-name-and-other-names-that-are-derived

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!