Tag generation from text content

北荒 2020-11-29 15:56

I am curious whether an algorithm or method exists to generate keywords/tags from a given text, using weight calculations, occurrence ratios, or other tools.

5 answers
  •  一向 (original poster)
    2020-11-29 16:15

    A very simple solution to the problem would be:

    • count the occurrences of each word in the text
    • treat the most frequent terms as the key phrases
    • keep a blacklist of 'stop words' to filter out common words such as "the", "and", "it", and "is"
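
The three steps above can be sketched in a few lines of Python. This is only a minimal illustration: the function name `extract_tags`, the tiny `STOP_WORDS` set, and the sample sentence are all made up for this example, and a real stop-word list would be much longer.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = {"the", "and", "it", "is", "a", "an", "of", "to", "in", "on"}

def extract_tags(text, top_n=5, stop_words=STOP_WORDS):
    """Return the top_n most frequent words in text, ignoring stop words."""
    # Lowercase the text and split it into runs of ASCII letters.
    words = re.findall(r"[a-z]+", text.lower())
    # Count every word that is not on the blacklist.
    counts = Counter(w for w in words if w not in stop_words)
    # The most frequent remaining words become the tags.
    return [word for word, _ in counts.most_common(top_n)]

sample = "The cat and the dog. The cat is on the mat, and the dog watches the cat."
print(extract_tags(sample, top_n=2))  # → ['cat', 'dog']
```

The tokenizer here only handles unaccented English letters; other languages would need a different splitting rule.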

    I'm sure there are cleverer, statistics-based solutions though.
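
One widely used statistics-based weighting is TF-IDF, which scores a word higher when it is frequent in the document but rare across a reference collection. The answer does not name a specific method, so the sketch below is just one possible choice; `tfidf_tags`, the smoothing formula, and the three-sentence corpus are assumptions made for illustration.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and split it into runs of ASCII letters."""
    return re.findall(r"[a-z]+", text.lower())

def tfidf_tags(doc, corpus, top_n=3):
    """Rank the words of doc by TF-IDF, using corpus (a list of strings) for document frequencies."""
    doc_words = tokenize(doc)
    term_counts = Counter(doc_words)
    corpus_vocab = [set(tokenize(d)) for d in corpus]
    n_docs = len(corpus_vocab)
    scores = {}
    for word, count in term_counts.items():
        # Document frequency: how many corpus documents contain the word.
        df = sum(1 for vocab in corpus_vocab if word in vocab)
        # Smoothed inverse document frequency (one common variant, chosen here as an assumption).
        idf = math.log((1 + n_docs) / (1 + df)) + 1
        # Term frequency is the word's share of the document.
        scores[word] = (count / len(doc_words)) * idf
    return sorted(scores, key=scores.get, reverse=True)[:top_n]

corpus = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the python tutorial covers python lists",
]
print(tfidf_tags(corpus[2], corpus, top_n=1))  # → ['python']
```

Because "the" appears in every document, it receives the lowest weight without needing an explicit stop-word list.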

    If you need a solution to use in a larger project rather than for interest's sake, Yahoo BOSS has a key term extraction method.
