What are some good methods to find the “relatedness” of two bodies of text?

后端 未结 7 965
小鲜肉
小鲜肉 2021-02-02 03:46

Here\'s the problem -- I have a few thousand small text snippets, anywhere from a few words to a few sentences - the largest snippet is about 2k on disk. I want to be able to c

7条回答
  •  情书的邮戳
    2021-02-02 04:29

    These articles on semantic relatedness and semantic similarity may be helpful. And this SO question about Latent Semantic Analysis.

    You could also look into Soundex for words that "sound alike" phonetically.

提交回复
热议问题