What are some good methods to find the “relatedness” of two bodies of text?

后端未结

关注

 7  953

小鲜肉 2021-02-02 03:46

Here\'s the problem -- I have a few thousand small text snippets, anywhere from a few words to a few sentences - the largest snippet is about 2k on disk. I want to be able to c

7条回答

情深已故 (楼主)

2021-02-02 04:28

Jeff talked about something like this on the pod cast to find the Related questions listed on the right side here. (in podcast 32)

One big tip was to remove all common words, like "the" "and" "this" etc. This will leave you with more meaningful words to compare.

And here is a similar question Is there an algorithm that tells the semantic similarity of two phrases

0 讨论(0)

查看其它7个回答
发布评论:

提交评论
- 加载中...