Algorithm to find articles with similar text

梦谈多话 2020-11-28 18:10

I have many articles in a database (each with a title and text). I'm looking for an algorithm to find the X most similar articles, something like Stack Overflow's "Related Questions".

15 answers

半阙折子戏 (original poster)

2020-11-28 18:38

The simplest and fastest way to compare similarity among abstracts is probably to treat each one as a set. First convert each abstract into a set of words, then measure how much each pair of sets overlaps. Python's built-in set type comes in very handy for this task. You would be surprised how well this method compares to the "similar/related papers" options provided by Google Scholar, ADS, WOS or Scopus.
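The set-overlap idea can be sketched in a few lines of Python. This is only a minimal illustration, not a definitive implementation: the answer does not say how overlap should be scored, so the Jaccard index (shared words divided by total distinct words) is an assumption here, as are the function names `word_set`, `jaccard` and `most_similar`, the simple regex tokenizer, and the `articles` dict mapping title to text.

```python
import re


def word_set(text):
    """Lowercase the text and return its set of distinct words."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def jaccard(a, b):
    """Overlap of two sets: |a & b| / |a | b| (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def most_similar(query, articles, x=5):
    """Return up to x (score, title) pairs, best match first.

    `articles` is a hypothetical dict mapping title -> text.
    """
    q = word_set(query)
    scored = [(jaccard(q, word_set(text)), title)
              for title, text in articles.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:x]


# Toy data, invented purely for illustration.
articles = {
    "a": "the cat sat on the mat",
    "b": "stock markets fell sharply",
    "c": "a cat sat on a mat",
}
top = most_similar("the cat sat", articles, x=2)
```

With many articles it would probably be worth computing each word set once and storing it rather than rebuilding it per query, and dropping very common words ("the", "a") before comparing, since they inflate the overlap without saying much about the topic.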
