Calculating context-sensitive text correlation

春和景丽 2021-01-01 01:18

Suppose I want to match address records (or person names or whatever) against each other to merge records that are most likely referring to the same address. Basically, I gu

5 Answers
  •  予麋鹿 (OP)
    2021-01-01 02:03

    You can use Levenshtein edit distance to find strings that differ by only a few characters. BK Trees can help speed up the matching process.
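
    A minimal sketch of that suggestion in Python, not taken from the answer itself: `levenshtein` is a plain dynamic-programming edit distance, and `BKTree` is a small hypothetical class that indexes strings by their distance to a parent node so a lookup can skip subtrees that the triangle inequality rules out. The class name, method names, and sample addresses are illustrative assumptions.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the DP rows as short as possible
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or free if equal
            ))
        prev = curr
    return prev[-1]


class BKTree:
    """Hypothetical BK-tree; each node is (word, {edge_distance: child})."""

    def __init__(self, distance=levenshtein):
        self.distance = distance
        self.root = None

    def add(self, word: str) -> None:
        if self.root is None:
            self.root = (word, {})
            return
        node = self.root
        while True:
            d = self.distance(word, node[0])
            if d == 0:
                return  # exact duplicate, already stored
            child = node[1].get(d)
            if child is None:
                node[1][d] = (word, {})
                return
            node = child

    def search(self, query: str, max_dist: int) -> list:
        """Return sorted (distance, word) pairs within max_dist of query."""
        results = []
        stack = [self.root] if self.root is not None else []
        while stack:
            word, children = stack.pop()
            d = self.distance(query, word)
            if d <= max_dist:
                results.append((d, word))
            # Triangle inequality: a match can only sit under an edge
            # whose label lies in [d - max_dist, d + max_dist].
            for edge, child in children.items():
                if d - max_dist <= edge <= d + max_dist:
                    stack.append(child)
        return sorted(results)


tree = BKTree()
for record in ["12 Main Street", "12 Main Stret", "45 Oak Avenue"]:
    tree.add(record)

# Candidate duplicates of a record: everything within 2 edits of it.
candidates = tree.search("12 Main Street", max_dist=2)
```

    The tree only prunes when the distance function is a true metric, which holds for Levenshtein distance. The threshold of 2 is arbitrary and would need tuning against real records.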
