how to recognize similar words with difference in spelling

前端 未结 8 1916
逝去的感伤
逝去的感伤 2020-12-02 02:05

I want to filter out duplicate customer names from a database. A single customer may have more than one entry to the system with the same name but with little difference in

8条回答
  •  不知归路
    2020-12-02 02:41

    There is a very nice R (just search for "R" in Google) package for Record Linkage. The standard examples target exactly your problem: R RecordLinkage

    The C-Code for Soundex etc. is taken directly from PostgreSQL!

提交回复
热议问题