Text clustering with Levenshtein distances

后端 未结 4 1324
暖寄归人
暖寄归人 2020-11-30 22:52

I have a set (2k - 4k) of small strings (3-6 characters) and I want to cluster them. Since I use strings, previous answers on How does clustering (especially String clusteri

4条回答
  •  执念已碎
    2020-11-30 23:32

    If you would like a clear explanation of how to use partitional clustering (which will surely be faster) to solve your problem, check this paper: Effective Spell Checking Methods Using Clustering Algorithms. https://www.researchgate.net/publication/255965260_Effective_Spell_Checking_Methods_Using_Clustering_Algorithms?ev=prf_pub

    The authors explain how to cluster a dictionary using a modified (PAM-like) version of iK-Means.

    Best of Luck!

提交回复
热议问题