Find near-duplicates of comma-separated lists using Levenshtein distance [duplicate]
This question already has an answer here: Potential Duplicates Detection, with 3 Severity Level 1 answer This question based on the answer of my question yesterday. To solve my problem, Jean-François Corbett suggested a Levenshtein distance approach. Then I found this code somewhere to get Levenshtein distance percentage. Public Function GetLevenshteinPercentMatch( _ ByVal string1 As String, ByVal string2 As String, _ Optional Normalised As Boolean = False) As Single Dim iLen As Integer If Normalised = False Then string1 = UCase$(WorksheetFunction.Trim(string1)) string2 = UCase$