Fuzzy string matching in Python
I have 2 lists of over a million names with slightly different naming conventions. The goal here it to match those records that are similar, with the logic of 95% confidence. I am made aware there are libraries which I can leverage on, such as the FuzzyWuzzy module in Python. However in terms of processing it seems it will take up too much resources having every string in 1 list to be compared to the other, which in this case seems to require 1 million multiplied by another million number of iterations. Are there any other more efficient methods for this problem? UPDATE: So I created a