having problemns while using dask map_partitions with string matching algorithm
问题 I'm having some probems apllying a text search algorithm with parallelized dask insfrastructure. I'm tryng to find the best match for 40,000 stirngs in a series object against a 4000 string list. I could have done it using pandas.apply but it's to time expensive, so i decided try parallelization with map_partitions in dask. I'm using this text search library with python-Levenshtein https://marcobonzanini.com/2015/02/25/fuzzy-string-matching-in-python As you can see, it works ok on this