Merge nearly similar rows with the help of spaCy
Question

I want to merge rows that are nearly similar. Similarity can be checked using spaCy.

df:

    string
    yellow color
    yellow color looks like
    yellow color bright
    red color okay
    red color blood

Expected output:

    string
    yellow color looks like bright
    red color okay blood

Solution: the brute-force approach is to check, for every item in string, its similarity with the other n-1 items, and to merge the two if the similarity is greater than some threshold value. Is there any other approach? I am not in contact with many people who work on this, so I don't know how they do it.
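For reference, the brute-force idea described above might be sketched roughly as follows. This is only an illustration under stated assumptions, not a tested spaCy solution: the names `merge_similar` and `jaccard` and the threshold of 0.4 are hypothetical, and a simple token-overlap score stands in for spaCy so the snippet runs without a model. Rows whose pairwise score reaches the threshold are grouped transitively, and each group is merged by appending words that have not been seen yet, in order.

```python
from typing import Callable, Dict, List


def jaccard(a: str, b: str) -> float:
    """Stand-in similarity (an assumption, not spaCy): token-set overlap."""
    ta, tb = set(a.split()), set(b.split())
    return len(ta & tb) / len(ta | tb) if (ta | tb) else 0.0


def merge_similar(
    rows: List[str],
    similarity: Callable[[str, str], float] = jaccard,
    threshold: float = 0.4,  # hypothetical value, tune for your data
) -> List[str]:
    """Group rows whose pairwise similarity >= threshold, then merge each group."""
    parent = list(range(len(rows)))

    def find(i: int) -> int:
        # Follow parent links to the group's root, shortening the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Brute force: compare every pair once, O(n^2) similarity calls.
    for i in range(len(rows)):
        for j in range(i + 1, len(rows)):
            if similarity(rows[i], rows[j]) >= threshold:
                ri, rj = find(i), find(j)
                if ri != rj:
                    # Keep the smaller index as root so groups stay in row order.
                    parent[max(ri, rj)] = min(ri, rj)

    # Merge each group: keep the first occurrence of every word, in order.
    merged: Dict[int, List[str]] = {}
    for i, row in enumerate(rows):
        tokens = merged.setdefault(find(i), [])
        for tok in row.split():
            if tok not in tokens:
                tokens.append(tok)
    return [" ".join(toks) for toks in merged.values()]


rows = [
    "yellow color",
    "yellow color looks like",
    "yellow color bright",
    "red color okay",
    "red color blood",
]
print(merge_similar(rows))
```

To use spaCy instead of the stand-in, one could pass something like `similarity=lambda a, b: nlp(a).similarity(nlp(b))`, where `nlp` comes from `spacy.load(...)` with a pipeline that ships word vectors; the threshold would then need re-tuning. Note that the grouping is transitive, so two rows can end up merged through a shared neighbour even when their direct score is below the threshold.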