Say I have a list of movie names with misspellings and small variations like this -
\"Pirates of the Caribbean: The Curse of the Black Pearl\"
\"Pirates o
One approach would be to pre-process all the strings before you compare them: convert all to lowercase, standardize whitespace (eg, replace any whitespace with single spaces). If punctuation is not important to your end goal, you can remove all punctuation characters as well.
Levenshtein distance is commonly-used to determine similarity of a string, this should help you group strings which differ by small spelling errors.