I\'m trying to match and remove all words in a list from a string using a compiled regex but I\'m struggling to avoid occurrences within words.
Current:
<
One problem is that only the first \b
is inside a raw string. The second gets interpreted as the backspace character (ASCII 8) rather than as a word boundary.
To fix, change
regex = re.compile(r'\b('+remove+')\b', flags=re.IGNORECASE)
to
regex = re.compile(r'\b('+remove+r')\b', flags=re.IGNORECASE)
^ THIS