The regex [^\w ] will match anything that is not alphanumeric or space.
You could use this to ensure all apostrophes/quotes/etc get removed, even if done with Unicode - though there is not enough information in the question to know if this is acceptable.