Is there a simple regular expression that matches all Unicode quotes? Or does one have to hand-code it like this:
quotes = ur"[\"\'\u2018\u2019\u201c\u201
Quotation marks will often have the Unicode category Pi (punctuation, initial quote) or Pf (punctuation, final quote). You'll have to handle the "neutral" quotation marks ' and " manually.
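As a rough sketch of that idea (not the only way to do it), the character class can be generated instead of hand-coded. This assumes Python 3 and the standard unicodedata module; the names build_quote_pattern and QUOTE_RE are made up for illustration. The built-in re module has no \p{Pi} syntax, so the sketch scans every code point once and collects those in category Pi or Pf, then adds the two neutral quotes by hand.

```python
import re
import sys
import unicodedata


def build_quote_pattern():
    """Compile a character class of all Pi/Pf code points plus ' and "."""
    # Collect every code point whose general category is Pi or Pf.
    chars = [
        chr(cp)
        for cp in range(sys.maxunicode + 1)
        if unicodedata.category(chr(cp)) in ("Pi", "Pf")
    ]
    # The neutral ASCII quotes are in category Po, so add them manually.
    chars.extend(["'", '"'])
    return re.compile("[" + "".join(re.escape(c) for c in chars) + "]")


# Hypothetical module-level name; built once because the scan is slow-ish.
QUOTE_RE = build_quote_pattern()

print(QUOTE_RE.findall("\u201ccurly\u201d, \u00abguillemets\u00bb, 'plain'"))
```

Two caveats: the low-9 quotes U+201A and U+201E are in category Ps, so this class does not match them, and Pi/Pf also contain a few marks that are not quotes in the usual sense. If a third-party dependency is acceptable, the regex package supports Unicode property classes such as \p{Pi} and \p{Pf} directly.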