English, of course, is a no-brainer for regex because that's what it was originally developed in/for:
Can regular expressions understand this charact
It is not about the regular expression itself but about the framework that executes it. Java and .NET, I think, are very good at handling Unicode, so "è and e are both considered word characters by regex" is true.
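To make the "it depends on the framework" point concrete, here is a small sketch in Python, chosen only because it is easy to run and not because it is one of the frameworks named above. The helper name `is_word_char` is made up for this example. It checks whether a single character matches `\w`, once with the engine's default Unicode-aware behaviour and once with the `re.ASCII` flag, which restricts `\w` to `[a-zA-Z0-9_]`.

```python
import re

E_GRAVE = "\u00e8"  # the precomposed character è


def is_word_char(ch: str, ascii_only: bool = False) -> bool:
    """Return True if the single character `ch` matches \\w.

    Hypothetical helper for illustration only. With ascii_only=True the
    re.ASCII flag is passed, so \\w only covers [a-zA-Z0-9_].
    """
    flags = re.ASCII if ascii_only else 0
    return re.fullmatch(r"\w", ch, flags) is not None


if __name__ == "__main__":
    for ch in ("e", E_GRAVE):
        print(
            repr(ch),
            "unicode mode:", is_word_char(ch),
            "ascii mode:", is_word_char(ch, ascii_only=True),
        )
```

Same pattern, same input, different answer depending on how the engine is configured. As far as I know the defaults differ between frameworks too: .NET treats `\w` as Unicode-aware out of the box, while Java needs `Pattern.UNICODE_CHARACTER_CLASS` for `\w` to match è, so it is worth checking the documentation for whichever engine you use.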