Apparently Java\'s Regex flavor counts Umlauts and other special characters as non-\"word characters\" when I use Regex.
\"TESTÜTEST\".replaceAll( \"
You might want to remove the accents and diacritic signs first, then on each character position check if the "simplified" string is an ascii letter - if it is, the original position shall contain word characters, if not, it can be removed.