Remove diacritical marks (ń ǹ ň ñ ṅ ņ ṇ ṋ ṉ ̈ ɲ ƞ ᶇ ɳ ȵ) from Unicode chars

后端 未结 12 844
故里飘歌
故里飘歌 2020-11-22 11:42

I am looking at an algorithm that can map between characters with diacritics (tilde, circumflex, caret, umlaut, caron) and their \"simple\" character.

For example:

12条回答
  •  一个人的身影
    2020-11-22 11:58

    It's part of Apache Commons Lang as of ver. 3.1.

    org.apache.commons.lang3.StringUtils.stripAccents("Añ");
    

    returns An

提交回复
热议问题