I have the following command to replace Unicode characters with ASCII ones.
sed -i \'s/Ã/A/g\'
The problem is Ã
isn\'t recognized
There is also uconv
, from ICU.
Examples:
uconv -x "::NFD; [:Nonspacing Mark:] > ; ::NFC;"
: to remove accentsuconv -x "::Latin; ::Latin-ASCII;"
: for a transliteration latin/asciiuconv -x "::Latin; ::Latin-ASCII; ([^\x00-\x7F]) > ;"
: for a transliteration latin/ascii and removal of remaining code points > 0x7Fecho "À l'école ☠" | uconv -x "::Latin; ::Latin-ASCII; ([^\x00-\x7F]) > ;"
gives: A l'ecole