Normalization does not preserve code point
问题 Can anyone please explain me why the NFD normalization from U+2126 (Ω) and U+03A9 (Ω) results in the same representation and does not preserve the code point? I would have expected this behaviour for NFKD and NFKC (and for characters with diacritics) only. result1 = unicodedata.normalize("NFD", u"\u2126") result2 = unicodedata.normalize("NFD", u"\u03A9") print("NFD: " + repr(result1)) print("NFD: " + repr(result2)) Output: NFD: u'\u03a9' NFD: u'\u03a9' 回答1: These are known as "singleton