Why UTF-32 exists whereas only 21 bits are necessary to encode every character?

前端未结

关注

 5  1668

既然无缘 2020-12-05 07:02

We know that codepoints can be in this interval 0..10FFFF which is less than 2^21. Then why do we need UTF-32 when all codepoints can be represented by 3 bytes? UTF-24 shoul

5条回答

夕颜 (楼主)

2020-12-05 07:44
UTF-24 has no added value.
- If space matters, UTF-8 can encode all existing unicode characters (0...0x10FFFF) in the same 3 bytes or less (and in most cases will need less than 3 bytes). So UTF-8 is more compact than UTF-24.
- If space doesn't matter, UTF-32 is faster than UTF-24, because computers work better with power-of-2 aligned data.
0 讨论(0)

查看其它5个回答
发布评论:

提交评论
- 加载中...