difference between iso-8859 and iso-8859-1,

萝らか妹 提交于 2019-12-01 05:42:01

ISO-8859 is a standard for 8-bit character encodings. 8 bits give you 256 combinations which is OK for most extensions of the Latin alphabet but not for Chinese characters.

ISO-8859-1 is one of the "versions" of ISO-8859 supporting most Western-European languages (French, German, Spanish,...). For Central-European languages (Polish, Czech, Slovak,...) you need ISO-8859-2, etc.

One of the different points between ISO-8859-1 and ISO-8859-2 is the French letter è in ISO-8859-1, which is at the same position as the Czech/Slovak letter č in ISO-8859-2. That's why you could not combine these two letters in one text then.

Now with the Unicode it is possible to combine Chinese characters too.

There are several encodings available for chinese (e.g. simplified and traditional). See http://download.oracle.com/javase/6/docs/technotes/guides/intl/encoding.doc.html for a list. The most common ones are GB2312 aka EUC_CN for simplified chinese and Big5 for traditional chinese. I've also seen chinese documents represented in UTF-8.

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!