Difference between ISO-8859 and ISO-8859-1

徘徊边缘, submitted on 2019-12-01 03:56:41

Question


Does ISO-8859 support only Latin characters, or do I need to use ISO-8859-1 in a Java program to read a file containing Chinese characters? And what is the difference between the two?


Answer 1:


ISO-8859 is a multi-part standard for 8-bit character encodings. 8 bits give you 256 possible values, which is enough for most extensions of the Latin alphabet but not for Chinese characters.
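To see this limit from Java, one option is to ask the charset's encoder whether it can represent a given text. The following is only a minimal sketch; the class name `Latin1Coverage` and the helper `fitsInLatin1` are made up for illustration, and the strings use Unicode escapes so the result does not depend on the source file's encoding.

```java
import java.nio.charset.CharsetEncoder;
import java.nio.charset.StandardCharsets;

public class Latin1Coverage {
    // Hypothetical helper: true if every character of the text
    // has a representation in ISO-8859-1.
    static boolean fitsInLatin1(String text) {
        CharsetEncoder encoder = StandardCharsets.ISO_8859_1.newEncoder();
        return encoder.canEncode(text);
    }

    public static void main(String[] args) {
        // "café": accented Latin letters are covered by ISO-8859-1
        System.out.println(fitsInLatin1("caf\u00e9"));     // prints true
        // "中文": Chinese characters are not
        System.out.println(fitsInLatin1("\u4e2d\u6587"));  // prints false
    }
}
```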

ISO-8859-1 is one of the parts of ISO-8859; it supports most Western European languages (French, German, Spanish, ...). For Central European languages (Polish, Czech, Slovak, ...) you need ISO-8859-2, and so on.

One difference between ISO-8859-1 and ISO-8859-2 is that the byte value that encodes the French letter è in ISO-8859-1 (0xE8) encodes the Czech/Slovak letter č in ISO-8859-2. That is why these two letters could not be combined in one text stored in a single ISO-8859 encoding.
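The clash can be reproduced by decoding one and the same byte with both charsets. This is a rough sketch, not a definitive demo: the class name `SameByteTwoLetters` and the `decode` helper are invented here, and it assumes the JRE ships the ISO-8859-2 charset (common, but only ISO-8859-1 is guaranteed by the Java specification).

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class SameByteTwoLetters {
    // Hypothetical helper: interpret the bytes using the given charset.
    static String decode(byte[] bytes, Charset charset) {
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        byte[] data = { (byte) 0xE8 };
        // Assumption: ISO-8859-2 is available in this JRE;
        // Charset.forName throws UnsupportedCharsetException otherwise.
        Charset latin2 = Charset.forName("ISO-8859-2");

        String asLatin1 = decode(data, StandardCharsets.ISO_8859_1);
        String asLatin2 = decode(data, latin2);

        // Print code points rather than glyphs to avoid console encoding issues.
        System.out.printf("ISO-8859-1: U+%04X%n", (int) asLatin1.charAt(0)); // U+00E8 (è)
        System.out.printf("ISO-8859-2: U+%04X%n", (int) asLatin2.charAt(0)); // U+010D (č)
    }
}
```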

With Unicode, all of these letters can now be combined in one text, and Chinese characters too.
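As a small illustration, a string mixing è, č and a Chinese character should survive a round trip through UTF-8, one of the Unicode encodings. The class `MixedScripts` and the method `roundTrip` are illustrative names only, not part of any library.

```java
import java.nio.charset.StandardCharsets;

public class MixedScripts {
    // Hypothetical helper: encode to UTF-8 bytes and decode them back.
    static String roundTrip(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // è (U+00E8), č (U+010D) and 中 (U+4E2D) in a single string
        String mixed = "\u00E8 \u010D \u4E2D";
        System.out.println(mixed.equals(roundTrip(mixed))); // prints true
    }
}
```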




Answer 2:


There are several encodings available for Chinese (e.g. for simplified and for traditional Chinese). See http://download.oracle.com/javase/6/docs/technotes/guides/intl/encoding.doc.html for a list. The most common ones are GB2312 (aka EUC_CN) for simplified Chinese and Big5 for traditional Chinese. I've also seen Chinese documents encoded in UTF-8.
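For the Java program in the question, the main point is to pass the file's actual charset explicitly when opening it. Below is a sketch under assumptions: the class `ReadChineseFile`, the method `readAll`, and the command-line convention are all invented for illustration, and "GB2312" is only a default guess; you have to know or find out which encoding the file really uses, and not every JRE build includes every Chinese charset.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class ReadChineseFile {
    // Hypothetical helper: read the whole file, decoding it with the given charset.
    static String readAll(Path file, Charset charset) throws IOException {
        StringBuilder text = new StringBuilder();
        try (BufferedReader reader = Files.newBufferedReader(file, charset)) {
            String line;
            while ((line = reader.readLine()) != null) {
                text.append(line).append('\n');
            }
        }
        return text.toString();
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 1) {
            System.err.println("usage: ReadChineseFile <file> [charset]");
            return;
        }
        // Assumption: the file is GB2312 unless a charset name
        // (e.g. Big5 or UTF-8) is given as the second argument.
        String charsetName = args.length > 1 ? args[1] : "GB2312";
        Charset charset = Charset.forName(charsetName);
        System.out.print(readAll(Paths.get(args[0]), charset));
    }
}
```

If the wrong charset is chosen, `Files.newBufferedReader` may throw a `MalformedInputException` for multi-byte encodings, which is often a useful hint that the guess was wrong.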



Source: https://stackoverflow.com/questions/4711496/difference-between-iso-8859-and-iso-8859-1
