Official repository of Unicode character names

Posted by 吃可爱长大的小学妹 on 2021-02-19 08:46:08

Question


There are a few ways to get a list of all Unicode character names: for example, using the Python module unicodedata, as explained in List of unicode character names, or using the website https://unicode.org/charts/charindex.html. The website's index is incomplete, though, and you have to open and parse PDFs to find the names.

But what is the official source / repository of all Unicode character names? I'm looking for the original source of these names, in a machine-readable format, so that when a new character is added, the list is updated.

I'm looking for a list with just code point and name, in CSV or any other format:

code   character name
...
0102   LATIN CAPITAL LETTER A WITH BREVE
0103   LATIN SMALL LETTER A WITH BREVE
...

Answer 1:


The official source for the actual character data (which includes the character names and many, many other details) is the Unicode Character Database.

The latest version of the data files can be accessed via http://www.unicode.org/Public/UCD/latest/.

Names specifically can be found in the file NamesList.txt. The format of that file is described here.
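As a rough illustration, the names can be pulled out of NamesList.txt with a few lines of Python. This is only a sketch: it assumes that a name line starts with a hexadecimal code point followed by a tab and the name, while header lines start with `@` and annotation lines start with a tab. The function name `parse_names_list` and the sample text are made up for the example; check the format description before relying on it.

```python
import re

# Assumption: a name line looks like "<hex code point><TAB><name>".
# Lines starting with "@" (headers) or a tab (annotations) do not match.
NAME_LINE = re.compile(r"^([0-9A-F]{4,6})\t(.+)$")


def parse_names_list(lines):
    """Yield (code point, name) string pairs from NamesList.txt-style lines."""
    for line in lines:
        match = NAME_LINE.match(line.rstrip("\r\n"))
        if match:
            yield match.group(1), match.group(2)


# Hand-written sample in the assumed layout, not a verbatim excerpt.
NAMES_LIST_SAMPLE = (
    "@@\t0100\tLatin Extended-A\t017F\n"
    "@\t\tEuropean Latin\n"
    "0102\tLATIN CAPITAL LETTER A WITH BREVE\n"
    "\t: 0041 0306\n"
    "0103\tLATIN SMALL LETTER A WITH BREVE\n"
    "\t* Romanian, Vietnamese\n"
)

for code, name in parse_names_list(NAMES_LIST_SAMPLE.splitlines()):
    print(f"{code}   {name}")
```

Because this file also carries annotations and cross-references, it is aimed more at producing the code charts than at simple machine parsing.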

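This is the list in a CSV-like format (fields are separated by semicolons, not commas; the first field is the code point and the second is the character name): https://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt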
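To get the "code point plus name" list asked for in the question, it should be enough to keep the first two fields of each line. The following is a minimal sketch, not an official tool: the helper names `parse_unicode_data` and `fetch_lines` are hypothetical. Note that some entries carry placeholders rather than real names, such as `<control>` or range markers ending in `, First>` and `, Last>`; this sketch passes them through unchanged.

```python
import urllib.request

UNICODE_DATA_URL = "https://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt"


def parse_unicode_data(lines):
    """Yield (code point, name) string pairs from UnicodeData.txt lines."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        fields = line.split(";")
        # Field 0 is the code point in hex, field 1 is the character name.
        yield fields[0], fields[1]


def fetch_lines(url=UNICODE_DATA_URL):
    """Download the file and return its lines (hypothetical helper, needs network)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8").splitlines()


# Two sample lines in the file's format, so the sketch runs offline.
UNICODE_DATA_SAMPLE = """\
0102;LATIN CAPITAL LETTER A WITH BREVE;Lu;0;L;0041 0306;;;;N;LATIN CAPITAL LETTER A BREVE;;;0103;
0103;LATIN SMALL LETTER A WITH BREVE;Ll;0;L;0061 0306;;;;N;LATIN SMALL LETTER A BREVE;;0102;;0102
"""

for code, name in parse_unicode_data(UNICODE_DATA_SAMPLE.splitlines()):
    print(f"{code}   {name}")
```

To run it against the real data, replace the sample with `parse_unicode_data(fetch_lines())`.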



Source: https://stackoverflow.com/questions/65158620/official-repository-of-unicode-character-names
