Find out the unicode script of a character

既然无缘 2020-12-09 16:45

Given a Unicode character, what would be the simplest way to return its script (as "Latin", "Hangul", etc.)? unicodedata doesn't seem to provide this kind of feature.

5 answers

心在旅途 (original poster)

2020-12-09 17:23

You can use ord to retrieve the numeric value of a character (it works on both unicode and byte strings of length 1).

The next step, unfortunately, is to test that value against the code point ranges assigned to each script. The data here may help: http://cldr.unicode.org/index/downloads
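To make the ord-plus-ranges idea concrete, here is a minimal sketch. The range table is a small hand-picked assumption based on a few well-known Unicode blocks, not authoritative script data, and the name script_of is hypothetical. Block boundaries only approximate script assignments, so a real solution would load the full ranges from published Unicode data instead.

```python
from bisect import bisect_right

# Illustrative (start, end, script) ranges, sorted by start.
# ASSUMPTION: hand-picked from a few well-known blocks; incomplete and
# only an approximation of the real script assignments.
_SCRIPT_RANGES = [
    (0x0041, 0x005A, "Latin"),     # A-Z
    (0x0061, 0x007A, "Latin"),     # a-z
    (0x0370, 0x03FF, "Greek"),     # Greek and Coptic block
    (0x0400, 0x04FF, "Cyrillic"),  # Cyrillic block
    (0x3040, 0x309F, "Hiragana"),  # Hiragana block
    (0x30A0, 0x30FF, "Katakana"),  # Katakana block
    (0x4E00, 0x9FFF, "Han"),       # CJK Unified Ideographs block
    (0xAC00, 0xD7A3, "Hangul"),    # Hangul Syllables
]
_STARTS = [start for start, _, _ in _SCRIPT_RANGES]


def script_of(char):
    """Return the script name for a single character, or "Unknown"."""
    code_point = ord(char)  # raises TypeError unless len(char) == 1
    # Find the last range whose start is <= code_point.
    index = bisect_right(_STARTS, code_point) - 1
    if index >= 0:
        _, end, name = _SCRIPT_RANGES[index]
        if code_point <= end:
            return name
    return "Unknown"


if __name__ == "__main__":
    for ch in "Aя한中1":
        print(repr(ch), script_of(ch))
```

The binary search keeps lookups fast once the table grows to the thousands of ranges that a complete script listing contains; for a table this small, a plain loop would work equally well.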
