unicode

Why use Unicode if your program is English only?

好久不见. 提交于 2021-01-26 19:17:50
问题 So I've read Joel's article, and looked through SO, and it seems the only reason to switch from ASCII to Unicode is for internationalization. The company I work for, as a policy, will only release software in English, even though we have customers throughout the world. Since all of our customers are scientists, they have functional enough English to use our software as a non-native speaker. Or so the logic goes. Because of this policy, there is no pressing need to switch to Unicode to support

In what 8-bit character set is 0x9d meaningful?

依然范特西╮ 提交于 2021-01-26 09:46:37
问题 In what 8-bit ASCII-like character set for English is 0x9d meaningful? I'm cleaning up some old data files, and occasionally finding a 0x9d in otherwise-ASCII text. (No, it's not UTF-8.) It's not valid in Windows-1252. The Python "latin-1" codec translates it to Unicode 0x9D, which is "Operating System Command". That makes little sense. In Unicode you get a box with [009d]. (In Python, you can convert anything to Latin-1 without errors being raised, but that doesn't mean it's meaningful to do

Can anyone explain how to insert emojis to VBA Excel?

匆匆过客 提交于 2021-01-24 11:26:46
问题 In VBA Excel to print " & Chr(34) & " is used to define " . I have searched and it said 34th character is " , as per this website. In same way for 👍 Thumbs Up Sign the code is 👍 👍 I have tried: " & Chr(#128077) & " " & Chr(128077) & " " & Chr(#x1f44d) & " " & Chr(x1f44d) & " But it says error as a Syntax error or Expecting ) or Invalid Argument Can anyone suggest me the solution 回答1: In Excel, characters are stored using Unicode UTF-16 . "Thumbs up" corresponds to the Unicode character U

What are standard unicode fonts?

不想你离开。 提交于 2021-01-22 06:31:34
问题 What are standard unicode fonts for following operating systems: Windows XP Windows Vista Window 7 By standard I mean they are present in fresh installation of OS - there is no need to install them as additional package. 回答1: check the list by product from links; http://www.microsoft.com/typography/fonts/default.aspx www.microsoft.com/typography/fonts/winxp.htm 回答2: I was looking for the same thing. Looks like there is only one font in all Win OS : Lucida Sans Unicode File name: l_10646.ttf

What are standard unicode fonts?

。_饼干妹妹 提交于 2021-01-22 06:31:29
问题 What are standard unicode fonts for following operating systems: Windows XP Windows Vista Window 7 By standard I mean they are present in fresh installation of OS - there is no need to install them as additional package. 回答1: check the list by product from links; http://www.microsoft.com/typography/fonts/default.aspx www.microsoft.com/typography/fonts/winxp.htm 回答2: I was looking for the same thing. Looks like there is only one font in all Win OS : Lucida Sans Unicode File name: l_10646.ttf

Python - Reading Emoji Unicode Characters

自闭症网瘾萝莉.ら 提交于 2021-01-21 07:22:12
问题 I have a Python 2.7 program which reads iOS text messages from a SQLite database. The text messages are unicode strings. In the following text message: u'that\u2019s \U0001f63b' The apostrophe is represented by \u2019 , but the emoji is represented by \U0001f63b . I looked up the code point for the emoji in question, and it's \uf63b . I'm not sure where the 0001 is coming from. I know comically little about character encodings. When I print the text, character by character, using: s = u'that

Why does the red heart emoji require two code points, but the other colored hearts require one?

浪子不回头ぞ 提交于 2021-01-20 14:43:17
问题 It appears that the red heart emoji (❤️) "\u2764\uFE0F" requires two Unicode codepoints, specifically Heavy Black Heart followed by a Variation Selector. However, blue 💙, green 💚, yellow 💛, and purple 💜 each have their own single codepoint. Why is red so different? 回答1: For historical reasons. Originally, there was only U+2764 HEAVY BLACK HEART which the first applications that supported Emojis decided to render as a red heart. These early applications always rendered U+2764 as Emoji. Later

Why does the red heart emoji require two code points, but the other colored hearts require one?

。_饼干妹妹 提交于 2021-01-20 14:40:49
问题 It appears that the red heart emoji (❤️) "\u2764\uFE0F" requires two Unicode codepoints, specifically Heavy Black Heart followed by a Variation Selector. However, blue 💙, green 💚, yellow 💛, and purple 💜 each have their own single codepoint. Why is red so different? 回答1: For historical reasons. Originally, there was only U+2764 HEAVY BLACK HEART which the first applications that supported Emojis decided to render as a red heart. These early applications always rendered U+2764 as Emoji. Later

Convert fancy/artistic unicode text to ASCII

穿精又带淫゛_ 提交于 2021-01-19 08:56:24
问题 I have a unicode string like "𝖙𝖍𝖚𝖌 𝖑𝖎𝖋𝖊" and would like to convert it to the ASCII form "thug life". I know I can achieve this in Python by import unidecode print(unidecode.unidecode('𝖙𝖍𝖚𝖌 𝖑𝖎𝖋𝖊')) // thug life However, this would asciify also other unicode characters (such as Chinese/Japanese characters, emojis, accented characters, etc.), which I want to preserve. Is there a way to detect these type of "artistic" unicode characters? Some more examples: 𝓽𝓱𝓾𝓰 𝓵𝓲𝓯𝓮 𝓉𝒽𝓊𝑔 𝓁𝒾𝒻𝑒 𝕥𝕙𝕦𝕘 𝕝𝕚𝕗𝕖 thug life

Convert fancy/artistic unicode text to ASCII

无人久伴 提交于 2021-01-19 08:55:13
问题 I have a unicode string like "𝖙𝖍𝖚𝖌 𝖑𝖎𝖋𝖊" and would like to convert it to the ASCII form "thug life". I know I can achieve this in Python by import unidecode print(unidecode.unidecode('𝖙𝖍𝖚𝖌 𝖑𝖎𝖋𝖊')) // thug life However, this would asciify also other unicode characters (such as Chinese/Japanese characters, emojis, accented characters, etc.), which I want to preserve. Is there a way to detect these type of "artistic" unicode characters? Some more examples: 𝓽𝓱𝓾𝓰 𝓵𝓲𝓯𝓮 𝓉𝒽𝓊𝑔 𝓁𝒾𝒻𝑒 𝕥𝕙𝕦𝕘 𝕝𝕚𝕗𝕖 thug life