transliteration

Python - Transliterate German Umlauts to Diacritic

◇◆丶佛笑我妖孽 提交于 2021-02-10 12:10:26
问题 I have a list of unicode file paths in which I need to replace all umlauts with an English diacritic. For example, I would ü with ue, ä with ae and so on. I have defined a dictionary of umlauts (keys) and their diacritics (values). So I need to compare each key to each file path and where the key is found, replace it with the value. This seems like it would be simple, but I can't get it to work. Does anyone out there have any ideas? Any feedback is greatly appreciated! code so far: # -*-

Python - Transliterate German Umlauts to Diacritic

牧云@^-^@ 提交于 2021-02-10 12:01:38
问题 I have a list of unicode file paths in which I need to replace all umlauts with an English diacritic. For example, I would ü with ue, ä with ae and so on. I have defined a dictionary of umlauts (keys) and their diacritics (values). So I need to compare each key to each file path and where the key is found, replace it with the value. This seems like it would be simple, but I can't get it to work. Does anyone out there have any ideas? Any feedback is greatly appreciated! code so far: # -*-

Transliterate/transpose the characters in the NSString

☆樱花仙子☆ 提交于 2020-01-22 05:51:26
问题 I want to transliterate a cyrillic string to it's closest latin equivalent. E.g. "матрешка" => "matreshka", "водка" => "vodka". So ideally I want some ready to use method on the NSString or somewhere else that already knows everything about the alphabets and can do the conversation. But if such functionality doesn't exist in the iOS APIs then I will be totally happy with something like ruby's tr method that just replaces the characters in a string using a simple mapping specified as a

Java Swing TextArea: How to simultaneously print Transliterated text/s in an Output TextArea?

♀尐吖头ヾ 提交于 2020-01-16 10:32:27
问题 I am trying to write a text editor. I want two TextAreas: 1st one for typing/editing in unicode script and 2nd one for simultaneously printing output of the unicode script input in corresponding Roman Script (ASCII) set by myself on the basis of a transliteration scheme. I am not able to update and simultaneously print the input text in output textarea while I call the transliteration method. I can sense a little bit that there is something wrong while I set the output textArea to print

Transliterate CJK to Latin — preferably in C++ [closed]

不想你离开。 提交于 2020-01-06 07:00:50
问题 It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clarifying this question so that it can be reopened, visit the help center. Closed 7 years ago . I am trying to write a program that can transliterate CJK to Latin (i.e Pinyin, Romaji, etc.). For example you give a Chinese, Japanese or Korean document as input and then you get the transliterated version into

Python - codec encoding ascii to unicode: error

。_饼干妹妹 提交于 2020-01-06 06:37:27
问题 :) I am trying to go about the process of reversing transliteration of an input file(currently in english) back to its original form(in hindi) A sample or a part of the input file looks like this: E-k- b-u-d-z*dhi-m-aan- p-ksii# E-k- ghn-e- j-ngg-l- m-e-ng E-k- b-h-u-t- UUNNc-aa p-e-dr thaa# U-s- k-ii p-t-z*t-o-ng s-e- l-d-ii shaakhaay-e-ng m-j-*zb-uut- b-aaj-u-O-ng k-ii t-r-h- pheil-ii h-u-II thiing# w-n- h-NNs-o-ng k-aa E-k- jhu-nhz*D- I-s- p-e-dr p-r- n-i-w-aas- k-r-t-aa thaa# w-e- s-b- y

Python - codec encoding ascii to unicode: error

感情迁移 提交于 2020-01-06 06:36:19
问题 :) I am trying to go about the process of reversing transliteration of an input file(currently in english) back to its original form(in hindi) A sample or a part of the input file looks like this: E-k- b-u-d-z*dhi-m-aan- p-ksii# E-k- ghn-e- j-ngg-l- m-e-ng E-k- b-h-u-t- UUNNc-aa p-e-dr thaa# U-s- k-ii p-t-z*t-o-ng s-e- l-d-ii shaakhaay-e-ng m-j-*zb-uut- b-aaj-u-O-ng k-ii t-r-h- pheil-ii h-u-II thiing# w-n- h-NNs-o-ng k-aa E-k- jhu-nhz*D- I-s- p-e-dr p-r- n-i-w-aas- k-r-t-aa thaa# w-e- s-b- y

Google transliteration API in Java Swing project

巧了我就是萌 提交于 2020-01-01 06:12:22
问题 I'm working on a transliteration project in Java Swing. It can access the internet, but I'm unable to set it up. How do we do that? Do we need to download the Google API libraries or just a function call will do? Here are some useful Transliteration links: The Google Transliterate API Devloper's Guide Google API library for Java for download (I think it has all API libraries of Google:Translate, transliterate, search, etc.) I spent a lot of time, but Icouldn't find how to set up the

php preg_grep and umlaut/accent

て烟熏妆下的殇ゞ 提交于 2019-12-21 20:53:02
问题 I have an array that consists of terms, some of them contain accented characters. I do a preg grep like this $data= array('Napoléon','Café'); $result = preg_grep('~' . $input . '~i', $data); So if user type in 'le' I would also want the result 'Napoléon' to be matched, which does not work with the ablove command. I did some searching and found that this function might be relevant preg_match("/[\w\pL]/u",$var); How can I combine these and make it work? 回答1: This is not possible with a regular

Does .NET transliteration library exists? [closed]

折月煮酒 提交于 2019-12-19 17:43:33
问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed 2 years ago . Does .NET Transliteration library exists ? Note that this is not translation, something like this Perl lib : http://www.lingua-systems.com/transliteration/Lingua-Translit-Perl-module/ I just find : http://transliterator.codeplex.com/ 回答1: Check my UnidecodeSharpFork . It's based on great Python Unidecode