Chunking English words into graphemes corresponding to distinct sounds

旧时模样 提交于 2019-12-13 02:46:53

问题


How to convert english input word into combinations of graphemes? Is there a library or function that does the job?

What I'm looking for is an algorithm/implementation that splits orthographic words into segments which map to phonemes. That is, the sequence of letters in a word should be broken in between distinct sounds.

To my mind, this would look something like the following:

physically --> ph-y-s-i-c-a-ll-y
psychology --> ps-y-ch-o-l-o-g-y
thrush -->     th-r-u-sh
bought --> b-ough-t
chew --> ch-ew
palm --> p-al-m

回答1:


Googling for split english words into graphemes, the first result appears to be a paper about mapping English orthography onto a phonemic representation using a Machine Learning approach. This paper appears to be doing the kind of thing you're looking for.



来源:https://stackoverflow.com/questions/10825685/chunking-english-words-into-graphemes-corresponding-to-distinct-sounds

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!