simple speech recognition methods

╄→尐↘猪︶ㄣ submitted on 2019-12-01 08:58:40

Some time ago, I read a whitepaper about a limited-vocabulary system that used a simple recognition process. The system divided each utterance into a small number of bins (6 in time and 4 in magnitude, if I remember correctly, for 24 total), and all it did was count the number of audio samples falling into each bin. A fuzzy-logic rule base then took each utterance's 24 bin counts and generated an interpretation.
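The bin counting described above might look roughly like the sketch below. It is not the whitepaper's implementation: the function name `bin_counts`, the equal-width bins, the use of absolute amplitude as "magnitude", and the assumption that samples are normalized to the range -1.0 to 1.0 are all mine.

```python
def bin_counts(samples, time_bins=6, mag_bins=4):
    """Count samples per (time, magnitude) cell.

    Assumes samples are floats normalized to [-1.0, 1.0].
    Returns a flat list of time_bins * mag_bins counts, ordered
    time-major (all magnitude bins for time bin 0 come first).
    """
    counts = [0] * (time_bins * mag_bins)
    n = len(samples)
    for i, s in enumerate(samples):
        # Equal-width time bins over the whole utterance.
        t = min(i * time_bins // n, time_bins - 1)
        # Equal-width magnitude bins over |s|; clamp so |s| >= 1.0
        # lands in the top bin instead of overflowing.
        m = min(int(abs(s) * mag_bins), mag_bins - 1)
        counts[t * mag_bins + m] += 1
    return counts
```

Because the time bins are relative to the length of the utterance, a word spoken slowly or quickly still maps onto the same 6 time slots, although the raw counts scale with the number of samples.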

I imagine that (for some applications) a simple matching process might work just as well: the 24 bin counts of the current utterance are simply compared against those of each stored prototype, and the prototype with the smallest overall difference wins.
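As a minimal sketch of that matching step, assuming "overall difference" means the sum of absolute differences between corresponding bin counts (my choice; a squared difference would work too), and with `l1_distance` and `classify` as hypothetical names:

```python
def l1_distance(a, b):
    """Sum of absolute differences between two equal-length count vectors."""
    if len(a) != len(b):
        raise ValueError("count vectors must have the same length")
    return sum(abs(x - y) for x, y in zip(a, b))


def classify(counts, prototypes):
    """Return the label of the stored prototype closest to `counts`.

    `prototypes` maps a label (e.g. a word) to its stored bin counts.
    """
    return min(prototypes, key=lambda label: l1_distance(counts, prototypes[label]))
```

For example, with prototypes `{"yes": [5, 0, 1], "no": [0, 5, 1]}`, the call `classify([4, 1, 1], prototypes)` picks `"yes"`. If utterances vary a lot in length, dividing each count vector by its total before comparing is probably worth trying, so that the comparison reflects the shape of the distribution rather than the number of samples.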

Luis Uebel

There are some open source projects for speech recognition:

  1. HTK (Hidden Markov Model Toolkit)
  2. CMU Sphinx

Both include a decoder, training tools, and language-modeling tools: everything needed to build a complete and robust speech recognizer. VoxForge provides acoustic and language models for both toolkits.
