Voice control API - high accuracy on specific phrases [closed]

纵然是瞬间 提交于 2019-12-04 09:23:25

I have been recently involved in a project developing a platform for mobile grammar-based speech recognition applications, with the following features:

All the components are open source and it shouldn't be too hard to set up your own server and port the system to your language, given that you have the acoustic models for that language.

VoiceXML and SRGS might be a good starting point for your search. There's not much in the world of open-source, sadly, because getting this sort of stuff "right" will mean a big payday.

Using a speech recognition system that supports grammars (SRGS) will increase your recognition rate. Grammars restrict the search space by specifying expected words and phrases as rules that the speech recognition system uses to get a match and therefore can increase performance and recognition rate.

VoiceXML is a good language for developing speech applications that use a telephone as a mode of interaction. What I mean by using a telephone as a mode of interaction is that the user actually dials an IVR system which answers the call and then starts interacting with the user through recorded audio prompts and user input through speech or telephone key pad input. VoiceXML is not intended for mobile apps that have visual interfaces like a native Android application or a web application. To develop visual applications that use speech you could use something like Nuance's mobile tool, which can have a hefty price tag. Or something open source like Sphinx.

Most cloud-based APIs for speech recognition (Google, AT&T, Siri, etc.) do not allow a custom SRGS grammar to be used to improve accuracy. That is really unfortunate.

One possibility is to combine two technologies from Voxeo, namely Tropo and Phono. The former is an API-based voice platform that is much easier to use than VoiceXML platforms, and the latter is jQuery plugin for making (and controlling) voice calls from your browser. Tropo supports SRGS grammars.

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!