Recognizing multiple peoples voices

我只是一个虾纸丫 提交于 2019-12-18 05:21:01

问题


I am looking for an open source voice recognition engine that, instead of responding to spoken words, can determine who is speaking. Does anyone know where I might be able to find something like this?


回答1:


You can consider

Bob SPEAR

https://pypi.python.org/pypi/bob.bio.spear

Alize/Mistral

http://mistral.univ-avignon.fr/index_en.html

GMM speaker identification in matlab

https://github.com/codyaray/speaker-recognition

Very basic speaker recognition in Java, not really accurate

https://github.com/amaurycrickx/recognito

In python, quite a state of the art toolkit

https://pypi.python.org/pypi/SIDEKIT




回答2:


What you are looking for is speaker diarization and identification.

You can use Microsoft Speaker Recognition for doing the speaker identification task: https://www.microsoft.com/cognitive-services/en-us/speaker-recognition-api#identification




回答3:


If you're Symfony2 developer, I provide you bundle for Microsoft Speaker Recognition API: https://github.com/onekit/speaker-recognition-bundle

Feel different between speaker and speech. Speech-To-Text can be recognized by Chrome Browser.



来源:https://stackoverflow.com/questions/32914148/recognizing-multiple-peoples-voices

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!