Gender detection of the speaker from wave data of the audio

风格不统一 提交于 2019-11-29 11:29:04

Accurate gender identification can be implemented with GMM classifier of MFCC features. You can read about it here:

AGE AND GENDER RECOGNITION FOR TELEPHONE APPLICATIONS BASED ON GMM SUPERVECTORS AND SUPPORT VECTOR MACHINES

To the date I am not aware of open source implementation of this, though many components are available in open source speech recognition toolkits like CMUSphinx.

Accurate gender identification can be implemented with training a GMM classifier on MFCC features of male and female. Here is how one can go about it.

  1. One needs to collect training set for each of the gender.
  2. Extract MFCCs features from all the audios of respective gender(One can find python implementation like scikit-talkbox etc).
  3. Train GMM models for both the gender on the extracted features from their training set audios.

For details, Here is an open source python implementation of the same. The following tutorials evaluates the code on subset extracted from Google's AudioSet which is released this year (2017)

https://appliedmachinelearning.wordpress.com/2017/06/14/voice-gender-detection-using-gmms-a-python-primer/

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!