Gender detection of the speaker from wave data of the audio

星月不相逢 2020-12-19 17:17

I would like to add a gender detection capability to a news video translator app I'm working on, so that the app can switch between male and female voice according to the v

2 answers
  •  抹茶落季
    2020-12-19 17:29

    Gender identification can be implemented by training a GMM (Gaussian mixture model) classifier on MFCC (mel-frequency cepstral coefficient) features of male and female speech. Here is how one can go about it.

    1. Collect a training set of audio recordings for each gender.
    2. Extract MFCC features from all the recordings of each gender (Python implementations are available, e.g. scikit-talkbox).
    3. Train one GMM per gender on the features extracted from that gender's training recordings.
    4. To classify a new recording, extract its MFCCs, score them under both GMMs, and pick the gender whose model gives the higher log-likelihood.
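
The steps above, plus the final scoring decision, can be sketched roughly as follows. This is only an illustration under stated assumptions, not the linked tutorial's code: it uses scikit-learn's `GaussianMixture`, the helper names `train_gender_gmm` and `predict_gender` are made up here, and random arrays stand in for real MFCC matrices of shape (n_frames, n_mfcc). In a real app those matrices would come from an MFCC extractor run on the audio.

```python
import numpy as np
from sklearn.mixture import GaussianMixture


def train_gender_gmm(features, n_components=8, seed=0):
    """Fit one diagonal-covariance GMM on an (n_frames, n_mfcc) matrix.

    Hypothetical helper; n_components=8 is an arbitrary illustrative choice.
    """
    gmm = GaussianMixture(
        n_components=n_components,
        covariance_type="diag",
        n_init=3,
        random_state=seed,
    )
    gmm.fit(features)
    return gmm


def predict_gender(features, male_gmm, female_gmm):
    """Return the label whose GMM gives the higher mean log-likelihood."""
    male_score = male_gmm.score(features)      # average per-frame log-likelihood
    female_score = female_gmm.score(features)
    return "male" if male_score > female_score else "female"


# ASSUMPTION: synthetic stand-ins for MFCC features (13 coefficients per frame).
# Real features would be extracted from labelled male and female recordings.
rng = np.random.default_rng(0)
male_train = rng.normal(loc=-1.0, scale=1.0, size=(2000, 13))
female_train = rng.normal(loc=1.0, scale=1.0, size=(2000, 13))

male_gmm = train_gender_gmm(male_train)
female_gmm = train_gender_gmm(female_train)

# Frames from an unseen clip, drawn here from the "female" distribution.
test_clip = rng.normal(loc=1.0, scale=1.0, size=(300, 13))
print(predict_gender(test_clip, male_gmm, female_gmm))
```

Averaging the log-likelihood over all frames of a clip, rather than voting frame by frame, keeps a few noisy or silent frames from dominating the decision.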

    For details, here is an open-source Python implementation of this approach. The tutorial evaluates the code on a subset extracted from Google's AudioSet, which was released in 2017.

    https://appliedmachinelearning.wordpress.com/2017/06/14/voice-gender-detection-using-gmms-a-python-primer/
