Python Speaker Recognition [closed]


傲寒


4 Answers

深忆病人

2020-12-25 15:24

Start with numpy, and I would look at spectrograms (basically a rolling FFT) as a good method for distinguishing different voices in an audio recording.

Here's the spectrogram function in Matplotlib:

http://matplotlib.sourceforge.net/api/pyplot_api.html#matplotlib.pyplot.specgram

I would recommend Python(x,y) if you're just getting started on a Windows platform.
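
To make the "rolling FFT" idea concrete, here is a minimal numpy-only sketch. The function name `rolling_fft`, the window and hop sizes, and the synthetic two-tone signal (a stand-in for two voices with different pitch) are all assumptions chosen for illustration, not part of any library; real recordings would need to be loaded from a file first.

```python
import numpy as np

def rolling_fft(signal, sample_rate, window_size=512, hop=256):
    """Short-time FFT of a 1-D signal (hypothetical helper, illustration only).

    Returns (freqs, times, magnitudes), where magnitudes has shape
    (frequency_bins, frames).
    """
    window = np.hanning(window_size)  # taper each frame to reduce leakage
    n_frames = 1 + (len(signal) - window_size) // hop
    frames = np.stack([
        signal[i * hop : i * hop + window_size] * window
        for i in range(n_frames)
    ])
    magnitudes = np.abs(np.fft.rfft(frames, axis=1)).T
    freqs = np.fft.rfftfreq(window_size, d=1.0 / sample_rate)
    times = (np.arange(n_frames) * hop + window_size / 2) / sample_rate
    return freqs, times, magnitudes

# Synthetic input: one second at 150 Hz followed by one second at 300 Hz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
signal = np.concatenate([np.sin(2 * np.pi * 150 * t),
                         np.sin(2 * np.pi * 300 * t)])

freqs, times, mags = rolling_fft(signal, sample_rate)

# Strongest frequency bin in each frame; it should jump when the "voice" changes.
dominant = freqs[mags.argmax(axis=0)]
print(dominant[0], dominant[-1])
```

Matplotlib's `specgram` does the same kind of windowed FFT and plots the result, so in practice you would probably call that instead of rolling your own.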

不知归路

2020-12-25 15:36

Have a look at the CMU Sphinx Python library. Sphinx4 is developed in Java, while Sphinx3 and PocketSphinx are written in C, so I think that the Python libs are just wrappers around the native code. The project has a lot of ongoing research behind it.

Official wiki: http://cmusphinx.sourceforge.net/wiki/

Quick-start tutorial for Linux here: http://probing.wikidot.com/speech-recognition-using-sphinx3-and-python

时光取名叫无心

2020-12-25 15:42

Check out SciKits Talkbox: http://projects.scipy.org/scikits/wiki/Talkbox

Unfortunately, the tutorials are very limited: http://www.ar.media.kyoto-u.ac.jp/members/david/softwares/talkbox/talkbox_doc/intro.html

谎友^

2020-12-25 15:46

Separating the speakers is not a speech recognition task, it's a speaker recognition task. In the speech community this task is also known as speaker diarization. There are several packages for speaker diarization and speaker recognition available for Python:

SIDEKIT from LIUM

Bob toolkit from Idiap

Speaker diarization from ICSI

In case you are not restricted to Python, there are others:

LIUM speaker diarization

Speaker recognition setup in Kaldi. Includes state-of-the-art DNN-based speaker embeddings called x-vectors, the successor to i-vectors.
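
As a rough illustration of how embedding-based systems such as i-vectors or x-vectors are often used: each utterance is reduced to a fixed-length vector, and two utterances are compared with a similarity score, commonly cosine similarity. The sketch below is not Kaldi's API. The helper `cosine_score`, the 512-dimensional size, and the random vectors standing in for real embeddings are all assumptions made purely for illustration.

```python
import numpy as np

def cosine_score(a, b):
    """Cosine similarity between two embedding vectors (hypothetical helper)."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Random stand-ins for speaker embeddings; a real system would extract these
# from audio with a trained model.
rng = np.random.default_rng(0)
speaker_a = rng.normal(size=512)
speaker_b = rng.normal(size=512)

# Utterances from the same speaker are modeled as small perturbations.
enrolled = speaker_a + 0.1 * rng.normal(size=512)
test_same = speaker_a + 0.1 * rng.normal(size=512)
test_diff = speaker_b + 0.1 * rng.normal(size=512)

same = cosine_score(enrolled, test_same)
diff = cosine_score(enrolled, test_diff)
print(same, diff)  # the same-speaker score should be clearly higher
```

A verification decision then amounts to comparing the score against a threshold tuned on held-out data. Diarization applies the same kind of comparison to short segments of one recording and clusters them by speaker.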
