Simplest algorithm of measuring how similar of two short audio

为君一笑 提交于 2019-11-26 21:54:34

问题


The question is to look for any open source or simple implementation to measure how similar between two audios on the iOS application.

Simply speaking, audio can be represented by 1-D vector, to calculate the distance between the 1D vector. But the audio length will be different, therefore need some pre-processing etc.

Looking forward to get some clues here, thanks


回答1:


The similarity between two sequences of variable length can be efficiently calculated with DTW:

http://en.wikipedia.org/wiki/Dynamic_time_warping

This algorithm is simple to implement yourself and there are quite many existing implementations linked on the wiki page.

Simply speaking, audio can represented by 1-D vector,

It's reasonable to split the audio on frames and turn it into 2-D vector of features where for each frame you have an array of values(features) corresponding to the different frequency bands. If you want to deal with music, an FFT for every frame is a good idea, for speech, it's better to calculate mel-frequency cepstrum

Again, you can use many existing libraries for mel frequency features, one of them is a speech recognition toolkit CMUSphinx



来源:https://stackoverflow.com/questions/15653466/simplest-algorithm-of-measuring-how-similar-of-two-short-audio

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!