audio

How to connect object to the filter graph?

心不动则不痛 submitted on 2020-11-29 21:22:47
Question: I need to get decoded sample frames (something like vector<frames>) from DirectShow. To do this, I am following this implementation: https://docs.microsoft.com/en-us/windows/win32/directshow/using-the-sample-grabber Here is my implementation:
bool coAudioPlayerSampleGrabber::LoadImp(SoundDataType dataType, unsigned char const * pData, int64_t dataLen) { Cleanup(); m_bReady = false; HRESULT hr = S_OK; assert(pData); assert(dataLen); m_memBuffer.resize(dataLen); memcpy(m_memBuffer.data(),

What is the second number in the MFCCs array?

二次信任 submitted on 2020-11-29 10:18:04
Question: When I extract MFCCs from an audio file, the output shape is (13, 22). What does the second number represent? Is it the number of time frames? I use librosa. The code I use is:
mfccs = librosa.feature.mfcc(y=X, sr=sample_rate, n_mfcc=13, hop_length=256)
mfccs
print(mfccs.shape)
The output is (13, 22).
Answer 1: Yes, it is the number of time frames. It depends mainly on how many samples you provide via y and which hop_length you choose. Example: Say you have 10 s of audio sampled at 44.1 kHz (CD quality). When you load it with librosa, it
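The relationship between sample count, hop_length, and the second dimension can be checked with a little arithmetic. The sketch below is not librosa code: it assumes librosa's default centered framing (center=True), under which the frame count is 1 + len(y) // hop_length. The helper name mfcc_frame_count and the sample count 5512 are hypothetical, chosen only because they reproduce the shape (13, 22) from the question.

```python
def mfcc_frame_count(n_samples: int, hop_length: int = 512) -> int:
    """Estimate the number of time frames (second axis of the MFCC array).

    Assumes centered framing (librosa's default, center=True), where the
    signal is padded so that the frame count is 1 + n_samples // hop_length.
    """
    if n_samples < 0 or hop_length <= 0:
        raise ValueError("n_samples must be >= 0 and hop_length must be > 0")
    return 1 + n_samples // hop_length


# Hypothetical clip of about 0.25 s at 22,050 Hz (5,512 samples), hop_length=256.
print(mfcc_frame_count(5512, hop_length=256))

# 10 s of audio at 22,050 Hz (librosa.load resamples to this rate by default
# unless sr is passed explicitly), hop_length=256.
print(mfcc_frame_count(10 * 22050, hop_length=256))
```

A shape of (13, 22) with hop_length=256 therefore suggests a short input, somewhere between 5,376 and 5,631 samples under this assumption. The first dimension, 13, is simply n_mfcc.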