问题
I want to train my neural network with different sounds. However, the size of each sounds are different. Does anyone know how to train a neural network with different size of input? Thanks.
回答1:
There is no way to classify inputs of different sizes, but you can transform your signal into a sequence of fixed-size feature vectors (or into a sequence of fixed-size pieces of the original sound). For a sound we usually employ MFCCs or just a spectrogram. Thus, you need to apply methods that operate on sequences. It can be a recurrent neural network, or you can employ a feed-forward network and then post-process its outputs for each frame somehow.
来源:https://stackoverflow.com/questions/41047916/how-to-train-a-neural-network-with-different-size-of-input