I have audio data for which I compute Mel-spectrograms. The audio clips have different durations (between two seconds and one minute), af