Convolution of multiple 1D signals in a 2D matrix with multiple 1D kernels in a 2D matrix
Question: I have a randomly generated matrix H of size 600 x 10, whose elements I write as H(k,t). I also have a speech spectrogram S of size 600 x 597. It was computed from Mel features, which gave a 40 x 611 matrix, and I then applied frame stacking, stacking 15 consecutive frames together. That gave (40x15) x (611-15+1), which is 600 x 597. Now I want to obtain an output matrix Y given by the convolution-based equation Y(k,t) = ∑_τ H(k,τ) S(k,t-τ). The