Fast 2D convolution for DSP
I want to implement some image-processing algorithms which are intended to run on a beagleboard . These algorithms use convolutions extensively. I'm trying to find a good C implementation for 2D convolution (probably using the Fast Fourier Transform). I also want the algorithm to be able to run on the beagleboard's DSP, because I've heard that the DSP is optimized for these kinds of operations (with its multiply-accumulate instruction). I have no background in the field so I think it won't be a good idea to implement the convolution myself (I probably won't do it as good as someone who