OPENMP F90/95 Nested DO loops - problems getting improvement over serial implementation

后端 未结 2 940
清歌不尽
清歌不尽 2021-01-14 08:57

I\'ve done some searching but couldn\'t find anything that appeared to be related to my question (sorry if my question is redundant!). Anyway, as the title states, I\'m hav

2条回答
  •  难免孤独
    2021-01-14 09:05

    What you have is a convolution. This can be done with a Fast Fourier Transform in N log2(N) time. Your algorithm is N^2. If you use FFT, one core will probably be enough!

提交回复
热议问题