OpenCL autocorrelation kernel

Submitted by 元气小坏坏 on 2019-12-04 23:14:24

The code is correct. As far as I know, it should run fine and give correct results.

barrier(CLK_GLOBAL_MEM_FENCE); is not needed. The kernel will run faster without that statement.

Your problem is probably outside the kernel. Check that you are passing the input correctly and that you are reading the correct data back from the GPU.
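One way to check the data coming back from the GPU is to compare it against a plain CPU computation on the same input. The sketch below is only an illustration: the names autocorr_ref and max_abs_diff are hypothetical, and it assumes the unnormalized definition r[k] = sum of x[i] * x[i + k], which may differ from what your kernel computes.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical CPU reference (assumed definition, not taken from the kernel):
   r[k] = sum over i of x[i] * x[i + k], for lags k = 0 .. max_lag - 1. */
void autocorr_ref(const double *x, size_t n, double *r, size_t max_lag)
{
    assert(x != NULL && r != NULL);
    for (size_t k = 0; k < max_lag; ++k) {
        double sum = 0.0;
        /* Only pairs where both indices stay inside the input are summed. */
        for (size_t i = 0; i + k < n; ++i)
            sum += x[i] * x[i + k];
        r[k] = sum;
    }
}

/* Largest absolute element-wise difference between two arrays of length n. */
double max_abs_diff(const double *a, const double *b, size_t n)
{
    assert(a != NULL && b != NULL);
    double worst = 0.0;
    for (size_t i = 0; i < n; ++i) {
        double d = a[i] - b[i];
        if (d < 0.0)
            d = -d;
        if (d > worst)
            worst = d;
    }
    return worst;
}
```

Run autocorr_ref on the same host array you upload, then call max_abs_diff on its output and the buffer you read back. A large difference on a small input points to the host-side transfer rather than the kernel, although small differences from floating-point summation order are expected.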

BTW, I suppose you are using a GPU that supports double precision, since you are doing double calculations. Check that the values you pass are doubles too. Remember that you CAN'T point a float pointer at a double value, or vice versa. That will give you wrong results.
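To see why mixing float and double storage gives wrong results, here is a small host-side sketch in plain C. Both function names are made up for illustration, and it assumes the usual 4-byte float and 8-byte double.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Wrong: reads the bytes of two adjacent floats as if they were one double.
   This is what effectively happens when a float buffer is handed to code
   that expects double data. */
double misread_floats_as_double(const float *src)
{
    assert(src != NULL);
    assert(sizeof(double) == 2 * sizeof(float)); /* assumed layout */
    double d;
    memcpy(&d, src, sizeof d);
    return d;
}

/* Right: convert each element, so every value is stored as a real double. */
void widen_to_double(const float *src, double *dst, size_t n)
{
    assert(src != NULL && dst != NULL);
    for (size_t i = 0; i < n; ++i)
        dst[i] = (double)src[i];
}
```

With the input {1.0f, 2.0f}, widen_to_double yields 1.0 and 2.0, while misread_floats_as_double returns a value that matches neither. If the kernel argument is a double pointer, the host array should be double as well, and the buffer size should be counted in sizeof(cl_double) rather than sizeof(float).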
