Compiling code containing dynamic parallelism fails

前端 未结 3 761
暖寄归人
暖寄归人 2020-12-03 19:09

I am doing dynamic parallelism programming using CUDA 5.5 and an NVDIA GeForce GTX 780 whose compute capability is 3.5. I am calling a kernel function inside a kernel functi

3条回答
  •  遥遥无期
    2020-12-03 19:34

    You need to let nvcc generate CC 3.5 code for your device. This can be done by adding this option to nvcc command line.

     -gencode arch=compute_35,code=sm_35
    

    You may find the CUDA samples on dynamic parallelism for more detail. They contain both command line options and project settings for all supported OS.

    http://docs.nvidia.com/cuda/cuda-samples/index.html#simple-quicksort--cuda-dynamic-parallelism-

提交回复
热议问题