CUDA streams not overlapping

-上瘾入骨i 2021-02-20 16:16

I have something very similar to the following code:

int k, no_streams = 4;
cudaStream_t stream[no_streams];
for(k = 0; k < no_streams; k++) cudaStreamCreate(&strea         


        
2 answers
  •  Happy的楠姐
    2021-02-20 17:09

    To see kernels overlapping with other kernels (concurrent kernels), use the Visual Profiler that ships with the CUDA 5.0 Toolkit. I don't think earlier profiler versions can show this. Its timeline should also show overlap between kernels and memcpy operations.
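For something concrete to inspect in the profiler timeline, here is a minimal sketch of a multi-stream setup in which copy/kernel overlap is at least possible. It is not the asker's code, which is cut off above. The kernel `scale`, the function `run_stream_demo`, and the chunk size are hypothetical names and values chosen for illustration. It assumes a device with at least one copy engine and uses pinned host memory, since asynchronous copies from pageable memory generally do not overlap with kernels.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel, for illustration only: doubles each element.
__global__ void scale(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

// Returns the number of wrong elements (0 on success), or -1 on a CUDA error.
int run_stream_demo()
{
    const int no_streams = 4;
    const int chunk = 1 << 20;                 // elements per stream (arbitrary)
    const int total = no_streams * chunk;
    const size_t chunk_bytes = chunk * sizeof(float);

    float *h_data = NULL, *d_data = NULL;
    // Pinned (page-locked) host memory, so the async copies can overlap with kernels.
    if (cudaMallocHost((void **)&h_data, total * sizeof(float)) != cudaSuccess)
        return -1;
    if (cudaMalloc((void **)&d_data, total * sizeof(float)) != cudaSuccess) {
        cudaFreeHost(h_data);
        return -1;
    }
    for (int i = 0; i < total; i++) h_data[i] = 1.0f;

    cudaStream_t stream[no_streams];
    for (int k = 0; k < no_streams; k++) cudaStreamCreate(&stream[k]);

    // Each stream gets its own copy-in, kernel, copy-out sequence.
    // Work in different streams has no ordering constraint, so it may overlap.
    for (int k = 0; k < no_streams; k++) {
        float *h = h_data + k * chunk;
        float *d = d_data + k * chunk;
        cudaMemcpyAsync(d, h, chunk_bytes, cudaMemcpyHostToDevice, stream[k]);
        scale<<<(chunk + 255) / 256, 256, 0, stream[k]>>>(d, chunk);
        cudaMemcpyAsync(h, d, chunk_bytes, cudaMemcpyDeviceToHost, stream[k]);
    }

    // Wait for all streams, then check that every element was doubled.
    cudaError_t err = cudaDeviceSynchronize();
    int wrong = 0;
    if (err == cudaSuccess) {
        for (int i = 0; i < total; i++)
            if (h_data[i] != 2.0f) wrong++;
    }

    for (int k = 0; k < no_streams; k++) cudaStreamDestroy(stream[k]);
    cudaFree(d_data);
    cudaFreeHost(h_data);
    return (err == cudaSuccess) ? wrong : -1;
}
```

Call `run_stream_demo()` from `main`, build with `nvcc`, and open the run in the profiler timeline. With kernels this large, each one probably fills the GPU by itself, so expect copy/kernel overlap rather than kernel/kernel overlap. Whether any overlap appears at all depends on the device.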
