Performance penalty when invoking a cuda kernel

后端 未结 2 939
轻奢々
轻奢々 2021-02-08 20:22

I\'m wondering what the overhead of performing a cuda kernel call is in C/C++ such as the following:

somekernel1<<>>(args);
som         


        
2条回答
  •  我寻月下人不归
    2021-02-08 20:45

    If you are using Visual Studio Pro on Windows I sugest you run a test application using NVidia's Parallel NSight, I think it can tell you the time stamps from the method call to the real execution, in any case a penalty is inherent, but it will be negligible if your kernels lasts long enought.

提交回复
热议问题