Retaining dot product on GPGPU using CUBLAS routine


天命终不由人 2020-12-11 20:44

I am writing code to compute the dot product of two vectors using the CUBLAS dot product routine, but it returns the value in host memory. I want to use the dot product for furt

2 answers

旧巷少年郎 (original poster)

2020-12-11 21:39

~~You can't, exactly, using CUBLAS.~~ As per talonmies' answer, starting with the CUBLAS V2 API (CUDA 4.0) the result can be written through a device pointer; refer to that answer for details. If you are using the V1 API, the result is a single value returned to the host, so it is trivial to pass it as an argument to a kernel that uses it. You don't need an explicit cudaMemcpy, although one is implied in returning the value to the host.
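To make the V2 route concrete, here is a minimal sketch, assuming the CUBLAS V2 header (`cublas_v2.h`) and linking against `-lcublas`. It switches the handle to `CUBLAS_POINTER_MODE_DEVICE` so that `cublasSdot` writes its result to device memory, then consumes that value in a kernel. The names `scale_by_dot` and `dot_then_scale` are hypothetical, chosen only for illustration, and error checking is omitted for brevity.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>
#include <cublas_v2.h>

// Hypothetical consumer kernel: multiplies every element of y by the
// dot product, which it reads straight from device memory.
__global__ void scale_by_dot(float *y, const float *dot, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] *= *dot;
}

// Hypothetical helper: computes d = x . y on the GPU, then y[i] *= d,
// without ever copying d back to the host.
std::vector<float> dot_then_scale(const std::vector<float> &x,
                                  const std::vector<float> &y)
{
    int n = static_cast<int>(x.size());
    size_t bytes = n * sizeof(float);

    float *d_x = nullptr, *d_y = nullptr, *d_dot = nullptr;
    cudaMalloc(&d_x, bytes);
    cudaMalloc(&d_y, bytes);
    cudaMalloc(&d_dot, sizeof(float));   // the result lives on the device
    cudaMemcpy(d_x, x.data(), bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(d_y, y.data(), bytes, cudaMemcpyHostToDevice);

    cublasHandle_t handle;
    cublasCreate(&handle);
    // Tell CUBLAS that scalar results are passed as device pointers.
    cublasSetPointerMode(handle, CUBLAS_POINTER_MODE_DEVICE);
    cublasSdot(handle, n, d_x, 1, d_y, 1, d_dot);

    // Both the CUBLAS call and this kernel run on the default stream,
    // so the kernel sees the finished dot product.
    int threads = 256;
    int blocks = (n + threads - 1) / threads;
    scale_by_dot<<<blocks, threads>>>(d_y, d_dot, n);

    std::vector<float> out(n);
    cudaMemcpy(out.data(), d_y, bytes, cudaMemcpyDeviceToHost);

    cublasDestroy(handle);
    cudaFree(d_x);
    cudaFree(d_y);
    cudaFree(d_dot);
    return out;
}

int main()
{
    std::vector<float> x = {1.0f, 2.0f, 3.0f};
    std::vector<float> y = {4.0f, 5.0f, 6.0f};
    std::vector<float> r = dot_then_scale(x, y);
    for (float v : r) std::printf("%g ", v);
    std::printf("\n");
    return 0;
}
```

With x = (1, 2, 3) and y = (4, 5, 6) the dot product is 32, so the scaled vector should come back as (128, 160, 192). The only device-to-host copy is the final one for the output vector; the scalar itself never leaves the GPU.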

Starting with the Tesla K20 GPU and CUDA 5, CUBLAS routines became callable from device kernels using CUDA Dynamic Parallelism. This means you could call `cublasSdot` (for example) from inside a `__global__` kernel function, with the result returned on the GPU. Note that this device-side CUBLAS library was later deprecated and then removed in CUDA 10, so this option only applies to older toolkits.
