Is it possible to call a CUDA CUBLAS function from a global or device function?
Question

I'm trying to parallelize an existing application. Most of it is already parallelized and running on the GPU, but I'm having trouble migrating one function. That function calls dtrsv, which is part of the BLAS library; see below.

    void dtrsv_call_N(double* B, double* A, int* n, int* lda, int* incx)
    {
        F77_CALL(dtrsv)("L", "T", "N", n, B, lda, A, incx);
    }

I've been able to call the equivalent CUDA/cuBLAS function as shown below, and the results produced are equivalent to the Fortran