GPU shared memory size is very small - what can I do about it?

前端未结

关注

 3  784

佛祖请我去吃肉 2020-12-16 13:03

The size of the shared memory (\"local memory\" in OpenCL terms) is only 16 KiB on most nVIDIA GPUs of today.
I have an application in which I need to create an array th

3条回答

春和景丽 (楼主)

2020-12-16 14:00
You can try to use cudaFuncSetCacheConfig(nameOfKernel, cudaFuncCachePrefer{Shared, L1}) function.

If you prefer L1 to Shared, then 48KB will go to L1 and 16KB will go to Shared. If you prefer Shared to L1, then 48KB will go to Shared and 16KB will go to L1.

Usage:
```
cudaFuncSetCacheConfig(matrix_multiplication, cudaFuncCachePreferShared);
matrix_multiplication<<>>(bla, bla, bla); 
```
0 讨论(0)

查看其它3个回答
发布评论:

提交评论
- 加载中...