program fails for array 30 x 30

前端 未结 2 665
南旧
南旧 2021-01-24 21:05

This is program for matrix multiplication on CUDA architecture. This code is working fine when size of array is 30 x 30 but giving output as a series of 0\'s when size is greate

2条回答
  •  忘掉有多难
    2021-01-24 21:51

    You are invoking the kernel with a configuration of 1 grid with size 30x30:

    matrix_multiply<<<1, SIZE * SIZE>>>(c_input1,c_input2,c_result,SIZE);
    

    There are not enough threads to process more.

提交回复
热议问题