I\'m working with some large data using the cublas library for matrix multiplication. To save memory space, I want something like A=A*B where A and B