I am writing a simple cuda kernel where I am measuring the time of the DRAM access and I want to get the ptx and sass code from the cuda code. The device source code is as f