CUDA has the API call
cudaError_t cudaMemset (void *devPtr, int value, size_t count)
which fills a buffer with a single-byte value. I want to f
Since about CUDA 3.0, runtime API device pointers (and everything else) have been interoperable with the driver API. So yes, you can use cuMemsetD32 to fill a runtime API allocation with a 32-bit value. Note that the last argument of cuMemsetD32 is a count of 32-bit elements, not bytes. The size of CUdeviceptr matches the size of void * on your platform, and it is safe to cast a pointer from the runtime API to CUdeviceptr or vice versa.