A warp is 32 threads. Does the 32 threads execute in parallel in a Multiprocessor?
If 32 threads are not executing in parallel then there is no race condition in the warp.
I
Yes. The 32 threads in a WARP will execute in parallel. The GPU is a SIMT (single instruction multiple thread) machine, single instruction which is executed by multiple threads in parallel.
Btw, SIMT is somewhat of a marketing term, it is basically the same as SIMD.