Do warp vote functions synchronize threads in the warp?
Question: Do CUDA warp vote functions, such as __any() and __all(), synchronize threads in the warp? In other words, is there any guarantee that all threads in the warp have executed the instructions preceding the warp vote function, especially the instruction(s) that manipulate the predicate?

Answer 1: The synchronization is implicit, since threads within a warp execute in lockstep. Code that relies on this behavior is known as "warp synchronous."

If you are thinking that conditional code will cause