Fastest Offset Read for a Small Array
问题 For speed, I would like to read one of 8 registers referenced by the value in a 9th register. The fastest way I see to do this is to use 3 conditional jumps (checking 3 bits in the 9th register). This should have shorter latency than the standard way of doing this with an offset memory read, but this still requires at least 6 clock cycles (at least one test plus one conditional jmp per bit check). Is there any commercial CPU (preferably x86/x64) with an intrinsic to do this "offset register