I\'m building an interpreter and as I\'m aiming for raw speed this time, every clock cycle matters for me in this (raw) case.
Do you have any experience or informati
You may be barking up the wrong tree. Cache misses can be much more important than the number of instructions that get executed.