I remember reading somewhere that to really optimize & speed up certain section of the code, programmers write that section in Assembly language. My questions are -
use this:
__asm__ __volatile__(/*assembly code goes here*/);
the __asm__ can also just be asm.
__asm__
The __volatile__ stops the compiler from making further optimizations.
__volatile__