Which information does GCC collect when I enable -fprofile-generate and which optimization does in fact uses the collected information (when setting the -
"What Every Programmer Should Know About Memory" by Ulrich Drepper https://people.freebsd.org/~lstewart/articles/cpumemory.pdf http://www.akkadia.org/drepper/cpumemory.pdf
In section 7.4
The result should run faster as it should be better at prefetching code into the processor instruction cache.