SEGFAULT in -O3 mode?

傲寒 2020-12-18 04:29

I reduced my problem to the following short program.

It causes a SEGFAULT in -O3 mode only (-O2 works fine). According to gdb, it happens at *f =

1 answer
  • 2020-12-18 04:50

    The SEGFAULT was not caused solely by a violation of the strict aliasing rule: the problem persisted even with the -fno-strict-aliasing flag.

    It was indeed an unaligned memory access, but the explanation is not that simple, because modern processors generally allow unaligned memory access, and nowadays there is not even much overhead. I did some benchmarking and did not observe a big difference between aligned and unaligned reads on my Intel(R) Xeon(R) CPU E5-2680 v2 @ 2.80GHz. There are also some very similar (and more or less recent) results on the web.
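To make "unaligned" concrete: an address is aligned for a type when it is a multiple of that type's alignment. The following is a minimal sketch of such a check; `is_aligned` is a hypothetical helper name, not something from the original program, and it assumes the usual flat address model in which converting a pointer to `uintptr_t` yields its numeric address.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: returns nonzero if p is a multiple of `alignment`.
 * `alignment` must be a nonzero power of two (4 for float on typical
 * targets, 16 for SSE aligned loads and stores). */
static int is_aligned(const void *p, size_t alignment)
{
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    /* The pointer-to-integer mapping is implementation-defined; on
     * mainstream platforms it is the plain address. */
    return ((uintptr_t)p & (alignment - 1)) == 0;
}
```

Calling it on a pointer just before the faulting loop (for example `is_aligned(f, sizeof *f)`) is one way to confirm whether the pointer is misaligned for its own type, which is a stronger condition than merely not being 16-byte aligned.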

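    My problem was that -O3 mode enables the -ftree-vectorize flag, so my for loop was vectorized (as I could see using the -ftree-vectorizer-verbose flag). And (AFAIU) the vectorized code relied on aligned SIMD memory instructions, which, unlike ordinary scalar accesses, fault on a misaligned address (unaligned SIMD variants exist, but the compiler may assume that a pointer is at least aligned for its type), so there was a runtime exception.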
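The original program is not shown here, so the following is only a sketch under an assumed pattern: a loop that stores `float` values at a byte address that may not be a multiple of `sizeof(float)`. Casting such an address to `float *` and writing through it is undefined behavior in C, which is what lets the vectorizer emit aligned stores. Going through `memcpy` avoids forming a misaligned `float *` at all. The names `fill_floats_unaligned` and `read_float_unaligned` are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: write n copies of `value` starting at dst.
 * dst may be misaligned (e.g. buf + 1), so no float* to it is ever formed. */
static void fill_floats_unaligned(unsigned char *dst, size_t n, float value)
{
    assert(dst != NULL);
    for (size_t i = 0; i < n; ++i) {
        /* memcpy has no alignment requirement on its arguments, so the
         * compiler cannot assume dst is float-aligned here. */
        memcpy(dst + i * sizeof(float), &value, sizeof(float));
    }
}

/* Hypothetical helper: read the i-th float from a possibly misaligned buffer. */
static float read_float_unaligned(const unsigned char *src, size_t i)
{
    float out;
    assert(src != NULL);
    memcpy(&out, src + i * sizeof(float), sizeof(float));
    return out;
}
```

For small fixed sizes like this, optimizing compilers typically turn the `memcpy` calls into ordinary loads and stores, so the safe version usually costs little. The loop can still be vectorized, but the compiler then has to pick instructions that tolerate a misaligned address.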

    This article helped me a lot in understanding the theory. It seems that unaligned memory access is not as harmful today as it used to be, but it is still tricky.
