Micro fusion and addressing modes

前端 未结 4 2029
后悔当初
后悔当初 2020-11-21 06:07

I have found something unexpected (to me) using the Intel® Architecture Code Analyzer (IACA).

The following instruction using [base+index] addressing

4条回答
  •  星月不相逢
    2020-11-21 06:49

    Older Intel processors without a uop cache can do the fusion, so maybe this is a drawback of the uop cache. I don't have the time to test this right now, but I will add a test for uop fusion next time I update my test scripts. Have you tried with FMA instructions? They are the only instructions that allow 3 input dependencies in an unfused uop.

提交回复
热议问题