How to write a matrix-matrix product that can compete with Eigen?

伪装坚强ぢ 2020-12-24 03:31

Below is a C++ implementation comparing the time taken by Eigen and by a hand-written for loop to compute a matrix-matrix product. The for loop has been optimised to minimise cache misses. T

3 answers

悲哀的现实 (original poster)

2020-12-24 04:21

There are two simple optimizations that I would advise.

1) Vectorize it. Inline assembly or a separate assembly procedure gives you the most control, but compiler intrinsics work as well. You can even let the compiler vectorize the loop for you, although it is sometimes difficult to write a loop in a form the compiler is able to vectorize.

2) Make it parallel. Try using OpenMP.
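
A minimal sketch of both suggestions together, under some assumptions that are not in the question: square row-major matrices stored in flat `std::vector<double>`, and a hypothetical function name `matmul`. It takes the "let the compiler vectorize" route via `#pragma omp simd` rather than hand-written intrinsics, and it does no cache blocking, so treat it as a starting point and not as something that will match Eigen by itself.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Computes C = A * B for n x n row-major matrices stored in flat vectors.
// The layout and the name `matmul` are assumptions made for this sketch.
// C must be a different object from A and B.
void matmul(const std::vector<double>& A,
            const std::vector<double>& B,
            std::vector<double>& C,
            std::size_t n) {
    assert(A.size() == n * n && B.size() == n * n);
    C.assign(n * n, 0.0);

    const double* a = A.data();
    const double* b = B.data();
    double* c = C.data();
    const long rows = static_cast<long>(n);  // signed index for the OpenMP loop

    // 2) Parallelism: each thread handles distinct rows of C, so the
    //    threads never write to the same element.
    #pragma omp parallel for
    for (long i = 0; i < rows; ++i) {
        const std::size_t row = static_cast<std::size_t>(i) * n;
        for (std::size_t k = 0; k < n; ++k) {
            const double aik = a[row + k];
            // 1) Vectorization: with the i-k-j loop order the inner loop
            //    walks row k of B and row i of C contiguously, which is a
            //    form compilers can usually vectorize.
            #pragma omp simd
            for (std::size_t j = 0; j < n; ++j) {
                c[row + j] += aik * b[k * n + j];
            }
        }
    }
}
```

With GCC or Clang this would typically be built with something like `-O3 -march=native -fopenmp`. Without `-fopenmp` the pragmas are ignored and the code still runs correctly, just on a single thread.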
