Dot Product of Vectors with SIMD
问题 I am attempting to use SIMD instructions to speed up a dot product calculation in my C code. However, the run times of my functions are approximately equal. It'd be great if someone could explain why and how to speed up the calculation. Specifically, I'm attempting to calculate the dot product of two arrays with about 10,000 elements in them. My regular C function is as follows: float my_dotProd( float const * const x, float const * const y, size_t const N ){ // N is the number of elements in