I\'m currently coding an application in C# which could benefit a great deal from using SSE, as a relative small piece of code causes 90-95% of the execution time. The code i
C# supports quite a few SIMD/SSE instructions natively in System.Numerics which is cross-platform. Dot product is a supported instruction.
HPCsharp nuget package on nuget.org, which I've been actively developing for the last two years, uses this capability to accelerate many algorithms. Let me know if certain useful algorithms could use acceleration thru SIMD/SSE and multi-core.