Consider using vectorization for floating point calculations

Open austinmorozane opened this issue 2 years ago • 1 comments

Describe the solution you'd like IEEE Superscalar SIMD architecture / loop parallelism or vectorization in code here can significantly speed up FP calculations, depending on the levels of floating precision needed. I would recommend evaluating how much precision is needed, and consider enabling this compiler optimization if there is room for small inaccuracy, for large speed increases. A paper with more on the topic can be found here : https://ieeexplore.ieee.org/document/234917 ;

Mar 31 '23 19:03 austinmorozane

Thanks for taking this into consideration @austinhutchen.

Apr 13 '23 13:04 aloiscochard