Jan Dupej
Results
1
comments of
Jan Dupej
I'm nitpicking here. For `f32`, this horizontal sum boils down to: ``` haddps xmm0, xmm0 ; ICL (p01 2p5) lat=6, thr=1/2 ; Zen3 lat=6 thr=1/2 haddps xmm0, xmm0 ; ICL...