Halide icon indicating copy to clipboard operation
Halide copied to clipboard

[ARM] Generate mixed-arg dot product instructions

Open rootjalex opened this issue 3 years ago • 0 comments

On the ARM backend, we should be targeting the USDOT/SUDOT instructions for mixed-sign dot products, i.e. when compiling conv3x3 with accumulator type Int(32). LLVM exposes an intrinsics for USDOT.

rootjalex avatar Sep 21 '22 22:09 rootjalex