Zalman Stern

Results 16 issues of Zalman Stern

Ran into this assert. The entire destructuring of For names things has to go, but the code only accesses two elements here and I'm not sure why a two level...

Using Halide generated code inside Python requires user_context. For JIT, this is always present. For AOT, this would require specifying the user_context option throughout the entire pipeline used inside of...

Heavily based on Steve Suzuki's work here: https://github.com/halide/Halide/pull/6781 . Hopefully easier to merge with less effect on existing ARM support and fewer constraints on CodeGen_LLVM.

AVX10 and APX support is being added here: https://github.com/halide/Halide/pull/8052 . This will require runtime detection of these features. Per https://github.com/halide/Halide/pull/7840 and the comment added about how to do feature detection...

LLVM crashes with scalable vectors that have a minimum size of 1. Some cases that would use scalable vectors use fixed vectors instead to avoid this. Architecture-specific choice about whether...

In the SVE2 branch, slice_vector uses the newish `llvm.vector.insert` and `llvm.vector.extract` intrinsics for the scalable vector case. We should evaluate whether this approach is better to use in all cases.

This is a tracking bug recording a general thing encountered while adding SVE2 and broader scalable vector support. Currently, code generation for LLVM IR works by setting the `value` member...

CodeGen_ARM, for SVE, generates a runtime check that the current processor supports the vector length compiled for. This is done in `begin_func` by checking if the current function does not...
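As a rough illustration of the kind of check described in the last item, here is a minimal C++ sketch. It is not Halide's generated code: the names `VectorCheck`, `sve_bits_from_vscale`, and `check_vector_bits` are hypothetical, and it assumes "supports" means the runtime vector length exactly matches the compiled one. How the runtime length is actually queried is outside the sketch.

```cpp
#include <cstdint>

// Hypothetical result codes for illustration; not Halide runtime error codes.
enum class VectorCheck { Ok, Mismatch };

// SVE vector registers are a multiple of 128 bits; LLVM's vscale is that multiple.
inline uint32_t sve_bits_from_vscale(uint32_t vscale) {
    return 128u * vscale;
}

// compiled_bits: the vector length the code was generated for.
// runtime_bits:  the vector length reported by the current processor.
// Assumption: the generated code is only valid when the two are equal.
inline VectorCheck check_vector_bits(uint32_t compiled_bits, uint32_t runtime_bits) {
    return compiled_bits == runtime_bits ? VectorCheck::Ok : VectorCheck::Mismatch;
}
```

A function prologue could call `check_vector_bits` once and return an error on `Mismatch` before executing any vector code.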