tvm
Open deep learning compiler stack for CPU, GPU and specialized accelerators
```
/root/zyhuang/tvm/3rdparty/cutlass_fpA_intB_gemm/cutlass/include/cutlass/matrix.h:7828:7: error: 'struct cutlass::Matrix' has no member named 'set_slice3x3'; did you mean 'set_slice_3x3'? [-Wtemplate-body]
 7828 |     m.set_slice3x3({
      |       ^~~~~~~~~~~~
      |       set_slice_3x3
/root/zyhuang/tvm/3rdparty/cutlass_fpA_intB_gemm/cutlass/include/cutlass/matrix.h: In static member function 'static cutlass::Matrix cutlass::Matrix::reflection(const...
```
# Summary
This PR fixes https://github.com/apache/tvm/issues/17964 that occurs through the following process:
1. The Compress node produces a symbolic dimension "num_nonzero"
2. The `relax.op.add()` in the BiasGelu node adds...
### Expected behavior
TVM should run the model correctly.
### Actual behavior
The following model can be executed by onnxruntime; the results are as follows:
```c
ONNXRuntime: [array([[[[False,...
```
Before this PR, the randomly generated population size was sometimes wrong.
### Expected behavior
TVM-compiled model should match PyTorch reference within `rtol=0.01`, `atol=0.001`.
### Actual behavior
Mismatch detected in tensor **o2** (layer index 2):
### Environment
| Item | Value |...