Ziming Mao

Results 4 comments of Ziming Mao

@pingyu Thanks, I believe I am on the latest commit.

The current nightly `FROM lmsysorg/sglang:nightly-dev-cu13-20251116-de7eaa7c` still gives the same issue:

```
  File "/usr/local/lib/python3.12/dist-packages/flashinfer/fused_moe/core.py", line 1379, in trtllm_fp8_block_scale_moe_op
    moe_op.trtllm_fp8_block_scale_moe(
  File "python/tvm_ffi/cython/function.pxi", line 901, in core.Function.__call__
TypeError: Mismatched type on argument #17...
```

Thanks for the answer. What do you mean by "written to disk"? I am also wondering why low latency mode does not need these primitives.