Yibin Li

Results 2 issues of Yibin Li

* Support Linear (row major) block scale factor layout in FP4 quantize kernel. This layout is used for trtllm-gen MOE FP4 kernel. * New Unit tests added to test the...

@coderabbitai summary ## Description Add LoRA adapter and perf test for the pytorch backend implementation of starcoder2 ## Test Coverage ## PR Checklist Please review the following before submitting your...