Chenhui Zhang
Results
1
issues of
Chenhui Zhang
Fixes [#1735](https://github.com/vllm-project/vllm/issues/1735). This PR modifies the weight loading logic when `tp_size` is larger than `num_kv_heads`.