左其右

Results 2 comments of 左其右

This unofficial AWQ model use q_group_size=128 (https://huggingface.co/PointerHQ/Qwen2.5-VL-72B-Instruct-Pointer-AWQ) Which has issue with both vLLM and SGLang (Not sure if it's intentional). As a workaround, I have quantized it with q_group_size=64, which...