Chris Scott
I am also interested in this PR. I wanted to deploy the 235B model in AWQ.
I have the same issue with https://huggingface.co/Qwen/Qwen3-235B-A22B-GPTQ-Int4 using TP=4.
I am also interested in this because I have several local machines, or sometimes a single machine, running multiple vLLM endpoints with different models.
Anything you want to change to merge this?
> Thank you very much! Thank you for the awesome project.
Same problem on 4 Ada A6000s.
I have this working with the latest 0.9. The GPTQ
> [@getfit-us](https://github.com/getfit-us) does it work with 0.8.5.post1?

I am not sure. I do know that if you install the latest version, it seems to work. Although, I believe the tool calling is...
#800. I know it works for me in the latest version.