Chris Scott

9 comments by Chris Scott

I am also interested in this PR. I wanted to deploy the 235B model in AWQ.

I have the same issue with [Qwen/Qwen3-235B-A22B-GPTQ-Int4](https://huggingface.co/Qwen/Qwen3-235B-A22B-GPTQ-Int4) using TP=4.

I am also interested in this because I have different local machines, or maybe even the same machine, running multiple vLLM endpoints with different models.

Is there anything you want changed before this can be merged?

> Thank you very much! Thank you for the awesome project.

I have this working with the latest 0.9. The GPTQ

> [@getfit-us](https://github.com/getfit-us) does it work with 0.8.5.post1?

I am not sure. I do know that if you install the latest version, it seems to work. Although, I believe the tool calling is...

#800. I know it works for me in the latest version.