fenghuohuo issues

Results: 1 issue by fenghuohuo

[Bug]: RuntimeError: Int8 not supported for this architecture

When I used the GPTQ method of the llmcompressor library to quantize Qwen3-VL-4B to int8 on an RTX 5090 graphics card and then ran inference with vLLM 0.11.0, the...