GPTQModel
GPTQModel copied to clipboard
[DOC] All of the Quantization Examples have wrong reference to `backends`.
This is typo/wrong doc title. Actual Quantization currently is using a fixed backend and any backend you use will not improve or materially change the quantization process. Backend is primarily used for post-quant inference. We will fix this and make it more clean. Thanks for the report.