GPTQModel
GPTQModel copied to clipboard
Torch Linear instead of Triton Linear
Hey Team,
In our tests in transformers we were expecting the layer type to be tritonv2 for T4 gpus, but after the latest release it's torch. Any ideas why ? Thanks a lot !