QuaRot icon indicating copy to clipboard operation
QuaRot copied to clipboard

Support more models.

Open JamesTheZ opened this issue 1 year ago • 0 comments

Thanks for the great work!

This PR supports more models of LLaMA/Qwen2/Mistral. It also supports the model who has attention_bias (e.g., Qwen2.5 models).

JamesTheZ avatar Dec 16 '24 14:12 JamesTheZ