ZHENG, Zhen
Results
2
issues of
ZHENG, Zhen
It requires external only results have the same num-elements. Non external only results can have different number of elements.
enhancement
Thanks for the great work! This PR supports more models of LLaMA/Qwen2/Mistral. It also supports the model who has attention_bias (e.g., Qwen2.5 models).