transformers
transformers copied to clipboard
Bugfix convert_llama_weights_to_hf.py
What does this PR do?
Division with n_heads_kv_local is duplicated inside permute function, so removed.