TensorRT-LLM
TensorRT-LLM copied to clipboard
pass rotary_emb_base to gpt_attention
1.pass rotary_emb_base to gpt attention tensorrt_llm/models/qwen/model.py 2.change variable name: rotary_base -> rotary_emb_base examples/qwen/build.py