TensorRT-LLM
TensorRT-LLM copied to clipboard
Fix baichuan smoothquant/INT8 KV cache build error
The baichuan convert script lacks scale_y_accum_quant, scale_w_quant_orig value saving.