Fix baichuan smoothquant/INT8 KV cache build error

Open BasicCoder opened this issue 2 years ago • 0 comments

The baichuan convert script lacks scale_y_accum_quant, scale_w_quant_orig value saving.

Nov 17 '23 03:11 BasicCoder

triaged

Community want to contribute

Generic Runtime