ScaleLLM
Quantization: Supporting FP8 for both models and KV caches
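FP8 quantization stores model weights and KV-cache entries in an 8-bit floating-point format (typically E4M3), roughly halving memory relative to FP16 at the cost of reduced precision. The sketch below illustrates the general idea of per-tensor FP8 quantization and dequantization using PyTorch's `torch.float8_e4m3fn` dtype; it is only an assumption-laden illustration of the concept, not ScaleLLM's actual implementation.

```python
import torch

def quantize_fp8(x: torch.Tensor):
    # Illustrative per-tensor scaling so values fit the FP8 E4M3 range (max ~448).
    finfo = torch.finfo(torch.float8_e4m3fn)
    scale = x.abs().amax().float().clamp(min=1e-6) / finfo.max
    x_fp8 = (x.float() / scale).clamp(finfo.min, finfo.max).to(torch.float8_e4m3fn)
    return x_fp8, scale

def dequantize_fp8(x_fp8: torch.Tensor, scale: torch.Tensor, dtype=torch.float16):
    # Restore an approximate higher-precision tensor from FP8 storage.
    return (x_fp8.float() * scale).to(dtype)

# Example: quantize a simulated KV-cache block of shape (heads, seq_len, head_dim).
kv = torch.randn(8, 64, 128, dtype=torch.float16)
kv_fp8, scale = quantize_fp8(kv)
kv_restored = dequantize_fp8(kv_fp8, scale)
print(kv_fp8.dtype, (kv - kv_restored).abs().max().item())
```

In practice, engines may use finer-grained (per-channel or per-block) scales and fused kernels rather than a round trip through full precision; the example only shows why a scale factor must be stored alongside the FP8 data.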