DeepSpeed icon indicating copy to clipboard operation
DeepSpeed copied to clipboard

improving int4 asymmetric quantization accuracy

Open HeyangQin opened this issue 2 years ago • 0 comments

Credits to Connor for this PR! This PR changes the way offset and scale are computed and applied for int4 asymmetric quantization to improve the quantization accuracy.

HeyangQin avatar Apr 11 '23 21:04 HeyangQin