DeepSpeed
improving int4 asymmetric quantization accuracy
Credits to Connor for this PR! It changes how the offset and scale are computed and applied in int4 asymmetric quantization, which improves quantization accuracy.
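
For readers unfamiliar with the terms, here is a minimal plain-Python sketch of what asymmetric int4 quantization with a scale and an offset generally looks like. This is not the code from this PR: the function names `quantize_int4_asym` and `dequantize_int4_asym` are hypothetical, and the min/max-based formulas are the textbook ones, assumed here purely for illustration.

```python
from typing import List, Tuple

QMIN, QMAX = 0, 15  # unsigned 4-bit code range


def quantize_int4_asym(values: List[float]) -> Tuple[List[int], float, float]:
    """Quantize one group of floats to 4-bit codes (hypothetical helper).

    Assumption: scale and offset come from the group's min and max.
    """
    lo, hi = min(values), max(values)
    # A constant group gives hi == lo and a scale of 0; fall back to 1.0.
    scale = (hi - lo) / (QMAX - QMIN) or 1.0
    # Float offset: code 0 maps back to the group minimum.
    offset = lo
    codes = [min(QMAX, max(QMIN, round((v - offset) / scale))) for v in values]
    return codes, scale, offset


def dequantize_int4_asym(codes: List[int], scale: float, offset: float) -> List[float]:
    """Map 4-bit codes back to approximate float values."""
    return [c * scale + offset for c in codes]


weights = [-0.30, -0.05, 0.10, 0.45, 1.20]
codes, scale, offset = quantize_int4_asym(weights)
restored = dequantize_int4_asym(codes, scale, offset)
# Worst-case round-trip error for this group.
max_err = max(abs(a - b) for a, b in zip(weights, restored))
```

In this sketch the round-trip error of any value is bounded by half the scale, so how the scale and offset are chosen and applied directly determines accuracy, which is the part this PR is described as changing.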