Quantization method is different from the paper?
Hello, thanks for your excellent work; this project has been very helpful to me.
However, I found that the quantization method in this project differs somewhat from the one described in the paper.
In the paper, the quantization functions are:
Weights are quantized using: (equation image not preserved), which I think may be a mistake in the paper.
Activations are quantized using: (equation image not preserved)
In this project, weights are quantized using: (equation image not preserved)
Activations are quantized using: (equation image not preserved)
In fact, according to quantize_module_.py, both weights and activations are quantized with the gemm method (an asymmetric uniform quantization method).
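For readers unfamiliar with the term, here is a minimal sketch of what asymmetric uniform quantization typically looks like. This is an illustrative example only, not the project's actual code: the function name and the min/max calibration are my assumptions, and real implementations usually keep the integer tensor and zero point around rather than dequantizing immediately.

```python
import numpy as np

def asymmetric_uniform_quantize(x, n_bits=8):
    # Hypothetical helper (not from the repo): map the observed range
    # [x_min, x_max] onto the unsigned integer grid [0, 2^n_bits - 1].
    qmax = 2 ** n_bits - 1
    x_min, x_max = float(x.min()), float(x.max())
    # Guard against a degenerate all-constant tensor.
    scale = (x_max - x_min) / qmax if x_max > x_min else 1.0
    # The zero point shifts the grid so that x_min maps near 0;
    # this asymmetry is what distinguishes it from symmetric schemes.
    zero_point = round(-x_min / scale)
    q = np.clip(np.round(x / scale) + zero_point, 0, qmax)
    # Dequantize back to floats to simulate the quantization error.
    return (q - zero_point) * scale

x = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])
xq = asymmetric_uniform_quantize(x, n_bits=4)
```

Each element of `xq` is the nearest point on a 16-level grid spanning the tensor's range, so the round-trip error is bounded by roughly half a quantization step.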
Is there any comparison between the two methods?
Best wishes.
There is a small mistake in the weight quantization formula in the paper. The weight quantization implementation in the code is correct.