LIU Man

Results: 4 comments by LIU Man

Hi @stolam, I have also tried iPQ, but ran into the same problem: the quantized model cannot run on CPU alone. I also tried torch dynamic_quantization, but encountered a lot...

Same problem here; it doesn't match for me either.

It seems so. Please check this: https://github.com/facebookresearch/fairseq/blob/920a548ca770fb1a951f7f4289b4d3a0c1bc226f/fairseq/model_parallel/modules/multihead_attention.py#L128