mattam301

Results 7 comments of mattam301

Do you intend to upload the immediate version? Or could you suggest how to reproduce that version? Thanks in advance.

> I am able to do quantization with:
>
> ```
> model_fp32 = 'model_Segformer.onnx'
> model_quant = "model_dynamic_quant.onnx"
> quantized_model = quantize_dynamic(model_fp32, model_quant, weight_type=QuantType.QUInt8)
> ```
>
> instead...
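For readers unfamiliar with what `weight_type=QuantType.QUInt8` asks for: the weights end up stored as unsigned 8-bit integers. The snippet below is only a rough, pure-Python sketch of the general affine (scale and zero-point) idea behind that kind of storage. It is not ONNX Runtime's implementation, and the helper names `quantize_uint8` and `dequantize_uint8` and the sample weights are made up for illustration.

```python
def quantize_uint8(weights):
    """Map floats to codes in [0, 255] using an affine scale/zero-point scheme."""
    # Extend the range to include 0.0 so that zero is exactly representable.
    lo = min(min(weights), 0.0)
    hi = max(max(weights), 0.0)
    # Fall back to 1.0 when every weight is zero, to avoid dividing by zero.
    scale = (hi - lo) / 255 or 1.0
    zero_point = round(-lo / scale)
    codes = [min(255, max(0, round(w / scale) + zero_point)) for w in weights]
    return codes, scale, zero_point


def dequantize_uint8(codes, scale, zero_point):
    """Recover approximate floats from the 8-bit codes."""
    return [(c - zero_point) * scale for c in codes]


# Hypothetical weights, chosen only to show the round trip.
weights = [-1.0, -0.25, 0.0, 0.5, 1.5]
codes, scale, zero_point = quantize_uint8(weights)
restored = dequantize_uint8(codes, scale, zero_point)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(codes, scale, zero_point, max_error)
```

The round-trip error per weight stays on the order of one quantization step (`scale`), which is why 8-bit weights usually cost little accuracy while lower bit widths leave far fewer levels to work with.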

I have the same issue. What should I do to resolve the problem?

Hi, I am interested in that question too. I have successfully reproduced the int4 and int3 results in the paper, but when it comes to int2, the ppl is totally...

I encountered the same problem; the installation of VILA now needs updating.

I suppose he has the same issue I had when setting up the AWQ kernel on a machine with CUDA version 11.2. Running `python setup.py install` gives: `RuntimeError: The detected...`