Usage of convert and fuse ?

Open 592319702 opened this issue 2 years ago • 1 comments

In the first step, if I didn't use python convert.py -q to generate a quantized model, is that mean it is unnecessay to use -d, --de-quantize parameter to generate a de-quantized model when running the command python fuse.py ?

Feb 02 '24 02:02 592319702

If you are not using the quant model to do the fine-tuning, you shouldn't de-quantize it during fuse. And if you try to de-quantize a non-quant model, fuse may throw an error.

Feb 02 '24 03:02 mzbac