TensorRT-LLM
How to quantize custom models, such as LVMs?
According to https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/quantization, can I define my own model and calibration process, and then simply call modelopt.torch.quantization.quantize()?
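For context, here is a minimal sketch of the workflow being asked about, based on the `modelopt.torch.quantization` API (`mtq.quantize(model, config, forward_loop)`). The model, the calibration data loader, and the choice of `INT8_DEFAULT_CFG` are illustrative assumptions, not taken from the linked example; this requires the `nvidia-modelopt` package and is untested here.

```python
import torch
import torch.nn as nn
import modelopt.torch.quantization as mtq  # from the nvidia-modelopt package

# A custom model stands in for an LVM here (illustrative assumption).
model = nn.Sequential(nn.Linear(256, 512), nn.ReLU(), nn.Linear(512, 256))

# Calibration forward loop: run representative data through the model so
# ModelOpt can collect activation statistics for the quantizers.
def forward_loop(m):
    m.eval()
    with torch.no_grad():
        for _ in range(8):  # a handful of calibration batches
            m(torch.randn(4, 256))  # replace with real calibration data

# Quantize in place using a built-in config (INT8 shown as an example;
# FP8/INT4-AWQ configs also exist in mtq).
model = mtq.quantize(model, mtq.INT8_DEFAULT_CFG, forward_loop)
```

After quantization, the TensorRT-LLM examples export a checkpoint from the quantized model before building an engine; whether a fully custom architecture is supported by that export step is the part that likely needs confirmation from the maintainers.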