TensorRT
TensorRT copied to clipboard
❓ [Question] How to compile a model with A16W8?
Hi Torch-TensorRT team:
I'm wondering how can I compile a model with 8 bit weights, but using 16 bit activations? Thanks a lot!
May I know how you've obtained this model (model source) ? Is it finetuned with any quantization technique ?