Nick Comly
Nick Comly
## Bug Description FX front-end simple example is broken due to API change. Reported by @jaybdub File /opt/conda/lib/python3.8/site-packages/torch_tensorrt/_compile.py:116, in compile(module, ir, inputs, enabled_precisions, **kwargs) 114 lower_precision = LowerPrecision.FP16 115 elif...
## Bug Description When using the PyT-QAT toolkit, QAT perf is slower than PTQ, for TRT this is not the case. Torch-TRT: Model | Accuracy | Performance -- | --...
Hi all, this issue will track the feature requests you've made to TensorRT-LLM & provide a place to see what TRT-LLM is currently working on. Last update: `Jan 14th, 2024`...