Po-Han Huang (NVIDIA) comments

Results: 229 comments of Po-Han Huang (NVIDIA)

fp16 output mismatch

@DingJuPeng1 You can try using the Polygraphy tool (https://github.com/NVIDIA/TensorRT/blob/main/tools/Polygraphy/how-to/debug_accuracy.md) to see which layer(s) produce wrong results. Also, I would suggest that you try a newer TRT version, like TRT 8.4 GA...
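To make the idea of a per-layer comparison concrete, here is a minimal sketch in plain Python. It is not Polygraphy's API: the function name `find_mismatched_layers`, the layer names, the values, and the tolerances are all hypothetical, and it assumes the per-layer outputs have already been dumped as flat lists of floats. It only illustrates the kind of check involved: compare each layer's reduced-precision output against a reference and report the layers that fall outside tolerance.

```python
import math


def find_mismatched_layers(reference, candidate, atol=1e-3, rtol=1e-3):
    """Return the names of layers whose outputs differ beyond tolerance.

    `reference` and `candidate` map layer names to flat lists of floats
    (hypothetical dumps, e.g. FP32 vs. FP16 runs of the same model).
    """
    mismatched = []
    for name, ref_values in reference.items():
        cand_values = candidate.get(name)
        # A missing layer or a size change counts as a mismatch.
        if cand_values is None or len(cand_values) != len(ref_values):
            mismatched.append(name)
            continue
        for ref, cand in zip(ref_values, cand_values):
            # NaN never compares as close; otherwise use |a - b| <= atol + rtol * |a|.
            if math.isnan(cand) or abs(ref - cand) > atol + rtol * abs(ref):
                mismatched.append(name)
                break
    return mismatched


# Made-up example data: only the last layer drifts noticeably.
fp32_outputs = {"conv1": [0.5, 1.25], "relu1": [0.5, 1.25], "softmax": [0.2, 0.8]}
fp16_outputs = {"conv1": [0.5, 1.2502], "relu1": [0.5, 1.25], "softmax": [0.35, 0.65]}

print(find_mismatched_layers(fp32_outputs, fp16_outputs))
```

The first layer reported is usually the most useful starting point, since errors in later layers may simply be inherited from it.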

Why do 7234 and 8406 differ in a layer's output dimensions when constructing the network

Yes, that was a bug already fixed in TRT 8.4.

ONNX model from TensorFlow failed to convert to TensorRT

@MarceJara Which TRT version did you use? Could you try TRT 8.4.1?

(Could not find any implementation for node {ForeignNode[Transpose_2713 + (Unnamed Layer* 4032) [Shuffle]...MatMul_2714]}.)

![2022-07-06 10_37_29-D__workD_m2m100_418M-decoder onnx - Netron](https://user-images.githubusercontent.com/53919306/177456063-6714bc40-c056-47e6-8210-5a5d584458d5.png) The input shapes of `input_ids` and `encoder_attention_mask` are both `[batch,seq_length]` in your ONNX model, but the optShapes you provide have a different `seq_length` between the...
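A quick sanity check for this kind of mismatch can be sketched in plain Python. The sketch assumes the truncated sentence refers to the two inputs named above, and that the shapes were passed in the `name:AxB,name:AxB` form used by trtexec's `--optShapes`. The helper names and the `1x32` / `1x64` values are hypothetical, since the actual shapes are not shown in the comment.

```python
def parse_shapes(spec):
    """Parse 'name:1x32,other:1x32' into {'name': (1, 32), 'other': (1, 32)}."""
    shapes = {}
    for entry in spec.split(","):
        # Split on the last ':' so tensor names containing ':' still parse.
        name, _, dims = entry.rpartition(":")
        shapes[name] = tuple(int(d) for d in dims.split("x"))
    return shapes


def check_shared_dims(shapes, names):
    """Return True if every listed input has exactly the same shape."""
    return len({shapes[n] for n in names}) == 1


# Hypothetical values: both inputs are [batch, seq_length], so they should agree.
inputs = ["input_ids", "encoder_attention_mask"]
bad = parse_shapes("input_ids:1x32,encoder_attention_mask:1x64")
good = parse_shapes("input_ids:1x32,encoder_attention_mask:1x32")

print(check_shared_dims(bad, inputs))
print(check_shared_dims(good, inputs))
```

Because both inputs share the same symbolic `seq_length` dimension in the model, a check like this would flag the first string and accept the second.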

(Could not find any implementation for node {ForeignNode[Transpose_2713 + (Unnamed Layer* 4032) [Shuffle]...MatMul_2714]}.)

Thanks. We will debug this. @zerollzeng Could you repro this and file an internal tracker bug? Thanks

What does “Reformatting CopyNode for Input Tensor” mean in trtexec's dump profile

Build engine file failed with INT8 calibration mode

TensorRT 8.2.2 and 8.2.3 objects are not cleaned properly

3D asymmetric padding is not supported on pre-sm70 GPUs