
Does TensorRT support the fusion of LeakyReLU and Conv?

Open pangr opened this issue 3 years ago • 6 comments

pangr avatar Jul 08 '22 07:07 pangr

You may refer to https://docs.nvidia.com/deeplearning/tensorrt/developer-guide/index.html#fusion-types for the supported fusion types.

zerollzeng avatar Jul 08 '22 13:07 zerollzeng

Or create a test case and run it with trtexec --verbose; you will see the final engine structure in the log, which will tell you whether TRT can support your case.
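
A minimal sketch of such a test case, assuming PyTorch is available (the layer sizes and file name are illustrative placeholders, not the reporter's actual network):

```python
# Export a tiny Conv -> LeakyReLU model to ONNX so trtexec can build it.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Conv2d(8, 16, kernel_size=3, padding=1),  # Conv
    nn.LeakyReLU(0.1),                           # LeakyReLU directly after it
).eval()

dummy = torch.randn(1, 8, 32, 32)
torch.onnx.export(model, dummy, "conv_lrelu.onnx", opset_version=13)
```

Then build it with, for example, `trtexec --onnx=conv_lrelu.onnx --fp16 --verbose` and inspect the layer names printed for the built engine; when the fusion happens, the Conv and the LeakyReLU typically show up as a single engine layer.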

zerollzeng avatar Jul 08 '22 13:07 zerollzeng

OK, thank you for your reply.

pangr avatar Jul 12 '22 01:07 pangr

Does TensorRT support LeakyReLU quantization?

pangr avatar Jul 12 '22 03:07 pangr

Whether the fusion happens depends on whether TRT has tactics supporting it. The very rough guidelines are:

  • Conv+LeakyReLU should be fused in FP16, or in INT8 on Turing and Ampere GPUs.
  • Other precisions and older GPUs are not guaranteed. If TRT doesn't have a tactic supporting this fusion, it will fall back to a two-kernel approach.
  • If QAT (i.e. Q/DQ ops) is used, then Conv+LeakyReLU can only be fused if there are no Q/DQ ops between Conv and LeakyReLU and there are Q/DQ ops before Conv and after LeakyReLU. Basically, it will be fused if the pattern looks like: ...->Q->DQ->Conv->LeakyReLU->Q->DQ->... (a sketch of this Q/DQ placement follows after this list).
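
For illustration only, here is a sketch of that Q/DQ placement built directly with onnx.helper; the scales, shapes, and names are made up, and weight Q/DQ is omitted for brevity (a real QAT export would normally quantize the weights as well):

```python
import numpy as np
import onnx
from onnx import TensorProto, helper, numpy_helper

# Illustrative quantization parameters and weights.
scale = numpy_helper.from_array(np.array(0.02, dtype=np.float32), "scale")
zp = numpy_helper.from_array(np.array(0, dtype=np.int8), "zp")
weight = numpy_helper.from_array(
    np.random.randn(16, 8, 3, 3).astype(np.float32), "conv_w")

nodes = [
    # ...->Q->DQ-> in front of the Conv
    helper.make_node("QuantizeLinear", ["x", "scale", "zp"], ["x_q"]),
    helper.make_node("DequantizeLinear", ["x_q", "scale", "zp"], ["x_dq"]),
    # Conv followed directly by LeakyReLU, with no Q/DQ in between
    helper.make_node("Conv", ["x_dq", "conv_w"], ["conv_out"], pads=[1, 1, 1, 1]),
    helper.make_node("LeakyRelu", ["conv_out"], ["act_out"], alpha=0.1),
    # ->Q->DQ->... after the LeakyReLU
    helper.make_node("QuantizeLinear", ["act_out", "scale", "zp"], ["y_q"]),
    helper.make_node("DequantizeLinear", ["y_q", "scale", "zp"], ["y"]),
]

graph = helper.make_graph(
    nodes, "conv_lrelu_qdq",
    [helper.make_tensor_value_info("x", TensorProto.FLOAT, [1, 8, 32, 32])],
    [helper.make_tensor_value_info("y", TensorProto.FLOAT, [1, 16, 32, 32])],
    initializer=[scale, zp, weight],
)
model = helper.make_model(graph, opset_imports=[helper.make_opsetid("", 13)])
onnx.save(model, "conv_lrelu_qdq.onnx")
```

Building this with `trtexec --onnx=conv_lrelu_qdq.onnx --int8 --verbose` should let you confirm from the engine layer names in the log whether the Conv and LeakyReLU were fused.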

nvpohanh avatar Jul 12 '22 03:07 nvpohanh

Thank you for your reply. I tried the above method, but in the dumped inference profile (screenshots attached) LeakyReLU and Conv do not seem to be fused.

pangr avatar Jul 12 '22 12:07 pangr

@pangr what's your GPU, and have you tried the latest 8.6 GA? Thanks!

ttyio avatar Jul 18 '23 17:07 ttyio

Closing legacy issues; please reopen if you still have the issue with the latest TRT, thanks!

ttyio avatar Aug 15 '23 17:08 ttyio