Jeremy Malloch

3 comments by Jeremy Malloch

Hey, a couple of questions to tack on: 1. Which version of TensorRT will this be included in? Is that TRT 8.7? And if `--profilingVerbosity=detailed` dumps the data, does that mean...

Hey, one follow-up question. How does the fusion scheme work for pre-norm transformers (where the layer norm is applied only to the residual branch and not to the identity branch)?...