TensorRT-LLM
TensorRT-LLM copied to clipboard
Attention Pattern Matching with Inductor Utilities
Apply approach in #4064 for attention pattern matching. This will greatly simplify our pattern matchers in this file