TensorRT-LLM
Question regarding RowLinear and ColumnLinear
Hi, I would like to know when to use RowLinear and when to use ColumnLinear. I see them used in conjunction in mlp.py and attention.py, but I'm finding it difficult to work out which one is the correct or more efficient choice when defining custom Linear layers.
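For context, here is a minimal sketch of how I currently understand the pairing. It is plain Python, not the TensorRT-LLM API, and it rests on two assumptions: that ColumnLinear shards the weight along the output dimension, and that RowLinear shards it along the input dimension and sums the per-rank partial results. The helper names (`split_columns`, `split_rows`) and the `[in, out]` weight layout are hypothetical and only there for illustration.

```python
# Illustrative only: plain-Python stand-ins, not TensorRT-LLM classes.
# Assumed convention: y = x @ W, with W stored as [in_features, out_features].

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def relu(m):
    """Elementwise ReLU, so it can be applied to each shard independently."""
    return [[max(v, 0) for v in row] for row in m]

def add(a, b):
    """Elementwise sum, standing in for the all-reduce across ranks."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def split_columns(w, parts):
    """Shard along the output dimension (assumed ColumnLinear behaviour)."""
    step = len(w[0]) // parts
    return [[row[i * step:(i + 1) * step] for row in w] for i in range(parts)]

def split_rows(w, parts):
    """Shard along the input dimension (assumed RowLinear behaviour)."""
    step = len(w) // parts
    return [w[i * step:(i + 1) * step] for i in range(parts)]

x = [[1, -2], [3, 4]]                    # batch 2, hidden 2
w1 = [[1, 0, -1, 2], [2, 1, 0, -1]]      # hidden 2 -> intermediate 4
w2 = [[1, 0], [0, 1], [1, 1], [-1, 2]]   # intermediate 4 -> hidden 2

# Unsharded MLP: the result every sharding scheme has to reproduce.
reference = matmul(relu(matmul(x, w1)), w2)

tp_size = 2
w1_shards = split_columns(w1, tp_size)   # first layer: column-parallel
w2_shards = split_rows(w2, tp_size)      # second layer: row-parallel

# Each "rank" works only on its own shards; no communication is needed
# between the two layers because the column shard feeds the matching row shard.
partials = [matmul(relu(matmul(x, a)), b) for a, b in zip(w1_shards, w2_shards)]

# A single reduction at the end recovers the full output.
sharded = partials[0]
for p in partials[1:]:
    sharded = add(sharded, p)

assert sharded == reference
```

If this picture is right, the column-then-row ordering needs only one reduction per pair of layers. What I am unsure about is which of the two to pick for a custom Linear layer that does not come as part of such a pair.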