Siddharth Singh
Results
2
comments of
Siddharth Singh
If one wants to use per-layer cuda-graphs (--cuda-graph-scope full as of today in main), do we set --cuda-graph-scope as `attn mlp`? In that case, are we doubling the number of...
@sidsingh-nvidia [helps me keep track of MRs I am reviewing]