Benhao Huang
Results: 4 comments of Benhao Huang
Same Issue
try
@CodiumAI-Agent /review
Yeah, your concern is correct. Take a look at this paper from NVIDIA, which discusses the problem in Section 2.2, "Curse of parallel decoding": https://arxiv.org/abs/2505.22618 hope...