briskerkazoos

2 comments by briskerkazoos

After testing both `baseline` and `greedy` on the C4 dataset on an A100, I get the following results: Baseline: `total time :110.10318s, latency :0.02298s, decoding step: 4791` Greedy: `total time :144.56247s, latency...
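As a sanity check on the Baseline numbers, the reported latency looks like the total time divided by the number of decoding steps. That relationship is an assumption inferred from the figures, not something the benchmark script is confirmed to do, and the helper name `per_step_latency` is hypothetical. The Greedy line is cut off, so only the Baseline values are used in this minimal sketch:

```python
def per_step_latency(total_time_s: float, decoding_steps: int) -> float:
    """Average wall-clock seconds per decoding step.

    Assumption: latency = total time / decoding steps. This is inferred
    from the reported Baseline numbers, not taken from the benchmark code.
    """
    if decoding_steps <= 0:
        raise ValueError("decoding_steps must be positive")
    return total_time_s / decoding_steps


# Baseline figures quoted in the comment above.
baseline_latency = per_step_latency(110.10318, 4791)
print(f"baseline latency: {baseline_latency:.5f}s")  # baseline latency: 0.02298s
```

If the same relationship holds for Greedy, comparing per-step latency alone can be misleading when the two runs take a different number of decoding steps, so total time and step count should be read together.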

> > Some explanation: draft_time is the time for one draft model's forward pass. target_time is the time for one draft model's forward pass corresponding to the valid budget. >...