language-model-arithmetic
language-model-arithmetic copied to clipboard
SPECULATIVE SAMPLING
Hello, I would like to ask why p2 needs to be corrected after the sampling is rejected, instead of directly using p2?