boblee22

Results 2 issues of boblee22

Hi, Thanks for your amazing work! When do you plan to release finetuned model checkpoints? Thank you very much!

Hi, thanks for your great work! I have a question about the sampling process. When both top-K and top-p are enabled (e.g., https://github.com/allenai/RL4LMs/blob/main/scripts/training/task_configs/common_gen/t5_nlpo.yml#L44-L51), isn't top-p just ignored because the K...