boblee22
Results
2
issues of
boblee22
Hi, Thanks for your amazing work! When do you plan to release finetuned model checkpoints? Thank you very much!
Hi, thanks for your great work! I have a question about the sampling process. When both top-K and top-p are enabled (e.g., https://github.com/allenai/RL4LMs/blob/main/scripts/training/task_configs/common_gen/t5_nlpo.yml#L44-L51), isn't top-p just ignored because the K...