Qwen2.5-Math
Qwen2.5-Math copied to clipboard
Any plan to release the GRPO code?
Congratulations to Qwen team! Another outstanding job!
I noticed that you use GRPO to RL your math model. For now, there is no released implementation of GRPO. Do you have any plan to release the code?
Thank you very much!
+1
+1 Any updates?