verl icon indicating copy to clipboard operation
verl copied to clipboard

Example for code RL training

Open c-box opened this issue 11 months ago • 1 comments

Very nice library!

I noticed that the current examples are for math task training. Would you consider adding an example for code generation tasks, including some recommended settings?

Moreover, while currently attempting some simple training for code generation, I found that the training speed is significantly slower compared to math tasks, and GPU utilization is often very low. Can you provide some possible suggestions?

c-box avatar Feb 24 '25 12:02 c-box

+1

lihaoling avatar Mar 06 '25 16:03 lihaoling

+1

wenyi-li avatar Aug 15 '25 08:08 wenyi-li