Yuhang He
Yuhang He
I have the same question as well
Same problem. The program gets stuck at `ray::WorkerDict.ref_init_model`. I think it's related to vLLM?
Same request. I am looking forward to using Megatron for training LLM with GRPO. It can help me save GPU resources. I have seen that Verl added Megatron GRPO support....
> decrease `vllm_gpu_memory_utilization` > > btw > > ``` > --sleep_level 1 > --offload_model true > --offload_optimizer true > --gc_collect_after_offload true > ``` > > These options are intended for...