ms-swift
ms-swift copied to clipboard
Any plans to support megatron for GRPO training?
Hi guys,
Thanks for this great job! I am wondering if you have any plans to support megatron for GRPO training?
It would really help when it's needed to full GRPO train large size model, like 72B Qwen2.5VL.
Same request. I am looking forward to using Megatron for training LLM with GRPO. It can help me save GPU resources.
I have seen that Verl added Megatron GRPO support. I am wondering if ms-swift will add this feature in the future. (https://verl.readthedocs.io/en/latest/workers/megatron_workers.html)
stay tuned at https://github.com/modelscope/ms-swift/issues/4561