finetuning-rl topic

List finetuning-rl repositories

LLMtuner

228
Stars
14
Forks
Watchers

FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)

ReinFlow

227
Stars
21
Forks
227
Watchers

[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., pi0, pi0.5. Fully open-sourced.