FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
promptslab
[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., pi0, pi0.5. Fully open-sourced.
ReinFlow