rl-training topic

List rl-training repositories

Text-Summarizer-Pytorch

316
Stars
75
Forks
Watchers

Pytorch implementation of "A Deep Reinforced Model for Abstractive Summarization" paper and pointer generator network

Sim4Rec

43
Stars
1
Forks
Watchers

Simulator for training and evaluation of Recommender Systems

qa_metrics

59
Stars
7
Forks
59
Watchers

An easy python package to run quick basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluation metrics: Black-box and Open-Source large language model promp...

Gym

463
Stars
31
Forks
463
Watchers

Build RL environments for LLM training

free-form-grpo

16
Stars
0
Forks
16
Watchers

grpo to train long form QA and instructions with long-form reward model

AWorld

1.1k
Stars
113
Forks
1.1k
Watchers

Build, evaluate and train General Multi-Agent Assistance with ease