reward-modeling topic

List reward-modeling repositories

tasksource

189
Stars
11
Forks
189
Watchers

Datasets collection and preprocessings framework for NLP extreme multitask learning

DMoERM

18
Stars
0
Forks
18
Watchers

[ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling

IterComp

203
Stars
11
Forks
203
Watchers

[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

vector-inference

87
Stars
12
Forks
87
Watchers

Efficient LLM inference on Slurm clusters using vLLM.

RewardModelingBeyondBradleyTerry

70
Stars
4
Forks
70
Watchers

official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives

Science-T2I

62
Stars
4
Forks
62
Watchers

[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis

learning-from-rewards-llm-papers

60
Stars
2
Forks
60
Watchers

A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-...

qa_metrics

59
Stars
7
Forks
59
Watchers

An easy python package to run quick basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluation metrics: Black-box and Open-Source large language model promp...

hybrid-preferences

26
Stars
3
Forks
26
Watchers

Learning to route instances for Human vs AI Feedback (ACL Main '25)

LongRM

20
Stars
0
Forks
20
Watchers

Revealing and unlocking the context boundary of reward models