reward-modeling topics

official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives

holarissun

inverse-reinforcement-learning

large-language-models

largelanguagemodels

llm-aligment

Science-T2I

62

Stars

4

Forks

62

Watchers

[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis

Jialuo-Li

benchmark

computer-vision

dataset

generative-model

learning-from-rewards-llm-papers

60

Stars

2

Forks

60

Watchers

A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-...

bobxwu

guided-decoding

large-language-models

llm

llms

qa_metrics

59

Stars

7

Forks

59

Watchers

An easy python package to run quick basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluation metrics: Black-box and Open-Source large language model promp...

zli12321

exact-matching

llm

llm-evaluation

llm-evaluation-framework