scaleeval
scaleeval copied to clipboard

→

Scalable Meta-Evaluation of LLMs as Evaluators

Results 1 scaleeval issues

Sort by recently updated

Hi, Thanks for the super cool work. Do you plan to release the evaluation dataset? (with human judgments)

Scalable Meta-Evaluation of LLMs as Evaluators

nlp

evaluation-framework

llm

generative-ai

Stars

Forks

Watchers

Stars

Forks

Watchers

Scalable Meta-Evaluation of LLMs as Evaluators