scaleeval icon indicating copy to clipboard operation
scaleeval copied to clipboard

Scalable Meta-Evaluation of LLMs as Evaluators

Results 1 scaleeval issues
Sort by recently updated
recently updated
newest added

Hi, Thanks for the super cool work. Do you plan to release the evaluation dataset? (with human judgments)