scaleeval
scaleeval copied to clipboard
Scalable Meta-Evaluation of LLMs as Evaluators
Results
1
scaleeval issues
Sort by
recently updated
recently updated
newest added
Hi, Thanks for the super cool work. Do you plan to release the evaluation dataset? (with human judgments)