evaluation
evaluation copied to clipboard
Add LinCE Testbed to Full Benchmark
I could try working on this one; contains multiple subtasks though, not sure how to handle them all.
See bigscience-workshop/promptsource#746