What are the exact scores behind the task-level radar graph?

Open hzhua opened this issue 1 year ago • 0 comments

Could you please share the exact scores (before normalization) for the radar graph in the leaderboard? This would help people compare their task-level performance against these models.

Thanks

Jan 23 '25 05:01 hzhua