LongBench icon indicating copy to clipboard operation
LongBench copied to clipboard

What are the accurate scores of the task-level Radar graph

Open hzhua opened this issue 1 year ago • 0 comments

Image

Could please share the accurate scores (before normalization) for the radar graph in the leaderboard? This will help people to compare the task-level performance with these models.

Thanks

hzhua avatar Jan 23 '25 05:01 hzhua