Regarding instance-specific scoring criteria

Open prapti19 opened this issue 1 year ago • 0 comments

Hi, the paper mentions that "instance-specific" scoring criteria were created for the FLASK-HARD subset. Is there any way to create or use these subquestions/scoring criteria? It would be very helpful to be able to access them and benchmark models on them.

Thanks

Oct 10 '24 21:10 prapti19