Thomas YIu

Results 3 comments of Thomas YIu

Please help me fix it.

That is what I am getting as well when running lm_eval on MMLU. I wonder which MMLU setup OpenAI tested with. Was it with tools?