Thomas YIu
Results
3
comments of
Thomas YIu
please help me fix it
That is what I am getting as well for lm_eval for mmlu. I wonder what MMLU did OpenAI test with? Was it with tools?