Ritvik Kapila
Results: 2 issues of Ritvik Kapila
I am trying to run evaluations on Claude models with the following command:
```
lm_eval --model anthropic-completions --model_args model=claude-3-haiku-20240307 --tasks gsm8k -L20
```
First, the command fails with the error...
### System Info
Hi, I was looking at the GRPO scripts for Qwen3 MoE models, in particular `examples/grpo_trainer/run_qwen3moe-30b_megatron_96gb.sh` and `examples/grpo_trainer/run_qwen3-235b_megatron_96gb.sh`. There seem to be some inconsistencies, and I wanted to flag...
bug