MoTCoder
This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.
Hi, I'm a little confused about the "pass@k" metric in the paper. It appears that on APPS you reported test case average (unstrict accuracy), although you referred to the "proportion...
Hello, I have downloaded the released model and followed the inference command you provided. However, it seems that the strict accuracy does not match the number you reported in...
Hello, would you mind sharing your curated dataset?
Hello. I am reproducing your results, but I am having trouble reproducing your baseline named `deepseek-coder-6.7b-instruct`. I use the prompt provided in this repository, but the APPS introductory pass@1...
It appears that the test list is not being used, and only the sample I/O is utilized in the code verification process. Based on this, I have two questions: 1....
I want to reproduce the evaluation pipeline for APPS, but it seems the `../data/apps_metric` invoked in `test_apps.py` has been removed. How am I supposed to run the evaluation for...