shaojie zhang
Results
3
comments of
shaojie zhang
Ops, so..... no more data?
I noticed a similar issue. For the 1.5B base model, the reported pass@1 rate was 43.9%, but the actual pass@1 rate without using post-processing other than truncation was 5.49%.
> Additionally, I noticed that Qwen2.5-Coder applies various dataset-specific processing steps. Could this be the reason why the results in the technical report are significantly better? > > See the...