Zhen
Zhen
Sorry, I still can not achieve the results mentioned in the paper.
It is very helpful for me, nice work!
> I’m on board with the PR itself, but our NPU-patch has added more and more Transformers compatibility changes over time—leading to noticeable maintenance difficulties. It’s time to upgrade the...
@Serzhanov @zwxandy I have met the same problem, have this problem be solved?
> [@Serzhanov](https://github.com/Serzhanov) [@zwxandy](https://github.com/zwxandy) I have met the same problem, have this problem be solved? I try to downgrade datasets version to 2.20.0,and it works for me @Serzhanov @dshwei , hope...
@Lokiscripter Some CI failed, please fix pre-commit and rebase your code to latest version
e2e_ppo_trainer_megatron_vllm_2 fail but not related with this PR, ignore.
@ZLiao097 please pull and rebase the newest code from `main`, there are some modifications about `e2e_ascend`.
> After configuring profiling for tests/speical_npu/run_qwen2_5_05b_grpo.sh, the profiling directory is as follows, > > * * root > | - actor_compute_log_prob > | - actor_update > | - ref_compute_log_prob >...