Zhen

Results 9 comments of Zhen

Sorry, I still can not achieve the results mentioned in the paper.

It is very helpful for me, nice work!

> I’m on board with the PR itself, but our NPU-patch has added more and more Transformers compatibility changes over time—leading to noticeable maintenance difficulties. It’s time to upgrade the...

> [@Serzhanov](https://github.com/Serzhanov) [@zwxandy](https://github.com/zwxandy) I have met the same problem, have this problem be solved? I try to downgrade datasets version to 2.20.0,and it works for me @Serzhanov @dshwei , hope...

@Lokiscripter Some CI failed, please fix pre-commit and rebase your code to latest version

e2e_ppo_trainer_megatron_vllm_2 fail but not related with this PR, ignore.

@ZLiao097 please pull and rebase the newest code from `main`, there are some modifications about `e2e_ascend`.

> After configuring profiling for tests/speical_npu/run_qwen2_5_05b_grpo.sh, the profiling directory is as follows, > > * * root > | - actor_compute_log_prob > | - actor_update > | - ref_compute_log_prob >...