X. HU
> Since the existing test scripts all use small models like 0.5B or 0.6B, while the smallest Qwen3-MoE model is 30B, this would significantly increase the runtime when pulling the...
There should be test cases for this in the CI. Since there were no strict version constraints before, why did those cases pass? Can u figure it out? @duesdues
PROFILE_STEPS="[2,4]"
PROFILE_RANKS_ALL=True
DISCRETE=True
SAVE_PATH="/home/profile_data_discrete"
LEVEL="level1"
CONTENTS=['npu','cpu']
ANALYSIS=True

actor_rollout_ref.actor.profiler.enable=True \
actor_rollout_ref.actor.profiler.all_ranks=$PROFILE_RANKS_ALL \
actor_rollout_ref.actor.profiler.tool_config.npu.discrete=$DISCRETE \
actor_rollout_ref.actor.profiler.tool_config.npu.contents=$CONTENTS \
actor_rollout_ref.actor.profiler.tool_config.npu.level=$LEVEL \
actor_rollout_ref.actor.profiler.tool_config.npu.analysis=$ANALYSIS \
actor_rollout_ref.ref.profiler.enable=True \
actor_rollout_ref.ref.profiler.all_ranks=$PROFILE_RANKS_ALL \
actor_rollout_ref.ref.profiler.tool_config.npu.discrete=$DISCRETE \
actor_rollout_ref.ref.profiler.tool_config.npu.contents=$CONTENTS \
actor_rollout_ref.ref.profiler.tool_config.npu.level=$LEVEL \
actor_rollout_ref.ref.profiler.tool_config.npu.analysis=$ANALYSIS...
@Shangwei-Li can u review it?
@FightingZhen ready for review~
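With MoE + MindSpeed, the run fails when it reaches grouped_linear; the corresponding MindSpeed version requires the matching 930 PTA release. The CI has the same problem, so we need to wait for the CI to update to the PTA 930 package. If you don't upgrade PTA, you can modify the code following @wlf-darkmatter's approach.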
It can be seen that the overall process control and the actual profiling startup are far apart, and the original solution requires passing the calling function many times, which is...
@FightingZhen @wuxibin89 ready for review. This PR is a bugfix for legacy_workers.
Is there any way to unify start_e2e_profiler and start_capture_profiler?
After configuring profiling for tests/speical_npu/run_qwen2_5_05b_grpo.sh, the profiling directory is as follows:

- root
  |- actor_compute_log_prob
  |- actor_update
  |- ref_compute_log_prob

And the results of tests/speical_npu/run_qwen2_5_05b_grpo_mindspeed.sh are also...
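For anyone reproducing this, here is a minimal sketch of how the directory layout above could be checked after a run. It only assumes that the three role subdirectories listed above sit directly under the profiling root; the helper name `missing_profile_dirs` is hypothetical and not part of the repository.

```python
import tempfile
from pathlib import Path

# Role subdirectories taken from the listing above.
EXPECTED_ROLES = ("actor_compute_log_prob", "actor_update", "ref_compute_log_prob")


def missing_profile_dirs(root, expected=EXPECTED_ROLES):
    """Return the expected role subdirectories that are absent under root.

    Hypothetical helper for illustration; it checks directory names only,
    not the profiler output inside them.
    """
    root = Path(root)
    return [name for name in expected if not (root / name).is_dir()]


if __name__ == "__main__":
    # Demo on a temporary directory that has only the two actor subdirectories.
    with tempfile.TemporaryDirectory() as tmp:
        for name in EXPECTED_ROLES[:2]:
            (Path(tmp) / name).mkdir()
        print(missing_profile_dirs(tmp))  # ['ref_compute_log_prob']
```

In practice the root would be whatever save path the profiler was configured with; an empty list means all three role directories were produced.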