Ahmad Kiswani
Calling GridEncoder with empty positions (`positions.shape=[0, 3]`) silently puts CUDA into an invalid state. Subsequent CUDA kernel calls then fail with the following error: ``` RuntimeError: CUDA error:...
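Until the kernel itself handles empty input, a caller could avoid the failure by never launching it on zero rows. The following is a minimal sketch under assumptions, not GridEncoder's actual API: `encode_or_empty` and its `out_dim` parameter are hypothetical names, `encoder` is assumed to be any callable that maps an `[N, 3]` PyTorch tensor to an `[N, out_dim]` tensor, and the empty result is assumed to share the input's dtype and device.

```python
def encode_or_empty(encoder, positions, out_dim):
    """Call `encoder` only when `positions` has at least one row.

    Hypothetical helper: `encoder` is any callable mapping an [N, 3]
    tensor to an [N, out_dim] tensor, and `out_dim` is supplied by the
    caller because this sketch does not inspect the encoder.
    """
    if positions.shape[0] == 0:
        # Skip the kernel launch entirely. torch.Tensor.new_zeros builds
        # the empty result with the same dtype and device as the input.
        return positions.new_zeros((0, out_dim))
    return encoder(positions)
```

If the real encoder returns a different dtype than its input (for example half precision), the empty branch would need to match it explicitly.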
# What does this PR do ? Add qwen3-next support, recipes and tests to mcore path. # Issues https://github.com/NVIDIA-NeMo/RL/issues/1431 # Usage SFT: ```bash uv run --extra mcore --extra vllm --extra...
# What does this PR do ? Sequence parallelism with tp_size > 1 is currently broken in torch==2.8.0, so there is no point in running the test: it will raise an exception. ##...
Replacing PR #732. Waiting for runs to confirm the convergence graph attached in the original PR. # What does this PR do ? Add a simple example on multi-turn GRPO...
An issue to track supporting Qwen-3 next in MCore path
# CLI Architecture Redesign **Use cases, pain points, and background** - The current CLI has 17+ separate entry points (`ng_run`, `ng_test`, `ng_collect_rollouts`, etc.) - The commands don't have a consistent...