Hanning Wang
Results
1
issues of
Hanning Wang
**Describe the bug** When running multi-turn rollout with a Qwen3 model (in my case `Qwen/Qwen3-4B`), the `Inconsistent training and inference tokenization detected` warning is still triggered, even when the `tokenization_sanity_check_mode`...