Congmin(Xavier) Qiu

Results: 8 issues by Congmin(Xavier) Qiu

Line 25 in this file: https://github.com/tristandeleu/ntm-one-shot/blob/master/mann/utils/init.py. Is there a special reason to initialize the read weights like this?

## Summary

- Add early validation of tool names in `_call_tool()` method
- Return informative error message listing available tools when model calls non-existent tool
- Prevents KeyError and provides...
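The early-validation idea in this summary can be sketched roughly as follows. This is a minimal illustration only: `call_tool`, the `tools` registry, and the wording of the error message are hypothetical stand-ins, not the actual `_call_tool()` implementation from the PR.

```python
from typing import Any, Callable, Dict


def call_tool(tools: Dict[str, Callable[..., Any]], name: str, **kwargs: Any) -> str:
    """Hypothetical stand-in for a `_call_tool()`-style dispatcher."""
    # Validate the tool name first, instead of letting tools[name] raise KeyError.
    if name not in tools:
        available = ", ".join(sorted(tools)) or "(none)"
        return f"Error: tool '{name}' not found. Available tools: {available}"
    return str(tools[name](**kwargs))


# Assumed toy registry with a single tool.
tools = {"add": lambda a, b: a + b}

ok = call_tool(tools, "add", a=1, b=2)
missing = call_tool(tools, "multiply", a=1, b=2)
```

Returning the message as a string (rather than raising) lets the caller pass it back to the model, which can then retry with one of the listed tool names.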

## Summary

✅ **Verified on Google Colab**

- Add `max_colocate_count` field to `ResourcePoolManager` dataclass
- Add Ray version check using `packaging` library (requires >= 2.39.0 for `max_colocate_count` > 1)
- ...

## 🐛 Problem

When running Online DPO or SPIN training, the program crashes during initialization with:

```python
RuntimeError: no running event loop at verl/workers/rollout/vllm_rollout/vllm_rollout_spmd.py:575
```

### Root Cause

The `vLLMAsyncRollout.__init__()`...
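For readers unfamiliar with this error class, the sketch below shows in general terms how Python's `asyncio` raises `RuntimeError: no running event loop` when loop-dependent code runs in a synchronous context such as a constructor. It does not reproduce the truncated root-cause analysis above; `sync_init` and `async_init` are hypothetical names used only for illustration.

```python
import asyncio


def sync_init() -> str:
    """Runs outside any event loop, as a plain constructor would."""
    try:
        asyncio.get_running_loop()
    except RuntimeError as exc:
        # asyncio reports that no loop is running in this thread.
        return str(exc)
    return "loop is running"


async def async_init() -> str:
    """Runs inside a coroutine, so a running loop is available."""
    asyncio.get_running_loop()
    return "loop is running"


outside = sync_init()
inside = asyncio.run(async_init())
```

The same call succeeds once it is made from a coroutine driven by `asyncio.run()`, which is why deferring loop-dependent setup out of synchronous initialization is a common remedy.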

## 🐛 Problem

When `free_cache_engine=False`, weight synchronization between actor and rollout is completely skipped, causing:

- Rollout model weights never update after first epoch
- Extreme off-policy training (rollout uses...

Fixes #4159

- Changed `response_position_ids` padding from left to right
- Ensures alignment with `response_ids` for variable-length sequences
- Critical for 2D `position_ids` in multimodal models (e.g., Qwen2-VL)
- Added...
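Why the padding side matters can be seen in a small sketch. It assumes, as the bullets above suggest, that `response_ids` are right-padded. The `pad` helper, the token values, and the 1D lists are simplifications made up for illustration; the PR itself concerns tensors, including 2D `position_ids`.

```python
from typing import List, Tuple


def pad(seq: List[int], length: int, side: str = "right", pad_value: int = 0) -> List[int]:
    """Pad a list to `length` on the given side (hypothetical helper)."""
    if len(seq) > length:
        raise ValueError("sequence longer than target length")
    fill = [pad_value] * (length - len(seq))
    return seq + fill if side == "right" else fill + seq


response_ids = [11, 12, 13]   # three real tokens
position_ids = [5, 6, 7]      # their positions
max_len = 5
n_real = len(response_ids)

ids_right = pad(response_ids, max_len, side="right")
pos_left = pad(position_ids, max_len, side="left")
pos_right = pad(position_ids, max_len, side="right")

# Pair each real token with the position found at the same index.
mismatched: List[Tuple[int, int]] = list(zip(ids_right, pos_left))[:n_real]
matched: List[Tuple[int, int]] = list(zip(ids_right, pos_right))[:n_real]
```

With left-padded positions, the first real tokens are paired with pad values; padding both on the right keeps each token next to its own position.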

## Summary

Fixes #4162

This PR ensures the `rollout_actor_probs_pearson_corr` metric accurately reflects the correlation between rollout and actor probabilities by computing log probabilities from unscaled logits for metrics calculation.

##...

## Summary

Fixes #4229

This PR optimizes GPU memory usage in async rollout mode by skipping unnecessary FSDP model loading.

**Memory Savings**: ~50% for rollout workers (e.g., 14GB vs 22GB...