weimakeit

Results 2 comments of weimakeit

> Setting overlap_comm to False can avoid this problem. This works in my multi-node training scenario.

same issue with vllm == 0.5.4 with APC enabled ![image](https://github.com/user-attachments/assets/52c42e42-8d58-4e25-9b6a-091125c3661f)