weimakeit
Results
2
comments of
weimakeit
> Setting overlap_comm to False can avoid this problem. This works in my multi-node training scenario.
same issue with vllm == 0.5.4 with APC enabled 