chongxing
Results
1
comments of
chongxing
Is there any plan to support two-batch overlap? or any other solutions, to hide moe communication time, especially for hardware platform with only rdma-based node-to-node communications.