chongxing

Results 1 comments of chongxing

Is there any plan to support two-batch overlap? or any other solutions, to hide moe communication time, especially for hardware platform with only rdma-based node-to-node communications.