FasterTransformer

Support for expert parallelism in MoE inference

Open iteratorlee opened this issue 2 years ago • 0 comments

#743 also mentions this. Is there a tutorial on how to use expert parallelism in MoE inference?

iteratorlee, Oct 19 '23 08:10