DeepSpeed
[REQUEST] Examples for tensor parallelism and pipeline parallelism
I noticed that DeepSpeedEngine and PipelineEngine have some settings related to tensor parallelism. Could you please provide some examples of combining tensor parallelism with pipeline parallelism? I would appreciate a clear example to follow.
+1. I am also confused about how to train a model with both tensor parallelism and pipeline parallelism in DeepSpeed.
Great question. It would be even better if it could be combined with DDP.