DeepSpeed
Skip autoTP if tp_size is 1
Skip autoTP when no tensor parallelism is needed, i.e. when tp_size is 1 and only one GPU is used.
https://github.com/microsoft/DeepSpeed/issues/3285
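A minimal sketch of the guard described above, assuming the sharding step can be passed in as a callable. The names `maybe_apply_auto_tp` and `shard_fn` are hypothetical and used only for illustration; this is not DeepSpeed's actual implementation.

```python
from typing import Callable, TypeVar

M = TypeVar("M")


def maybe_apply_auto_tp(model: M, tp_size: int, shard_fn: Callable[[M, int], M]) -> M:
    """Hypothetical helper: shard the model only when tp_size > 1."""
    if tp_size < 1:
        raise ValueError(f"tp_size must be >= 1, got {tp_size}")
    if tp_size == 1:
        # Single GPU: there is nothing to split, so return the model untouched
        # and avoid the cost of the sharding pass.
        return model
    # More than one GPU: delegate to the (assumed) sharding routine.
    return shard_fn(model, tp_size)
```

With this shape, the single-GPU path never calls the sharding routine, so its overhead and any of its side effects are avoided entirely.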