DeepSpeed
DeepSpeed copied to clipboard
model parallel norm and overflow tests
Can one of the admins verify this patch?