DeepSpeed-MII icon indicating copy to clipboard operation
DeepSpeed-MII copied to clipboard

Add support for HuggingFace GPT-NeoX implementation

Open mrwyattii opened this issue 3 years ago • 0 comments

I'm running into a CUDA OOM error when loading this model due to the large size and lack of support for multi-GPU in HF pipeline.

mrwyattii avatar May 27 '22 17:05 mrwyattii