Add support for HuggingFace GPT-NeoX implementation

Open mrwyattii opened this issue 3 years ago • 0 comments

I'm running into a CUDA out-of-memory (OOM) error when loading this model: it is too large to fit on a single GPU, and the HF pipeline does not support splitting it across multiple GPUs.
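
For a rough sense of scale, here is a back-of-the-envelope sketch of the weight memory involved. It assumes the checkpoint in question has about 20 billion parameters (the issue does not state a size) and compares against a hypothetical 40 GiB GPU. The helper names `weight_memory_gib` and `fits_on_one_gpu` are made up for illustration, and the estimate covers weights only, not activations or framework overhead.

```python
def weight_memory_gib(num_params: int, bytes_per_param: int) -> float:
    """Approximate memory needed to hold the weights alone, in GiB."""
    return num_params * bytes_per_param / 2**30


def fits_on_one_gpu(num_params: int, bytes_per_param: int, gpu_memory_gib: float) -> bool:
    """True if the weights alone fit in the given GPU memory (ignores activations)."""
    return weight_memory_gib(num_params, bytes_per_param) <= gpu_memory_gib


if __name__ == "__main__":
    NUM_PARAMS = 20_000_000_000  # assumption: ~20B parameters
    GPU_MEMORY_GIB = 40.0        # assumption: hypothetical single-GPU capacity

    for dtype_name, nbytes in (("fp32", 4), ("fp16", 2)):
        needed = weight_memory_gib(NUM_PARAMS, nbytes)
        ok = fits_on_one_gpu(NUM_PARAMS, nbytes, GPU_MEMORY_GIB)
        print(f"{dtype_name}: {needed:.1f} GiB for weights, fits on one GPU: {ok}")
```

Even where the half-precision weights nominally fit, there is little headroom left for activations, which is why sharding the model across GPUs would help here.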

May 27 '22 17:05 mrwyattii