FlexGen
Suggestion: Add GPT-NeoX 20B support
JAX is already optimized for GPU training, and the GPT-NeoX repo itself requires significant GPU resources, so it could benefit from offloading.