DeepSpeed-MII
DeepSpeed-MII copied to clipboard
Support for Codellama Model in deepspeed-fastgen
It seems that the current implementation does not include compatibility for Codellama. I would like to propose the inclusion of CodeLlama support in deepspeed-fastgen
Hi @LinKiling it looks like we are able to load the Codellama models and run generation, but the output seems a bit off. I'll ask our kernel devs to take a look. Thanks!
Hi @LinKiling it looks like we are able to load the Codellama models and run generation, but the output seems a bit off. I'll ask our kernel devs to take a look. Thanks!
any updates on this? i also get wrong output on codellama