distributed-llama icon indicating copy to clipboard operation
distributed-llama copied to clipboard

Support for Gemma 2?

Open sdmorrey opened this issue 1 year ago • 2 comments

What would be required to support Gemma 2? I'd be happy to chip in and help with the code, I just need to have a bit of insight into what would need to be changed?

sdmorrey avatar Aug 04 '24 23:08 sdmorrey

Hello @sdmorrey,

you should check llama2-tasks.cpp and grok1-tasks.cpp files. For different architectures DL builds a different task list. Tasks are reused of course (in grok1-tasks.cpp you can see the implementation of different tasks than Llama model uses).

I see Gemma 2 has more norm layers. Rope layer it seems it's already implemented (FalconRopeCommand). Probably the tokenizer is something that may need more work (converter), but I'm not sure.

b4rtaz avatar Aug 05 '24 21:08 b4rtaz

+1 for Gemma 2

unclemusclez avatar Aug 25 '24 21:08 unclemusclez