FastChat
Plans to support Gemma 3?
Are there any plans to integrate support for the Gemma 3 models into FastChat, or is anyone already working on it? Loading a Gemma 3 model natively doesn't work out of the box and leads to both loading and inference issues.
Perhaps it would make sense to report this as a bug on SGLang, as the model_worker is not really where you want to run these models?