
Google Gemma #214

Closed
surak opened this issue Feb 21, 2024 · 2 comments · Fixed by #256

Comments


surak commented Feb 21, 2024

Google released their new open model, Gemma. https://huggingface.co/google/gemma-7b-it

But we can't use it on fastchat with sglang yet:

File "/p/haicluster/llama/FastChat/sc_venv_sglang/sglang/python/sglang/srt/managers/router/model_runner.py", line 49, in get_model_cls_by_arch_name
2024-02-21 15:21:34 | INFO | stdout |     raise ValueError(
2024-02-21 15:21:34 | INFO | stdout | ValueError: Unsupported architectures: GemmaForCausalLM. Supported list: ['QWenLMHeadModel', 'Qwen2ForCausalLM', 'LlavaLlamaForCausalLM', 'MixtralForCausalLM', 'MistralForCausalLM', 'YiVLForCausalLM', 'LlamaForCausalLM']
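The error comes from sglang's model resolution step: the `architectures` field in the checkpoint's `config.json` is looked up against a list of known model classes, and `GemmaForCausalLM` isn't in it. Below is a minimal hedged sketch of how such a lookup can behave; the registry values and the function body are illustrative assumptions, not sglang's actual implementation, though the function name mirrors the traceback.

```python
# Illustrative sketch: map an architecture name from config.json to a
# model implementation. The registry contents mirror the "Supported list"
# in the error above; the values are placeholder strings, not real classes.
MODEL_REGISTRY = {
    "QWenLMHeadModel": "qwen",
    "Qwen2ForCausalLM": "qwen2",
    "LlavaLlamaForCausalLM": "llava",
    "MixtralForCausalLM": "mixtral",
    "MistralForCausalLM": "mistral",
    "YiVLForCausalLM": "yivl",
    "LlamaForCausalLM": "llama",
}

def get_model_cls_by_arch_name(arch_names):
    """Return the first registered implementation matching arch_names,
    or raise ValueError, as seen in the traceback above."""
    for name in arch_names:
        if name in MODEL_REGISTRY:
            return MODEL_REGISTRY[name]
    raise ValueError(
        f"Unsupported architectures: {', '.join(arch_names)}. "
        f"Supported list: {list(MODEL_REGISTRY)}"
    )
```

Under this sketch, `get_model_cls_by_arch_name(["LlamaForCausalLM"])` resolves normally, while `get_model_cls_by_arch_name(["GemmaForCausalLM"])` raises the `ValueError` shown in the log, which is why adding Gemma support means registering a new entry (and its model implementation) in this table.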
@nopepper

👍 I'd also be interested in this. I don't know how hard it would be to add support, though.

@Arcmoon-Hu
Contributor

Apparently sglang doesn't support this model yet. Maybe you could create a branch that adds support for it.

@hnyls2002 hnyls2002 mentioned this issue Mar 4, 2024
@hnyls2002 hnyls2002 linked a pull request Mar 4, 2024 that will close this issue