
Add Gemma model #2964

Merged: 1 commit into vllm-project:main on Feb 21, 2024

Conversation

@xiangxu-google (Contributor) commented Feb 21, 2024

https://blog.google/technology/developers/gemma-open-models/

The PR contains the Gemma model implementation, which can load the following checkpoints from Hugging Face:

  • google/gemma-2b
  • google/gemma-2b-it
  • google/gemma-7b
  • google/gemma-7b-it

@WoosukKwon (Collaborator) left a comment

LGTM! Thanks for submitting the PR!

@WoosukKwon WoosukKwon merged commit 5253eda into vllm-project:main Feb 21, 2024
19 of 21 checks passed
zhuohan123 added a commit that referenced this pull request Feb 21, 2024
This version is for more model support. Add support for Gemma models (#2964) and OLMo models (#2832).
simon-mo pushed a commit that referenced this pull request Feb 21, 2024
This version is for more model support. Add support for Gemma models (#2964) and OLMo models (#2832).
xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 22, 2024
This version is for more model support. Add support for Gemma models (vllm-project#2964) and OLMo models (vllm-project#2832).
xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024
This version is for more model support. Add support for Gemma models (vllm-project#2964) and OLMo models (vllm-project#2832).
@MrRace commented Mar 6, 2024

@xiangxu-google
After deploying Gemma with vLLM's OpenAI-compatible API server, it seems that the Chat API does not accept a system message in the input.

The deployment command used is:

python3 -m vllm.entrypoints.openai.api_server --served-model-name gemma-2b-it --model /data/share_model_zoo/LLM/google/gemma-2b-it

The test request used is:

curl http://localhost:8000/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "gemma-2b-it",
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Hello"}
        ]
    }'

The response returned is as follows:

{"object":"error","message":"System role not supported","type":"BadRequestError","param":null,"code":400}

@xiangxu-google (Contributor, Author) commented

Is this error Gemma-specific? This PR added only the modeling code, which should not affect the API server.

@simon-mo (Collaborator) commented Mar 8, 2024

Gemma's chat template does not support the system role: https://huggingface.co/google/gemma-2b-it/blob/718cb189da9c5b2e55abe86f2eeffee9b4ae0dad/tokenizer_config.json#L59
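Since the template only accepts alternating user/model turns, one client-side workaround is to fold the system prompt into the first user message before sending the request. A minimal sketch in Python; the `fold_system_into_user` helper below is hypothetical, not part of vLLM or any OpenAI client library:

```python
# Hypothetical client-side helper: merge any system messages into the first
# user message, since Gemma's chat template rejects the "system" role.
def fold_system_into_user(messages):
    system_parts = [m["content"] for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    if system_parts and rest and rest[0]["role"] == "user":
        # Prepend the system text to the first user turn.
        rest[0] = {
            "role": "user",
            "content": "\n\n".join(system_parts + [rest[0]["content"]]),
        }
    return rest

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello"},
]
print(fold_system_into_user(messages))
# → [{'role': 'user', 'content': 'You are a helpful assistant.\n\nHello'}]
```

Sending the transformed messages in the curl request above would avoid the 400 error, at the cost of the instructions living in the user turn rather than a dedicated system slot.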

4 participants