llamamodel: add gemma model support #1992

cebtenzzre · 2024-02-21T19:15:40Z

Add Gemma support. Using a Q4_0 quant of this model: https://huggingface.co/google/gemma-7b-it

Also whitelist Kompute for Gemma, Phi and Phi-2, Qwen2, and StableLM, as I have tested Q4_0 quants of all of them without issue.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre · 2024-02-21T19:30:22Z

The output is poor as it is taking "start_of_turn" literally and talking about board games. So this is blocked on #1970

cebtenzzre added 2 commits February 21, 2024 13:48

llamamodel: add gemma model support

5608b93

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

models2.json: add gemma model

f8f92cb

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre requested a review from manyoso February 21, 2024 19:15

gemma: fix default prompt template

7b8d3f7

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

manyoso approved these changes Feb 21, 2024

View reviewed changes

manyoso merged commit 4a8c6d7 into main Feb 21, 2024
6 of 17 checks passed

cebtenzzre mentioned this pull request Feb 21, 2024

[Feature] Add support for Gemma LLMs #1988

Closed

Provide feedback