
llamamodel: add 12 new architectures for CPU (and Metal) inference #1914

Merged
1 commit merged into main on Feb 5, 2024

Conversation

@cebtenzzre (Member) commented Feb 2, 2024

This PR enables support for the following model architectures:

  • Baichuan
  • BLOOM
  • CodeShell
  • GPT-2
  • Orion
  • Persimmon
  • Phi and Phi-2
  • Plamo
  • Qwen
  • Qwen2
  • Refact
  • StableLM

These are only in the CPU/Metal inference whitelist until we know whether any of them are supported on Vulkan.
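The per-backend whitelist described above can be sketched as a simple lookup against the known architecture names. This is an illustrative sketch only, not the actual llamamodel code; the identifiers (`KNOWN_CPU_METAL_ARCHES`, `isCpuMetalSupported`) are hypothetical, and the strings assume the GGUF `general.architecture` naming convention:

```cpp
#include <algorithm>
#include <array>
#include <string>

// Hypothetical whitelist of GGUF architecture names verified for CPU/Metal
// inference (the names below mirror the list in this PR's description).
static const std::array<std::string, 13> KNOWN_CPU_METAL_ARCHES = {
    "baichuan", "bloom", "codeshell", "gpt2", "orion", "persimmon",
    "phi", "phi2", "plamo", "qwen", "qwen2", "refact", "stablelm",
};

// Returns true if the model's architecture string is on the CPU/Metal
// whitelist. Architectures not yet verified on Vulkan would be absent from
// a corresponding GPU whitelist (illustrative only).
bool isCpuMetalSupported(const std::string &arch) {
    return std::find(KNOWN_CPU_METAL_ARCHES.begin(),
                     KNOWN_CPU_METAL_ARCHES.end(), arch)
           != KNOWN_CPU_METAL_ARCHES.end();
}
```

Keeping separate whitelists per backend lets an architecture be enabled for CPU/Metal immediately while Vulkan support is still unverified.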

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
@cebtenzzre merged commit 92c025a into main on Feb 5, 2024
6 of 10 checks passed
@cebtenzzre deleted the add-new-arches branch on February 5, 2024 at 21:49
@cebtenzzre changed the title from "llamamodel: add 12 new architectures for CPU inference" to "llamamodel: add 12 new architectures for CPU (and Metal) inference" on Feb 8, 2024
dpsalvatierra pushed a commit to dpsalvatierra/gpt4all that referenced this pull request Feb 16, 2024
Baichuan, BLOOM, CodeShell, GPT-2, Orion, Persimmon, Phi and Phi-2,
Plamo, Qwen, Qwen2, Refact, StableLM

Signed-off-by: Jared Van Bortel <jared@nomic.ai>