
Support for QWEN and Baichuan2 models #1731

Closed
sorasoras opened this issue Dec 8, 2023 · 5 comments
Labels
backend gpt4all-backend issues models

Comments

@sorasoras

Feature request

Recently, https://github.com/ggerganov/llama.cpp added support for both QWEN and Baichuan2. QWEN was added at 1610:
ggerganov/llama.cpp#4281
I have looked at the Nomic Vulkan fork of llama.cpp: it does have support for Baichuan2 but not QWEN, while GPT4All itself does not support Baichuan2.

Motivation

I failed to load Baichuan2 and QWEN models; GPT4All is supposed to be easy to use.

Your contribution

Not much, as I am not a programmer, but I can look things up if that helps.

@cebtenzzre cebtenzzre added backend gpt4all-backend issues models labels Dec 10, 2023
@cebtenzzre
Member

Baichuan2 should be supported in the next release - the current release isn't using the latest version of our llama.cpp fork.

Qwen got merged upstream a little too late, but it should be supported here soon.

@AdkinsHan

There is a problem: when I use Qwen, it always runs on the CPU and system memory instead of the GPU, no matter which device I specify.
model_name="qwen1_5-14b-chat-q8_0.gguf"
gpt4all version 2.2.1.post1
https://github.com/QwenLM/Qwen
[screenshots: task manager showing CPU and RAM usage]
While it runs, I can see CPU and memory usage rise, but video memory usage does not change.
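A minimal sketch of explicitly requesting the GPU backend, assuming the gpt4all Python bindings' `device` argument is available (the `pick_device` helper and the model filename are illustrative, not from the thread):

```python
def pick_device(prefer_gpu: bool) -> str:
    """Map a preference to a device string accepted by GPT4All(..., device=...).

    The bindings understand "cpu", "gpu", or a specific device name
    reported by the backend.
    """
    return "gpu" if prefer_gpu else "cpu"


# Usage (requires the gpt4all package and the model file present locally):
#   from gpt4all import GPT4All
#   model = GPT4All("qwen1_5-14b-chat-q8_0.gguf", device=pick_device(True))
```

Note that per the maintainer's replies below, GPU support for Qwen was only enabled in a later release, so on older versions the model may silently fall back to the CPU regardless of the requested device.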

@AdkinsHan

When I use other GGUF models, I can see the following output:
llama.cpp: using Vulkan on NVIDIA GeForce RTX 3070
[screenshot of the loader output]
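A small, hypothetical helper (not part of gpt4all) that parses a loader log line like the one above to confirm which backend llama.cpp selected:

```python
from typing import Optional


def backend_from_log(line: str) -> Optional[str]:
    """Extract the backend name from a llama.cpp device log line.

    E.g. "llama.cpp: using Vulkan on NVIDIA GeForce RTX 3070" -> "Vulkan".
    Returns None if the line is not a device-selection message.
    """
    prefix = "llama.cpp: using "
    if line.startswith(prefix):
        # "Vulkan on NVIDIA GeForce RTX 3070" -> "Vulkan"
        return line[len(prefix):].split(" on ")[0]
    return None


print(backend_from_log("llama.cpp: using Vulkan on NVIDIA GeForce RTX 3070"))
# -> Vulkan
```

If no such line appears when loading Qwen, that matches the symptom reported above: the model is running on the CPU backend.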

@cebtenzzre
Member

cebtenzzre commented Feb 22, 2024

As of #2005, Qwen, Qwen2, and Baichuan all have GPU support enabled on both the Metal and Vulkan backends.

@cebtenzzre cebtenzzre added the awaiting-release issue is awaiting next release label Feb 22, 2024
@cebtenzzre
Member

This is implemented in the v2.7.2 release.

@cebtenzzre cebtenzzre removed the awaiting-release issue is awaiting next release label Mar 6, 2024