
chat: fix blank device in UI after model switch and improve Mixpanel reporting #2409

Merged: 9 commits into main from fix-cur-gpu-device on Jun 26, 2024

Conversation

cebtenzzre
Member

Functional Changes

  • When you switch chats in the UI and they both use the same model, the device and fallback reason reported in the lower-right corner of the chat no longer become blank.
  • In Mixpanel, actualDevice was blank about 7% of the time because of the above issue. This is now fixed.
  • In Mixpanel, v2.8.0 reported devices as e.g. actualDevice="GTX 970 (CUDA)" and actualDevice="Tesla P40 (Vulkan)". Now they are reported as actualDevice="GTX 970", device_backend="CUDA" and actualDevice="Tesla P40", device_backend="Vulkan".
  • default_device on Mixpanel still shows plain device names in v2.8.0. default_device_backend is added to supplement this (should always be "Vulkan" in official releases).

Other Changes

  • Remove LLModel::hasGPUDevice and llmodel_has_gpu_device. In current GPT4All, it is always true if the last call to initializeGPUDevice returned true. There are no users of this function. (cc @jacoobes @iimez - the ts bindings still reference this, likely incorrectly)
  • Replace the llama.cpp hack that sets n_gpu_layers to zero when Kompute falls back to CPU with a more reasonable API that introduces llama_model_using_gpu.

This function has never been used and is equivalent to whether the last
call to initializeGPUDevice() returned true.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
This fixes the issue where reusing a model for a new chat would cause
the device name and fallback reason to be lost - both in the UI, and on
Mixpanel.

Also, separate the device name from the backend (now at the
"deviceBackend" prop) on Mixpanel.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
@cebtenzzre cebtenzzre requested a review from manyoso June 4, 2024 20:00
@cebtenzzre
Member Author

The merge conflicts will be resolved after #2408 is merged.

iimez added a commit to iimez/gpt4all that referenced this pull request Jun 4, 2024
Signed-off-by: limez <limez@protonmail.com>
Signed-off-by: Jared Van Bortel <jared@nomic.ai>
QVariant() is `undefined` in QML, while QString() is `null`.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
When we do CPU fallback after requesting CUDA with ngl=0, the GPU is
still used for matrix multiplication. We shouldn't clear the GPU device
on failure, as it is still relevant when we fall back. We also shouldn't
allow loadModel with an unset device.

This fixes a case where gpuDeviceName() would return "", because
usingGPUDevice() reported true while we had no known device name.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
@cebtenzzre cebtenzzre merged commit 01870b4 into main Jun 26, 2024
6 of 18 checks passed
@cebtenzzre cebtenzzre deleted the fix-cur-gpu-device branch June 26, 2024 19:26