bug: Model downloaded from Hugging Face can only run on CPU because it doesn't retrieve the correct GGUF model's metadata #3708

milen-prg · 2024-09-20T07:49:18Z

Jan version

Jan v0.5.4

Describe the Bug

The installed models via app hub, works on GPU as was, but in this version the user installed models use only CPU. In the older version the user installed models also used GPU.
Each new version ruins using of the user installed models 😭🤬

Steps to Reproduce

Install manually models from hugging face and from Jan hub.
Run one, then the other.
In this version the user installed not uses GPU.

Screenshots / Logs

No response

What is your OS?

MacOS
Windows
Linux

josepgl · 2024-09-21T10:27:03Z

In my case, on Linux, trying to use the GPU I see on the logs:

ERROR Could not load engine: Could not load library "/home/jose/.config/Jan/data/extensions/@janhq/inference-cortex-extension/dist/bin/linux-cuda-12-0/engines/cortex.llamacpp/libengine.so"
libcudart.so.12: cannot open shared object file: No such file or directory - server.cc:299

@milen-prg On what previous version it worked for you?

milen-prg · 2024-09-21T13:06:52Z

@josepgl , on v0.5.3 worked (there was another problem, the models was disappeared, but here easy helped me then).
On Windows where to see the logs?

josepgl · 2024-09-21T13:19:22Z

@milen-prg in Settings > Advanced Settings > Jan Data Folder is the data path, I see a log folder there in Linux.

milen-prg · 2024-09-21T15:06:34Z

There is logs folder, but it is empty.

imtuyethan · 2024-09-23T09:25:46Z

There is logs folder, but it is empty.

We only store your logs for 24h

imtuyethan · 2024-09-23T09:34:09Z

This is a known issue. The Hugging Face model download from the Search Box in model hub is pretty broken for now, and it doesn't retrieve the correct GGUF model's metadata. We've filed an issue on this and are working on the fix from the engine: #3558

In the meantime, please help us add an ngl setting to the settings section for now to enable GPU acceleration. It worked fine before because the previous versions hardcoded an ngl setting, which is hacky and not correct for all models.

milen-prg · 2024-10-02T17:22:18Z

The "bug: Model downloaded from Hugging Face can only run on CPU"
is not solved in:
v0.5.5

louis-jan · 2024-10-02T17:31:11Z

The "bug: Model downloaded from Hugging Face can only run on CPU"

is not solved in:

v0.5.5

Hi @milen-prg, the fix would work only with new downloaded models, in case you have HF models downloaded before but do not have ngl settings, please help redownload.

Also could you please share your scenario, screenshots. Thanks

louis-jan · 2024-10-02T17:33:42Z

Also, it looks like the issue is not about model GGUF. Could you please share your specs, and log file?

milen-prg · 2024-10-03T08:51:29Z

The logs folder is permanently empty.
If I try to import the gguf file, the model.json doesn't generates automatically (as was in the older versions), so the model is not recognized, it is not in list and can't be selected. When I put the old model.json, the model is in the list, but works only with CPU, not with GPU.
I see, your app with time targets to work only with self suggested models, which is enormous problem. I'll search for something more liberal. Thank, you.

louis-jan · 2024-10-03T08:53:53Z

Hi @milen-prg, it's likely a bug, as it's not targeting self-suggested models but also HuggingFace models.

milen-prg · 2024-10-04T11:16:24Z

In v.0.5.6 if reimport the models, they again works with GPU.

Several times after the model reply, the GPU loading keeps 100% and it not stops at trying the stop button for the reply, or delete the conversation entire thread, must close the Jan to not to overheat the GPU.
Will investigate this new problem, it seems appears randomly, but frequently with the imported models.

milen-prg added the type: bug Something isn't working label Sep 20, 2024

imtuyethan changed the title ~~bug: [DESCRIPTION] User installed models run on CPU~~ bug: User installed models run on CPU Sep 21, 2024

imtuyethan self-assigned this Sep 21, 2024

imtuyethan added the needs info Not enough info, more logs/data required label Sep 21, 2024

imtuyethan assigned louis-jan and unassigned imtuyethan Sep 23, 2024

imtuyethan added the category: model running label Sep 23, 2024

imtuyethan changed the title ~~bug: User installed models run on CPU~~ bug: Model downloaded from Hugging Face can only run on CPU because it doesn't retrieve the correct GGUF model's metadata Sep 23, 2024

imtuyethan added this to the v0.5.5 milestone Sep 23, 2024

louis-jan mentioned this issue Sep 23, 2024

fix: #3558 wrong model metadata import or download from HuggingFace #3725

Merged

louis-jan closed this as completed in #3725 Sep 24, 2024

This was referenced Oct 3, 2024

fix: error handling for model imports should be handled gracefully #3763

Merged

hotfix: graceful error handling model import #3766

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug: Model downloaded from Hugging Face can only run on CPU because it doesn't retrieve the correct GGUF model's metadata #3708

bug: Model downloaded from Hugging Face can only run on CPU because it doesn't retrieve the correct GGUF model's metadata #3708

milen-prg commented Sep 20, 2024

josepgl commented Sep 21, 2024

milen-prg commented Sep 21, 2024

josepgl commented Sep 21, 2024

milen-prg commented Sep 21, 2024

imtuyethan commented Sep 23, 2024

imtuyethan commented Sep 23, 2024 •

edited

Loading

milen-prg commented Oct 2, 2024

louis-jan commented Oct 2, 2024 •

edited

Loading

louis-jan commented Oct 2, 2024

milen-prg commented Oct 3, 2024

louis-jan commented Oct 3, 2024

milen-prg commented Oct 4, 2024

bug: Model downloaded from Hugging Face can only run on CPU because it doesn't retrieve the correct GGUF model's metadata #3708

bug: Model downloaded from Hugging Face can only run on CPU because it doesn't retrieve the correct GGUF model's metadata #3708

Comments

milen-prg commented Sep 20, 2024

Jan version

Describe the Bug

Steps to Reproduce

Screenshots / Logs

What is your OS?

josepgl commented Sep 21, 2024

milen-prg commented Sep 21, 2024

josepgl commented Sep 21, 2024

milen-prg commented Sep 21, 2024

imtuyethan commented Sep 23, 2024

imtuyethan commented Sep 23, 2024 • edited Loading

milen-prg commented Oct 2, 2024

louis-jan commented Oct 2, 2024 • edited Loading

louis-jan commented Oct 2, 2024

milen-prg commented Oct 3, 2024

louis-jan commented Oct 3, 2024

milen-prg commented Oct 4, 2024

imtuyethan commented Sep 23, 2024 •

edited

Loading

louis-jan commented Oct 2, 2024 •

edited

Loading