IBM Granite #1502

eltonjohnfanboy · 2024-06-01T17:05:43Z

Hi!

I'm writing to ask about plans to support the Granite 3B and 8B models in llama-cpp-python. When I try to load the GGUF files for these models with llama-cpp-python, I get the following error:
error loading model: done_getting_tensors: wrong number of tensors; expected 578, got 470
I suspect this happens because the small Granite models (3B and 8B) are not yet supported by this library. Is there any information on plans to support them in the future?

Thanks! :))

abetlen · 2024-06-04T14:00:56Z

@eltonjohnfanboy support should be in the newest release (0.2.77); let me know if you have any issues.
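
For anyone hitting the same tensor-count error, a quick way to check whether the installed package is at least the release mentioned above is sketched here. This is only an illustration: the 0.2.77 threshold is taken from this comment, and the helper names (`parse_version`, `supports_granite`, `installed_llama_cpp_version`) are hypothetical, not part of llama-cpp-python.

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Assumption: 0.2.77 is the first release with Granite support,
# as stated in the comment above.
MIN_VERSION = (0, 2, 77)


def parse_version(text):
    """Turn '0.2.77' into (0, 2, 77); stop at the first non-numeric part."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def supports_granite(installed):
    """Hypothetical helper: True if the version string is >= MIN_VERSION."""
    return parse_version(installed) >= MIN_VERSION


def installed_llama_cpp_version():
    """Return the installed llama-cpp-python version string, or None."""
    try:
        return version("llama-cpp-python")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_llama_cpp_version()
    if found is None:
        print("llama-cpp-python is not installed")
    elif supports_granite(found):
        print(f"{found} should include Granite support")
    else:
        print(f"{found} is older than 0.2.77; "
              "try: pip install --upgrade llama-cpp-python")
```

If the check reports an older version, upgrading the package and then reloading the same GGUF file is the first thing to try.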

eltonjohnfanboy · 2024-06-06T04:56:04Z

@abetlen Great, it's working properly! Thanks a lot:))

abetlen closed this as completed Jun 4, 2024

tombenninger mentioned this issue Jun 4, 2024

Include Granite code-instruct models in catalog. containers/podman-desktop-extension-ai-lab#1157

Open
