Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

IBM Granite #1502

Closed
eltonjohnfanboy opened this issue Jun 1, 2024 · 2 comments
Closed

IBM Granite #1502

eltonjohnfanboy opened this issue Jun 1, 2024 · 2 comments

Comments

@eltonjohnfanboy
Copy link

Hi!

I am writing to inquire about the future support plans for the Granite 3B and 8B models in the llama-cpp-python library. While attempting to load the small GGUF models for these Granite models using llama-cpp-python, I encountered the following error:
error loading model: done_getting_tensors: wrong number of tensors; expected 578, got 470
I suspect we get this issue because the small Granite models (3B and 8B) are not yet supported by this library. Are there any information on any plans to support these models in the future?

Thanks! :))

@abetlen
Copy link
Owner

abetlen commented Jun 4, 2024

@eltonjohnfanboy should be in the newest release (0.2.77), let me know if you have any issues.

@eltonjohnfanboy
Copy link
Author

@abetlen Great, it's working properly! Thanks a lot:))

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants