Assertion Error #68
Comments
Same issue here.
Yes, just tried this and got the same error, with the 70b model as well.
The model works in llama.cpp, so perhaps this is caused by a new config in the model?
This seems to be a problem with the latest version of llama.cpp, from what I can gather in https://huggingface.co/TheBloke/Llama-2-13B-chat-GGML/discussions/14. I got it working locally (for now, until the core problem is resolved) by following the suggestion there to go back to an older version, so I replaced the image version with an older one. For example
to
That one was about 9 days old. There were newer ones, but most were about a day old, so I figured they might all share the same problem. You could always try a later one than that. You can see all of them here: https://github.com/abetlen/llama-cpp-python/pkgs/container/llama-cpp-python/versions?filters%5Bversion_type%5D=untagged&page=1
Sorry for the confusion, folks! This was resolved as part of #71, which added support for Code Llama models. It should be fixed in the master branch now. You were correct in your assessment, @THeivers. You can retry with:
git pull origin master
./run.sh --model 7b
or, if you are on an M1/M2 Mac:
./run-mac.sh --model 7b
Replace
Thanks. As I understand it from getting this working yesterday, llama.cpp now uses GGUF by default rather than GGML?
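For anyone who wants to check which format a downloaded model file is in, here is a minimal sketch. It relies only on the fact that GGUF files begin with the four ASCII bytes `GGUF`; anything else is reported as "not GGUF" without trying to identify the older format. The function names and the example path are hypothetical and are not part of this project or of llama.cpp.

```python
GGUF_MAGIC = b"GGUF"  # GGUF files start with these four ASCII bytes


def is_gguf(path):
    """Return True if the file at `path` starts with the GGUF magic bytes."""
    with open(path, "rb") as f:
        return f.read(4) == GGUF_MAGIC


def describe_model(path):
    """Give a short, human-readable guess at the model file format."""
    if is_gguf(path):
        return "GGUF"
    # Assumption: a model file without the GGUF magic is probably a legacy
    # GGML-family file, but this sketch does not verify that.
    return "not GGUF (possibly legacy GGML)"
```

Usage would look like `print(describe_model("./models/some-model.bin"))`, where the path is a placeholder for wherever the model was downloaded. If a GGML file is being loaded by a llama.cpp build that expects GGUF (or the other way round), a format mismatch like this may be what sits behind the error, though that is a guess based on the discussion linked above.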
When I try to run the "docker compose up" command, it downloads the model and then throws an AssertionError. I have tried deleting the model manually multiple times, but it still doesn't work.