Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add dolly/redpajama/bloomz models support #214

Merged
merged 5 commits into from May 10, 2023
Merged

feat: add dolly/redpajama/bloomz models support #214

merged 5 commits into from May 10, 2023

Conversation

mudler
Copy link
Owner

@mudler mudler commented May 9, 2023

This adds experimental supports for dolly models and red pajamas, however I didn't had still have success in running any of the models I've found:

main: seed = 1683660204
dollyv2_model_load: loading model from '/home/mudler/_git/LocalAI/models/ggml-dolly-q5_0.bin' - please wait ...
dollyv2_model_load: n_vocab = 50280
dollyv2_model_load: n_ctx   = 2048
dollyv2_model_load: n_embd  = 4096
dollyv2_model_load: n_head  = 32
dollyv2_model_load: n_layer = 32
dollyv2_model_load: n_rot   = 32
dollyv2_model_load: ftype   = 8
dollyv2_model_load: ggml ctx size = 8596.22 MB
dollyv2_model_load: memory_size =  1024.00 MB, n_mem = 65536
dollyv2_model_load: unknown tensor 'gpt_neox.embed_in.weight' in model file
main: failed to load model from '/home/mudler/_git/LocalAI/models/ggml-dolly-q5_0.bin'

However, I get the same error while running the main.cpp in ggml too.

Bloomz (https://github.com/NouamaneTazi/bloomz.cpp) seems to work, but no good model found so far.

Makefile Outdated Show resolved Hide resolved
@mudler mudler changed the title feat: add dolly models support feat: add dolly/redpajama/bloomz models support May 10, 2023
@mudler
Copy link
Owner Author

mudler commented May 10, 2023

Next on my list is to put gpt4all-j on par with the bindings

@mudler mudler merged commit 1167593 into master May 10, 2023
3 checks passed
@mudler mudler deleted the dolly branch May 10, 2023 23:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant