
llama_bootstrap: failed to load model from '/model.bin' #9

Closed
nightvision04 opened this issue Mar 31, 2023 · 9 comments
@nightvision04

Looks like the latest image is failing. Perhaps a broken path?

docker run -p 8080:8080 -ti --rm quay.io/go-skynet/llama-cli:v0.4 api --context-size 700 --threads 4
Unable to find image 'quay.io/go-skynet/llama-cli:v0.4' locally
v0.4: Pulling from go-skynet/llama-cli
8022b074731d: Already exists 
7971239fe1d6: Already exists 
26c861b53509: Already exists 
1714880ecc1c: Already exists 
c71e83b44ada: Already exists 
e4448c041760: Already exists 
736f744dca4b: Already exists 
7517d65a7897: Pull complete 
0afdf5bf81eb: Pull complete 
c7aef89193c7: Pull complete 
ea356902fa2d: Pull complete 
8865ead58fd1: Pull complete 
487435084471: Pull complete 
Digest: sha256:b4a2556985d4496a1db89db50688fd3f15ffc21e76cce6b713fc4feefabd9268
Status: Downloaded newer image for quay.io/go-skynet/llama-cli:v0.4
llama_model_load: failed to open '/model.bin'
llama_bootstrap: failed to load model from '/model.bin'
Loading the model failed: failed loading model
@mudler
Owner

mudler commented Apr 4, 2023

models are not bundled in the image due to licensing - models like gpt4all, alpaca, and vicuna are based on LLaMA from Facebook, which prohibits modification, alteration, and re-distribution of the weights in any form. See for instance nomic-ai/gpt4all#75.

Sadly, until there is a model with a free license that allows re-distribution, we can't embed one in the image, or we risk yet another DMCA takedown. You need to obtain the model yourself and specify it as described in https://github.com/go-skynet/llama-cli#using-other-models
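As a sketch of that setup (the host directory `~/models` and the filename `model.bin` are assumptions, adjust to wherever you downloaded the weights), mount the directory containing your model into the container and point `--model` at the in-container path:

```shell
# Host directory holding model.bin (an assumption; adjust to your setup).
MODEL_DIR="$HOME/models"

# Build the command: -v mounts MODEL_DIR at /models inside the container,
# so --model must use the container-side path /models/model.bin.
CMD="docker run -v ${MODEL_DIR}:/models -p 8080:8080 -ti --rm \
quay.io/go-skynet/llama-cli:v0.4 api --context-size 700 --threads 4 \
--model /models/model.bin"

echo "$CMD"
```

The key point is that the path given to `--model` is resolved inside the container, so it must be the mount target (`/models/...`), not the host path.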

@regstuff

regstuff commented Apr 5, 2023

> models are not bundled in the image due to licensing - models like gpt4all, alpaca, and vicuna are based on LLaMA from Facebook, which prohibits modification, alteration, and re-distribution of the weights in any form. See for instance nomic-ai/gpt4all#75.
>
> Sadly, until there is a model with a free license that allows re-distribution, we can't embed one in the image, or we risk yet another DMCA takedown. You need to obtain the model yourself and specify it as described in https://github.com/go-skynet/llama-cli#using-other-models

I get this error despite mounting. Here's my command: sudo docker run -v ~/llama_models/gpt4-x-alpaca-13b-native-4bit-128g/gpt4-x-alpaca-13b-ggml-q4_1-from-gptq-4bit-128g/:/models -p 8080:14004 -ti --rm quay.io/go-skynet/llama-cli:v0.4 api --context-size 700 --threads 12 --alpaca true --model /models/model.bin
I have a model.bin file inside the ~/llama_models/gpt4-x-alpaca-13b-native-4bit-128g/gpt4-x-alpaca-13b-ggml-q4_1-from-gptq-4bit-128g folder.

@mudler
Owner

mudler commented Apr 5, 2023

Can you try using the MODEL_PATH env var instead?

sudo docker run -e MODEL_PATH=/models/model.bin -v ~/llama_models/gpt4-x-alpaca-13b-native-4bit-128g/gpt4-x-alpaca-13b-ggml-q4_1-from-gptq-4bit-128g/:/models -p 8080:14004 -ti --rm quay.io/go-skynet/llama-cli:v0.4 api --context-size 700 --threads 12 --alpaca true

Just noticed this is already being set in the main container image; a fix is landing in master! (bf85a31)

@mudler
Owner

mudler commented Apr 5, 2023

@regstuff the master image is fixed now; you can also try the same command, but using quay.io/go-skynet/llama-cli:latest instead.

@jonit-dev

@mudler It's not working for me either...

➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:v0.4  --instruction "What's an alpaca?" --topk 10000 --gpt4all=true --model ./models/ggml-alpaca-7b-q4.bin
llama_model_load: failed to open './models/ggml-alpaca-7b-q4.bin'
llama_bootstrap: failed to load model from './models/ggml-alpaca-7b-q4.bin'
Loading the model failed: failed loading model
➜  llama-cli 
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:v0.4  --instruction "What's an alpaca?" --topk  --model ./models/ggml-alpaca-7b-q4.bin         
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:v0.4  --instruction "What's an alpaca?" --topk 10000 --alpaca true --model ./models/ggml-alpaca-7b-q4.bin
llama_model_load: failed to open '/model.bin'
llama_bootstrap: failed to load model from '/model.bin'
Loading the model failed: failed loading model
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:v0.4  --instruction "What's an alpaca?" --topk 10000 --alpaca "true" --model ./models/ggml-alpaca-7b-q4.bin
llama_model_load: failed to open '/model.bin'
llama_bootstrap: failed to load model from '/model.bin'
Loading the model failed: failed loading model
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:v0.4  --instruction "What's an alpaca?" --topk 10000 --model ./models/ggml-alpaca-7b-q4.bin      
llama_model_load: failed to open './models/ggml-alpaca-7b-q4.bin'
llama_bootstrap: failed to load model from './models/ggml-alpaca-7b-q4.bin'
Loading the model failed: failed loading model
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:v0.4  --instruction "What's an alpaca?" --topk 10000 --model ./models/ggml-alpaca-7b-q4.bin --alpaca true
llama_model_load: failed to open './models/ggml-alpaca-7b-q4.bin'
llama_bootstrap: failed to load model from './models/ggml-alpaca-7b-q4.bin'
Loading the model failed: failed loading model
➜  llama-cli docker run -ti -e MODEL_PATH=/models/ggml-alpaca-7b-q4.bin --rm quay.io/go-skynet/llama-cli:v0.4  --instruction "What's an alpaca?" --topk 10000 --model ./models/ggml-alpaca-7b-q4.bin --alpaca true
llama_model_load: failed to open './models/ggml-alpaca-7b-q4.bin'
llama_bootstrap: failed to load model from './models/ggml-alpaca-7b-q4.bin'
Loading the model failed: failed loading model
➜  llama-cli docker run -ti -e MODEL_PATH=/models/ggml-alpaca-7b-q4.bin --rm quay.io/go-skynet/llama-cli:v0.4  --instruction "What's an alpaca?" --topk 10000                                                     
llama_model_load: failed to open '/models/ggml-alpaca-7b-q4.bin'
llama_bootstrap: failed to load model from '/models/ggml-alpaca-7b-q4.bin'
Loading the model failed: failed loading model
➜  llama-cli docker run -ti -e MODEL_PATH=/models/ggml-alpaca-7b-q4.bin --rm quay.io/go-skynet/llama-cli:v0.4  --instruction "What's an alpaca?" --topk 10000
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:latest  --instruction "What's an alpaca?" --topk 10000 --model ./models/ggml-alpaca-7b-q4.bin --alpaca true                                      
Unable to find image 'quay.io/go-skynet/llama-cli:latest' locally
latest: Pulling from go-skynet/llama-cli
3e440a704568: Already exists 
68a71c865a2c: Already exists 
670730c27c2e: Already exists 
5a7a2c95f0f8: Already exists 
db119aaf144b: Already exists 
92ac76a462cb: Pull complete 
5997e4205ef7: Pull complete 
33d4a96cf7d6: Pull complete 
c8a35e5c3705: Pull complete 
abacb88fc6dd: Pull complete 
756caf9df70c: Pull complete 
0a7f01cc46c5: Pull complete 
92ed784c8873: Pull complete 
Digest: sha256:3698dea8ece687b23903afe347cee47b37d6883053533eacfab26619b55b97c7
Status: Downloaded newer image for quay.io/go-skynet/llama-cli:latest
llama_model_load: failed to open './models/ggml-alpaca-7b-q4.bin'
llama_bootstrap: failed to load model from './models/ggml-alpaca-7b-q4.bin'
Loading the model failed: failed loading model
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:latest  --instruction "What's an alpaca?" --topk 10000 --alpaca rue --model ./models/ggml-alpaca-7b-q4.bin
llama_model_load: failed to open ''
llama_bootstrap: failed to load model from ''
Loading the model failed: failed loading model
➜  llama-cli 
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:latest  --instruction "What's an alpaca?" --topk 10000 --alpaca true --model ./models/ggml-alpaca-7b-q4.bin
llama_model_load: failed to open ''
llama_bootstrap: failed to load model from ''
Loading the model failed: failed loading model
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:latest  --alpaca true --instruction "What's an alpaca?" --topk 10000 --model ./models/ggml-alpaca-7b-q4.bin              
llama_model_load: failed to open ''
llama_bootstrap: failed to load model from ''
Loading the model failed: failed loading model
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:latest  --instruction "What's an alpaca?" --topk 10000 --model ./models/ggml-alpaca-7b-q4.bin --alpaca true              
llama_model_load: failed to open './models/ggml-alpaca-7b-q4.bin'
llama_bootstrap: failed to load model from './models/ggml-alpaca-7b-q4.bin'
Loading the model failed: failed loading model
➜  llama-cli docker run -ti --rm quay.io/go-skynet/llama-cli:latest  --instruction "What's an alpaca?" --topk 10000 --model ./models/ggml-alpaca-7b-q4.bin              
llama_model_load: failed to open './models/ggml-alpaca-7b-q4.bin'
llama_bootstrap: failed to load model from './models/ggml-alpaca-7b-q4.bin'
Loading the model failed: failed loading model

Even with the latest image.
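All of the failing runs above pass `--model` a path that only exists on the host; none of them mounts `./models` into the container with `-v`, so the container cannot see the file. A variant that should work, sketched under the assumption that ggml-alpaca-7b-q4.bin lives in `./models` on the host:

```shell
# Host directory with the model file (an assumption; adjust as needed).
MODEL_HOST_DIR="$PWD/models"

# Mount the host directory at /models in the container and reference the
# model by its container-side path, not the host-relative ./models/... one.
CMD="docker run -v ${MODEL_HOST_DIR}:/models -ti --rm \
quay.io/go-skynet/llama-cli:latest --instruction \"What's an alpaca?\" \
--topk 10000 --alpaca true --model /models/ggml-alpaca-7b-q4.bin"

echo "$CMD"
```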

@jonit-dev

The project is great, but I'd recommend refactoring the documentation to make it clearer.

It's kind of confusing to understand what to do.

I'm also preparing a docker-compose.yml file, which I can share when it's done.

@mudler
Owner

mudler commented Apr 7, 2023

Hi @jonit-dev ,

You need to pass docker a volume with -v so that a host-local path is mounted inside the container; see the instructions here:
https://github.com/go-skynet/llama-cli#using-other-models

For a docker compose file, have a look at #10

On the other hand, I completely agree; I will rework the documentation as soon as possible. There are many gaps, and other new features being added need to be documented too.

@mudler
Owner

mudler commented Apr 11, 2023

The documentation is getting a revamp, and @mkellerman put together a nice integration with chatgpt-web: an e2e docker-compose file would be just great!

@mudler
Owner

mudler commented Apr 12, 2023

Instructions have been updated to run with docker-compose, with multi-model support too: https://github.com/go-skynet/llama-cli#usage

I'll close this issue for now; if you are still facing issues, just re-open it!
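Following those updated instructions, a minimal docker-compose.yml might look like the sketch below (the service name, ports, flags, and the assumption that models sit in ./models on the host are all illustrative, not the project's canonical file):

```yaml
version: "3.6"
services:
  llama:
    image: quay.io/go-skynet/llama-cli:latest
    # Flags mirror the docker run examples earlier in this thread.
    command: api --context-size 700 --threads 4 --model /models/model.bin
    volumes:
      # Host ./models directory mounted at /models inside the container.
      - ./models:/models
    ports:
      - "8080:8080"
```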

@mudler mudler closed this as completed Apr 12, 2023