-
Notifications
You must be signed in to change notification settings - Fork 370
Open
Description
System Info
Information
- Docker
- The CLI directly
Tasks
- An officially supported command
- My own modifications
Reproduction
- I have downloaded the model in the path /data/models/bge-reranker-v2-m3
- excute the commond
docker run -d \ --name text-embedding-service \ --restart always \ --gpus all \ -p 8082:80 \ -v /data/models/bge-m3:/data/ \ -v /data/models/.cache:/root/.cache \ -e MODEL_ID=BAAI/bge-m3 \ -e TOKENIZATION_WORKERS=10 \ -e MAX_BATCH_REQUESTS=10 \ -e MAX_CLIENT_BATCH_SIZE=500 \ -e HOSTNAME=0.0.0.0 \ ghcr.io/huggingface/text-embeddings-inference:1.5 - It will download the model every time. finally use the mount path
Expected behavior
If I download the model and mounted, there is no need to download it again.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels