
xinference-docker-built-in

This project allows easy deployment of any built-in LLM of Xinference using Docker and Docker Compose. Note that this setup only works on Linux machines with a dedicated NVIDIA graphics card. For other solutions, check the Xinference docs; for instance, you can run the xinference library natively on Mac machines.

Installation and usage

The pre-built image is available on Docker Hub under the name biocypher/xinference-builtin as a multi-arch image, built for the amd64 and arm64 architectures. You can pull it using docker pull biocypher/xinference-builtin. If you want to build the image yourself, you can use the Dockerfile in this repository (see step 2 below).
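If you want to confirm which architectures the published image covers, one way (assuming a Docker installation that includes the buildx plugin, as recent versions do) is to inspect its manifest:

    # Pull the pre-built image from Docker Hub
    docker pull biocypher/xinference-builtin

    # List the platform variants (amd64, arm64) in the image's manifest
    docker buildx imagetools inspect biocypher/xinference-builtin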

  1. Install the nvidia-docker libraries (see the NVIDIA Container Toolkit documentation for details).

  2. Run docker compose pull to use a pre-built image or docker compose build to build it locally.

  3. Run docker compose up -d. This starts a container in the background that downloads and runs the zephyr-7b model. To change the model, change the env_file parameter in the docker-compose.yml file, for instance to llama-2-13b.env (see the sketch after this list).

  4. Optional: Two example environment files can be commented and un-commented in the docker-compose.yml. The llama-2-chat file shows how to use models that require a Hugging Face access token, provided the token is placed in the .env file (see the sketch after this list).
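Taken together, a typical first run looks roughly like this (a sketch that assumes a Linux host with the NVIDIA Container Toolkit already set up; the nvidia-smi check is the standard sample workload from the NVIDIA documentation, not something provided by this repository):

    # Optional sanity check: the GPU should be visible inside containers
    docker run --rm --gpus all ubuntu nvidia-smi

    # Fetch the pre-built image (or build it locally with: docker compose build)
    docker compose pull

    # Start the container in the background; it downloads and serves the model
    # selected via the env_file entry in docker-compose.yml (zephyr-7b by default)
    docker compose up -d

    # Follow the logs to watch the model download and the server come up
    docker compose logs -f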
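For gated models such as the Llama 2 chat variants, the token goes into the .env file. A minimal sketch could look like the following; the variable name HUGGING_FACE_HUB_TOKEN is an assumption (it is the name the Hugging Face libraries read by default), so check the llama-2-chat example file for the name this project actually uses:

    # .env (sketch) -- the variable name is an assumption, see the llama-2-chat example
    HUGGING_FACE_HUB_TOKEN=hf_xxxxxxxxxxxxxxxxxxxx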

Info

You can find a list of the available LLMs in two ways:

  1. Set the environment variable LIST=1 in the active .env file and run docker compose up. The container will run attached until it prints a list of all available LLMs (see the sketch after this list).

  2. Consult the (possibly not up-to-date) list in the Xinference documentation.
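A sketch of the first option, assuming the LIST variable is read by the container's entrypoint as described above:

    # Add LIST=1 to the active .env file (or edit the file by hand)
    echo "LIST=1" >> .env

    # Run attached; the container prints the names of all built-in LLMs,
    # after which it can be stopped with Ctrl+C
    docker compose up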
