Run AI lets you run a threaded, low-level Python socket server for Stable Diffusion.
- Offline friendly: works completely locally with no internet connection (models must first be downloaded)
- Sockets: handles byte packets of an arbitrary size
- Threaded: asynchronously handle requests and responses
- Queue: requests and responses are handed off to a queue
- Auto-shutdown: server automatically shuts down after client disconnects
- Does not save images or logs to disk
This only matters if someone wants to create a production-ready version of this server to host on the internet. This server is not designed for that purpose. It was designed with a single use case in mind: running Stable Diffusion (and other AI models) locally. It was built for the Krita Stable Diffusion plugin, but it can work with any interface provided someone writes a client for it.
If someone wants to build in support for float32, I will merge the code, but this is not currently a priority feature.
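Taken together, the feature list above describes a threaded socket server that hands each request off to a queue for processing. A minimal sketch of that pattern in Python (illustrative only; not the actual runai implementation):

```python
import queue
import socketserver
import threading

requests: queue.Queue = queue.Queue()  # requests are handed off to a queue


class Handler(socketserver.BaseRequestHandler):
    def handle(self):
        data = self.request.recv(1024)        # receive a byte packet
        done: queue.Queue = queue.Queue(maxsize=1)
        requests.put((data, done))            # hand the request to the worker queue
        self.request.sendall(done.get())      # block until the worker responds


def worker():
    while True:
        data, done = requests.get()           # pull the next request off the queue
        done.put(data.upper())                # stand-in for model inference


threading.Thread(target=worker, daemon=True).start()
with socketserver.ThreadingTCPServer(("127.0.0.1", 8080), Handler) as server:
    server.serve_forever()
```

Here the handler blocks on a per-request response queue so the reply can be sent back on the same connection once a worker has processed the request.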
First run sh bin/install.sh to install the required models. These will be placed in ~/stablediffusion. See the Stable Diffusion directory structure section below for more information.
Easiest method
- Install docker
- Install nvidia-container-runtime: sudo apt install nvidia-container-toolkit
- Copy daemon.json to /etc/docker/daemon.json (if you already have a daemon.json file in that directory, just copy the contents into it)
- Run docker-compose up
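If you do not already have a daemon.json, a typical one that registers the NVIDIA runtime looks like this (per the nvidia-container-runtime documentation; the file shipped with this project may differ):

```json
{
    "runtimes": {
        "nvidia": {
            "path": "nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}
```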
All of the following commands are contained in bin/dc; you can add it to your path or run it directly.
- ./bin/dc start: run the server
- ./bin/dc bash: enter a shell
- ./bin/dc updatereqs: update pip requirements
- ./bin/dc build: build the server
- ./bin/dc /app/bin/clean.sh: run the clean script
- docker-compose up: build and start the services
- docker-compose down: stop and remove all services
- docker-compose build: rebuild all services
- docker-compose ps: list all running containers
- docker-compose logs: view the output from containers
- docker-compose exec <service> <command>: execute a command in a running container
- Replace <service> with the name of the service defined in the docker-compose.yml file, and <command> with the command you want to run.
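For example, assuming the service is named runai in docker-compose.yml (check your own file for the actual name), docker-compose exec runai bash opens a shell inside the running container.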
- Install CUDA Toolkit 11.7
- Install miniconda
- Create the conda environment if you have not already (for example, conda create -n runai python=3.10), then activate it: conda activate runai
- Install requirements: pip install -r requirements.txt
Create a lib folder in the root of the project:
mkdir -p lib/torch
Copy the following into lib/torch/:
lib/torch/bin/torch_shm_manager
lib/torch/lib/libtorch_global_deps.so
Your directory structure may differ, but it will likely look something like this:
/home/<user>/miniconda3/envs/ksd-build/lib/python3.10/site-packages/torch/bin/torch_shm_manager
/home/<user>/miniconda3/envs/ksd-build/lib/python3.10/site-packages/torch/lib/libtorch_global_deps.so
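With the environment active, a small Python sketch can locate the installed torch package and copy both files regardless of your environment path (assumes PyTorch is importable and is run from the project root):

```python
import shutil
from pathlib import Path

import torch

# Locate the torch package inside the active environment.
torch_dir = Path(torch.__file__).parent

# Copy the two files the server needs into lib/torch/.
for rel in ("bin/torch_shm_manager", "lib/libtorch_global_deps.so"):
    src = torch_dir / rel
    dst = Path("lib/torch") / rel
    dst.parent.mkdir(parents=True, exist_ok=True)
    shutil.copy2(src, dst)
    print(f"copied {src} -> {dst}")
```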
- git
- conda
- a CUDA-capable GPU
Build the server
./bin/buildlinux.sh
The standalone server will be in the dist directory.
Run the server
conda activate runai
python server.py
Clients establish a connection with the server over a socket and send a JSON object encoded as a byte string split into packets. An EOM (end of message) signal is sent to indicate the end of the message.
The server assembles the packets, decodes the JSON object and processes the request. Once processing is complete the server will send a response back to the client.
It is up to the client to reassemble the packets, decode the byte string to JSON and handle the message.
For an example client, take a look at the connect.py file in the Krita Stable Diffusion Plugin which uses this server.
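A minimal client sketch in Python, assuming a 1024-byte packet size and a null-byte EOM marker (the real values, the EOM signal, and the request schema should all be taken from connect.py):

```python
import json
import socket

PACKET_SIZE = 1024  # assumption: must match the server's --packet-size
EOM = b"\x00"       # assumption: placeholder end-of-message marker; see connect.py


def send_request(sock: socket.socket, payload: dict) -> None:
    """Encode the request as JSON bytes, send it in fixed-size packets, then EOM."""
    data = json.dumps(payload).encode("utf-8")
    for i in range(0, len(data), PACKET_SIZE):
        sock.sendall(data[i:i + PACKET_SIZE])
    sock.sendall(EOM)


def receive_response(sock: socket.socket) -> dict:
    """Reassemble packets until the EOM marker, then decode the JSON byte string."""
    chunks = []
    while True:
        packet = sock.recv(PACKET_SIZE)
        if not packet:
            break
        if packet.endswith(EOM):
            chunks.append(packet[:-len(EOM)])
            break
        chunks.append(packet)
    return json.loads(b"".join(chunks).decode("utf-8"))


with socket.create_connection(("127.0.0.1", 8080)) as sock:
    send_request(sock, {"prompt": "a photo of a cat"})  # hypothetical request fields
    print(receive_response(sock))
```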
This is the recommended and default setup for runai
Default directory structure for runai Stable Diffusion
These models are required to run Stable Diffusion
- CLIP files for the CLIP model
- CompVis safety checker model (used for NSFW filtering)
- openai clip-vit-large-patch14 model
├── ~/stablediffusion
│   ├── CLIP
│   ├── CompVis
│   │   ├── stable-diffusion-safety-checker
│   ├── openai
│   │   ├── clip-vit-large-patch14
These are the base models to run a particular version of Stable Diffusion.
- runwayml: Base models for Stable Diffusion v1
- stabilityai: Base models for Stable Diffusion v2
├── ~/stablediffusion
│   ├── runwayml
│   │   ├── stable-diffusion-inpainting
│   │   ├── stable-diffusion-v1-5
│   ├── stabilityai
│   │   ├── stable-diffusion-2-1-base
│   │   ├── stable-diffusion-2-inpainting
- v1 should be a directory containing models using Stable Diffusion v1
- v2 should be a directory containing models using Stable Diffusion v2
You may place diffusers folders, ckpt files, and safetensors files in these directories.
├── ~/stablediffusion
│   ├── v1
│   │   ├── <folder> (diffusers directory)
│   │   ├── <file>.ckpt
│   │   ├── <file>.safetensors
│   ├── v2
│   │   ├── <folder> (diffusers directory)
│   │   ├── <file>.ckpt
│   │   ├── <file>.safetensors
If you are using Automatic1111, you can place your checkpoints in the webui models folder as you typically would; however, the directory structure separating v1 models from v2 models is required for now.
This allows you to use the same checkpoints for both Automatic1111 webui and this server.
For example, if your webui directory looks like this:
├── /home/USER/stable-diffusion-webui/models/Stable-diffusion
│   ├── <some_checkpoint_file>.ckpt
│   ├── <some_other_checkpoint_file>.ckpt
│   ├── <some_other_checkpoint_file_v2>.ckpt
You would reorganize it like this:
├── /home/USER/stable-diffusion-webui/models/Stable-diffusion
│   ├── v1
│   │   ├── <some_checkpoint_file>.ckpt
│   │   ├── <some_other_checkpoint_file>.ckpt
│   ├── v2
│   │   ├── <some_other_checkpoint_file_v2>.ckpt
You would then set BASE_DIR to /home/USER/stable-diffusion-webui/models/Stable-diffusion
First install pyinstaller
pip install pyinstaller
Then build the executable
./bin/buildlinux.sh
Test
cd ./dist/runai
./runai
This should start a server.
Connect a client to see if it is working properly (for example, the Python client sketch above, or the Krita Stable Diffusion plugin).
The following flags and options are available:
- --port (int): port to run the server on
- --host (str): host to run the server on
- --timeout: whether to time out after failing to receive a client connection; pass this flag for true, otherwise the server will not time out
- --packet-size (int): size of byte packets to transmit to and from the client
- --model-base-path (str): base directory for checkpoints
- --max-client-connections (int): maximum number of client connections to accept
Example
python server.py --port 8080 --host https://0.0.0.0 --timeout
This will start a server listening on https://0.0.0.0:8080 that will time out after a set number of failed attempts to receive a client connection.
Requests are sent to the server as a JSON encoded byte string. The JSON object should look as follows:
{
TODO
}
The server does not automatically load a model. It waits for the client to send a request which contains a model path and name. The server will determine which version of stable diffusion is in use and which model has been selected to generate images. It will also determine the best model to load based on the list of available types in the directory provided.
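As an illustration only (the actual logic lives in the server source), model resolution along the lines described above might look like this:

```python
from pathlib import Path


def find_model(base_dir: str, version: str, name: str):
    """Hypothetical lookup: prefer a diffusers directory, then safetensors, then ckpt."""
    root = Path(base_dir) / version          # "v1" or "v2" subdirectory
    candidate = root / name
    if candidate.is_dir():                   # diffusers folder
        return candidate
    for ext in (".safetensors", ".ckpt"):    # fall back to single-file checkpoints
        path = root / (name + ext)
        if path.exists():
            return path
    return None
```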
- StableDiffusionRequestQueueWorker.callback: handle routes and dispatch to functions
- socket_server.message_client: send a message to the client