A library and CLI tool for distributing models using container registries.
Model Distribution is a Go library and CLI tool that allows you to package, push, pull, and manage models using container registries. It provides a simple API and command-line interface for working with models in GGUF format.
- Push models to container registries
- Pull models from container registries
- Local model storage
- Model metadata management
- Command-line interface for all operations
- GitHub workflows for automated model packaging
- Support for both GGUF and safetensors model formats
# Build the CLI tool
make build
# Pull a model from a registry
./bin/model-distribution-tool pull registry.example.com/models/llama:v1.0
# Package a model and push to a registry
./bin/model-distribution-tool package ./model.gguf registry.example.com/models/llama:v1.0
# Package a model with license files and push to a registry
./bin/model-distribution-tool package --licenses license1.txt --licenses license2.txt ./model.gguf registry.example.com/models/llama:v1.0
# Package a model with a default context size and push to a registry
./bin/model-distribution-tool package --context-size 2048 ./model.gguf registry.example.com/models/llama:v1.0
# Push a model from the content store to the registry
./bin/model-distribution-tool push registry.example.com/models/llama:v1.0
# List all models in the local store
./bin/model-distribution-tool list
# Get information about a model
./bin/model-distribution-tool get registry.example.com/models/llama:v1.0
# Get the local file path for a model
./bin/model-distribution-tool get-path registry.example.com/models/llama:v1.0
# Remove a model from the local store (untags without deleting when multiple tags refer to it)
./bin/model-distribution-tool rm registry.example.com/models/llama:v1.0
# Force removal of a model from the local store, even when multiple tags refer to it
./bin/model-distribution-tool rm --force sha256:0b329b335467cccf7aa219e8f5e1bd65e59b6dfa81cfa42fba2f8881268fbf82
# Tag a model with an additional reference
./bin/model-distribution-tool tag registry.example.com/models/llama:v1.0 registry.example.com/models/llama:latest
For more information about the CLI tool, run:
./bin/model-distribution-tool --help
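As a quick end-to-end check, you can pull a model and then resolve where its GGUF file lives on disk, using only the commands shown above (the registry reference is a placeholder):
# Pull a model into the local store, then print its local file path
./bin/model-distribution-tool pull registry.example.com/models/llama:v1.0
./bin/model-distribution-tool get-path registry.example.com/models/llama:v1.0
The same operations are available from Go via the library API: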
import (
	"context"
	"fmt"
	"os"

	"github.com/docker/model-distribution/pkg/distribution"
)
// Create a new client
client, err := distribution.NewClient("/path/to/cache")
if err != nil {
// Handle error
}
// Pull a model
err = client.PullModel(context.Background(), "registry.example.com/models/llama:v1.0", os.Stdout)
if err != nil {
// Handle error
}
// Get a model
model, err := client.GetModel("registry.example.com/models/llama:v1.0")
if err != nil {
// Handle error
}
// Get the GGUF file path
modelPath, err := model.GGUFPath()
if err != nil {
// Handle error
}
fmt.Println("Model path:", modelPath)
// List all models
models, err := client.ListModels()
if err != nil {
// Handle error
}
// Delete a model
err = client.DeleteModel("registry.example.com/models/llama:v1.0", false)
if err != nil {
// Handle error
}
// Tag a model
err = client.Tag("registry.example.com/models/llama:v1.0", "registry.example.com/models/llama:latest")
if err != nil {
// Handle error
}
// Push a model
err = client.PushModel("registry.example.com/models/llama:v1.0")
if err != nil {
// Handle error
}
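Putting the snippets above together, a minimal end-to-end program might look like the following sketch (the store path and registry reference are placeholders, and error handling is collapsed to log.Fatal):
package main

import (
	"context"
	"fmt"
	"log"
	"os"

	"github.com/docker/model-distribution/pkg/distribution"
)

func main() {
	// Create a client backed by a local store directory (placeholder path).
	client, err := distribution.NewClient("/path/to/cache")
	if err != nil {
		log.Fatal(err)
	}

	// Pull a model, streaming progress output to stdout.
	ref := "registry.example.com/models/llama:v1.0"
	if err := client.PullModel(context.Background(), ref, os.Stdout); err != nil {
		log.Fatal(err)
	}

	// Resolve the model and print the local GGUF file path.
	model, err := client.GetModel(ref)
	if err != nil {
		log.Fatal(err)
	}
	path, err := model.GGUFPath()
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println("Model path:", path)
}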
This project provides GitHub workflows that automate packaging both GGUF and Safetensors models and promoting them from staging to production environments.
The model promotion process follows a two-step workflow:
- Package and Push to Staging: Use either:
  - package-gguf-model.yml to download a pre-built GGUF model and push it to the aistaging namespace
  - package-safetensors-model.yml to clone a safetensors model from HuggingFace, convert it to GGUF, and push it to the aistaging namespace
- Promote to Production: Use promote-model-to-production.yml to copy the model from the staging (aistaging) to the production (ai) namespace
The following GitHub secrets must be configured:
- DOCKER_USER: DockerHub username for production namespace
- DOCKER_OAT: DockerHub access token for production namespace
- DOCKER_USER_STAGING: DockerHub username for staging namespace (aistaging)
- DOCKER_OAT_STAGING: DockerHub access token for staging namespace
Note: The current secrets are configured to write to the ai production namespace. If you need to write to a different namespace, you'll need to update the DOCKERHUB_USERNAME and DOCKERHUB_TOKEN secrets accordingly.
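If you manage repository secrets with the GitHub CLI, configuring them might look like the following sketch (all values are placeholders):
# Store DockerHub credentials as repository secrets (placeholder values)
gh secret set DOCKER_USER --body "my-dockerhub-user"
gh secret set DOCKER_OAT --body "my-dockerhub-access-token"
gh secret set DOCKER_USER_STAGING --body "my-staging-user"
gh secret set DOCKER_OAT_STAGING --body "my-staging-access-token"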
Use the Package GGUF model workflow to download a pre-built GGUF model and push it to the staging environment.
Single Model Example:
- Go to Actions → Package GGUF model → Run workflow
- Fill in the inputs:
  - GGUF file URL: https://huggingface.co/unsloth/SmolLM2-135M-Instruct-GGUF/resolve/main/SmolLM2-135M-Instruct-Q4_K_M.gguf
  - Registry repository: smollm2
  - Tag: 135M-Q4_K_M
  - License URL: https://huggingface.co/datasets/choosealicense/licenses/resolve/main/markdown/apache-2.0.md
This will create: aistaging/smollm2:135M-Q4_K_M
Multi-Model Example:
For packaging multiple models at once, use the models_json input:
[
{
"gguf_url": "https://huggingface.co/unsloth/Qwen3-32B-GGUF/resolve/main/Qwen3-32B-Q4_K_XL.gguf",
"repository": "qwen3-gguf",
"tag": "32B-Q4_K_XL",
"license_url": "https://huggingface.co/datasets/choosealicense/licenses/resolve/main/markdown/apache-2.0.md"
},
{
"gguf_url": "https://huggingface.co/unsloth/Qwen3-32B-GGUF/resolve/main/Qwen3-32B-Q8_0.gguf",
"repository": "qwen3-gguf",
"tag": "32B-Q8_0",
"license_url": "https://huggingface.co/datasets/choosealicense/licenses/resolve/main/markdown/apache-2.0.md"
}
]
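If you prefer a terminal to the Actions UI, the run can also be dispatched with the GitHub CLI; a sketch, assuming models_json is exposed as a workflow_dispatch input and that models.json holds the array above:
# Dispatch the packaging workflow, passing the JSON array as the models_json input
gh workflow run package-gguf-model.yml -f models_json="$(cat models.json)"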
Use the Package Safetensors model workflow to clone a Safetensors model from HuggingFace, convert it to GGUF format, and push it to the staging environment.
Single Model Example:
- Go to Actions → Package Safetensors model → Run workflow
- Fill in the inputs:
  - HuggingFace repository: HuggingFaceTB/SmolLM2-135M-Instruct
  - Registry repository: smollm2-safetensors
  - Weights: 135M
  - Quantization: Q4_K_M (default)
  - Llama.cpp tag: full-b5763 (default)
  - License URL: https://huggingface.co/datasets/choosealicense/licenses/resolve/main/markdown/apache-2.0.md
This will create: aistaging/smollm2-safetensors:135M-Q4_K_M
Multi-Model Example:
For packaging multiple safetensors models at once, use the models_json input:
[
{
"hf_repository": "microsoft/DialoGPT-medium",
"repository": "dialogpt",
"weights": "medium",
"quantization": "Q4_K_M",
"license_url": "https://huggingface.co/datasets/choosealicense/licenses/resolve/main/markdown/mit.md"
},
{
"hf_repository": "microsoft/DialoGPT-large",
"repository": "dialogpt",
"weights": "large",
"quantization": "Q8_0",
"license_url": "https://huggingface.co/datasets/choosealicense/licenses/resolve/main/markdown/mit.md"
}
]
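The same GitHub CLI dispatch pattern applies here; a sketch, under the same assumption that models_json is a workflow_dispatch input:
gh workflow run package-safetensors-model.yml -f models_json="$(cat safetensors-models.json)"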
Once your model is successfully packaged in staging, use the Promote Model to Production workflow to copy it to the production namespace.
- Go to Actions → Promote Model to Production → Run workflow
- Fill in the inputs:
  - Image: smollm2:135M-Q4_K_M (must match the repository:tag from Step 1)
  - Source namespace: aistaging (default, can be changed if needed)
  - Target namespace: ai (default, can be changed if needed)
This will copy: aistaging/smollm2:135M-Q4_K_M → ai/smollm2:135M-Q4_K_M
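Promotion can also be dispatched from a terminal; a sketch, assuming hypothetical input names image, source_namespace, and target_namespace (check the workflow file for the exact names):
gh workflow run promote-model-to-production.yml \
  -f image=smollm2:135M-Q4_K_M \
  -f source_namespace=aistaging \
  -f target_namespace=ai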
Let's walk through packaging and promoting a pre-built GGUF model:
1. Package to Staging:
   - Workflow: Package GGUF model
   - GGUF file URL: https://huggingface.co/unsloth/SmolLM2-135M-Instruct-GGUF/resolve/main/SmolLM2-135M-Instruct-Q4_K_M.gguf
   - Registry repository: smollm2
   - Tag: 135M-Q4_K_M
   - License URL: https://huggingface.co/datasets/choosealicense/licenses/resolve/main/markdown/apache-2.0.md
   - Result: aistaging/smollm2:135M-Q4_K_M
2. Promote to Production:
   - Workflow: Promote Model to Production
   - Image: smollm2:135M-Q4_K_M
   - Result: ai/smollm2:135M-Q4_K_M
Your model is now available in production and can be pulled using:
docker pull ai/smollm2:135M-Q4_K_M
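It can also be fetched into this tool's local content store (assuming Docker Hub is the default registry for short references):
./bin/model-distribution-tool pull ai/smollm2:135M-Q4_K_M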
Let's walk through packaging and promoting a safetensors model:
1. Package to Staging:
   - Workflow: Package Safetensors model
   - HuggingFace repository: HuggingFaceTB/SmolLM2-135M-Instruct
   - Registry repository: smollm2-safetensors
   - Weights: 135M
   - Quantization: Q4_K_M
   - Llama.cpp tag: full-b5763
   - License URL: https://huggingface.co/datasets/choosealicense/licenses/resolve/main/markdown/apache-2.0.md
   - Result: aistaging/smollm2-safetensors:135M-Q4_K_M
2. Promote to Production:
   - Workflow: Promote Model to Production
   - Image: smollm2-safetensors:135M-Q4_K_M
   - Result: ai/smollm2-safetensors:135M-Q4_K_M
Your converted model is now available in production and can be pulled using:
docker pull ai/smollm2-safetensors:135M-Q4_K_M
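As with the GGUF example, the converted model can be pulled with the CLI tool and its local GGUF path resolved:
./bin/model-distribution-tool pull ai/smollm2-safetensors:135M-Q4_K_M
./bin/model-distribution-tool get-path ai/smollm2-safetensors:135M-Q4_K_M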