Persistent AI workspaces for ComfyUI and GPU-backed creation apps.
Run ComfyUI, VS Code, Jupyter, vLLM, Ollama, n8n, and workflow APIs on your own GPU server — with real folders, model vaults, snapshots, HTTPS routes, GPU leases, and no Kubernetes.
Join the waitlist · Discord · Status · Why Qiln exists · How it works · First release scope · License
Qiln turns a bare-metal GPU server into managed, persistent AI workspaces.
The first wedge is ComfyUI because ComfyUI users feel the pain most directly: large models, fragile custom nodes, path-sensitive workflows, disposable GPU pods, scattered outputs, exposed ports, and environments that disappear right when the workflow starts to matter.
Qiln gives that work a stable home:
- persistent ComfyUI workspaces
- private model and dataset vaults
- real folders for inputs, outputs, configs, and custom nodes
- VS Code, Jupyter, n8n, FastAPI, vLLM, Ollama, and Open WebUI companion apps
- snapshots before risky changes
- stable HTTPS routes
- GPU lease primitives for scarce GPU capacity
- browser-based admin and user flows
- experimental: GUI streaming to an HTML canvas for selected graphical apps
- no Kubernetes required
This repository is the future public source home for Qiln Community: the open-source, self-managed engine for running Qiln on GPU hardware you own.
Qiln is developed by IonSignal, Inc..
- Status
- Why Qiln exists
- Qiln Community, Managed, and Appliance
- Who Qiln is for
- What Qiln does
- Core concepts
- How it works
- What Qiln is not
- First release scope
- Planned repository layout
- Documentation
- Stack
- License
- Security
Qiln is pre-release infrastructure.
The public source will open here for the first usable alpha, v0.1.0. Until then, this repository is intentionally minimal and serves as the future public source home for Qiln Community.
The private development branch currently includes core primitives for:
- Incus-based per-user workspace provisioning
- declarative blueprint loading and validation
- ZFS-backed vault creation, cloning, quotas, and snapshots
- GPU lease lifecycle for supported workloads
- automatic HTTPS routing through Caddy
- experimental Selkies-backed graphical app streaming
- NATS-based internal eventing
- Postgres-backed state
- Fastify/tRPC APIs
- browser-based admin UI for workspaces, vaults, blueprints, users, and operations
The public v0.1.0 release is gated on:
- finalized admin and user UI flows
- a usable vault/file browser with image previews and file editing
- model import flows, including Hugging Face integration
- documented Incus, ZFS, NVIDIA, and Caddy setup
- stable blueprint examples for ComfyUI and common companion workloads
- preparation of the source tree for external review and contribution
Qiln is not publicly installable yet.
If you own GPUs and want to help shape the alpha, join the waitlist.
We are prioritizing:
- Design partners running real GPU hardware with real users.
- Studios, labs, founders, and internal teams with persistent ComfyUI or GPU workspace pain.
- Discord community members willing to test, break things, and give feedback.
- GitHub stars/watchers who want to follow the public release.
Disposable GPU pods are fine for disposable jobs.
They are a bad fit for serious ComfyUI and GPU-backed creation workflows.
Once you have private checkpoints, LoRAs, VAEs, ControlNets, datasets, custom nodes, workflow JSON, output folders, wrapper APIs, and automation scripts, the job is no longer “rent a GPU for an hour.”
The job is:
Open your AI workspace and keep building.
Qiln is built around that job.
Instead of rebuilding a GPU pod, re-uploading models, fixing custom nodes, exposing random ports, and worrying about credits, Qiln gives users a persistent workspace with the files, routes, GPU access, snapshots, and companion tools needed to keep working.
ComfyUI is the beachhead. The broader product is a workspace layer for GPU-backed creative and AI development environments.
Qiln has three related delivery models.
Open-source, self-managed Qiln for people who own GPU hardware.
Use it to turn your own GPU server into managed AI workspaces without Kubernetes.
Qiln Community is:
- Apache 2.0 core
- no CLA
- single-node first
- Incus-based isolated workspaces
- ZFS-backed vaults, clones, and snapshots
- declarative blueprints
- GPU lease primitives
- Caddy HTTPS routing
- pre-release alpha until public
v0.1.0
Hosted persistent AI workspaces run by IonSignal.
Use it when you want persistent ComfyUI and GPU-backed creation workspaces without operating GPU infrastructure.
Qiln Managed is built for:
- persistent ComfyUI workspaces
- private model storage
- direct checkpoint, LoRA, VAE, and ControlNet uploads
- custom node onboarding
- VS Code, Jupyter, n8n, FastAPI, vLLM, Ollama, and companion services
- stable HTTPS routes
- snapshots
- reserved high-VRAM GPU capacity
- experimental browser-streamed graphical apps for selected GUI workloads
Qiln installed and managed on customer-owned hardware.
Use it when your studio, lab, or organization needs a private AI workspace appliance on owned GPUs, servers, racks, or datacenter infrastructure.
Qiln Appliance can include:
- custom blueprints
- identity integration
- backup and restore planning
- monitoring
- updates
- support retainers
- deployment guidance for owned GPU servers
Qiln is for people who need persistent GPU-backed AI workspaces instead of disposable pods or fragile local installs.
Good fits include:
- commercial ComfyUI users with private models, custom nodes, and serious output workflows
- creative studios giving artists sanctioned, persistent GPU workspaces
- founders building visual AI products from ComfyUI or workflow backends
- internal AI and innovation teams replacing unmanaged GPU cloud usage
- research groups sharing a few expensive GPUs
- small labs that want Jupyter, ComfyUI, vLLM, Ollama, and Open WebUI without Kubernetes
- serious homelabbers who want team-grade isolation on their own GPU server
Qiln is especially relevant if you are currently dealing with:
- rebuilt RunPod-style GPU pods
- broken custom node environments
- duplicated model folders across machines
- exposed ComfyUI ports
- unstable URLs
- no clean path from visual workflow to API endpoint
- unmanaged GPU usage inside a team
- Kubernetes pressure before you have a platform team
Qiln turns one bare-metal GPU machine into a managed AI workspace host.
A Qiln workspace can include:
- an isolated Incus environment
- one or more ZFS-backed vaults
- persistent folders for models, datasets, inputs, outputs, configs, and code
- GPU lease access for supported workloads
- snapshots and rollback paths
- HTTPS routes through Caddy
- app-specific configuration from a declarative blueprint
- browser-accessible workspace, file, and admin UI
Qiln starts with ComfyUI, then expands through companion blueprints for the tools people already need around their workflows:
- VS Code / code-server
- JupyterLab
- n8n
- FastAPI wrappers
- vLLM
- Ollama
- Open WebUI
- model and file tools
The goal is not to become a generic PaaS.
The goal is to make GPU-backed AI creation environments persistent, understandable, private, and recoverable.
Qiln is organized around workspace objects, not raw infrastructure objects.
The important concepts are:
- Workspaces — persistent, isolated environments for users, projects, and apps.
- Apps — ComfyUI, VS Code, Jupyter, n8n, vLLM, Ollama, Open WebUI, FastAPI wrappers, and similar AI workflow tools.
- Blueprints — declarative definitions for repeatable workspace provisioning.
- Model vaults — storage for checkpoints, LoRAs, VAEs, ControlNets, embeddings, and other model files.
- Dataset vaults — storage for source datasets, references, training data, and project inputs.
- Output folders — persistent locations for generated images, video, logs, artifacts, and exports.
- GPU leases — controlled access to scarce GPU capacity for workspaces that need acceleration.
- Snapshots — recovery points before risky dependency, model, workflow, or custom-node changes.
- Routes — stable HTTPS URLs for workspace apps.
- Endpoints — API paths for workflows that need to be called by other systems.
- Projects and users — the organizational layer for teams, studios, labs, and internal groups.
- Graphical sessions — Selkies-backed sessions that stream a complete GUI to an HTML canvas in the browser.
Under the hood, Qiln uses Incus, ZFS, Caddy, NATS, Postgres, Fastify, tRPC, and Vue 3.
Those are ingredients. The product is the workspace.
At a high level:
- An admin installs Qiln on a bare-metal GPU server.
- Qiln prepares the host for isolated workspaces, storage vaults, GPU access, and HTTPS routing.
- An admin imports or configures blueprints for ComfyUI and companion apps.
- A user launches a persistent workspace from a blueprint.
- Qiln provisions an isolated Incus environment.
- Qiln mounts the required vaults and folders.
- Qiln attaches GPU access when the workload needs acceleration.
- Qiln routes the workspace through HTTPS using Caddy.
- Users work from the browser while keeping models, outputs, configs, code, and custom nodes in place.
- Admins manage workspaces, vaults, users, leases, snapshots, routes, and events from the browser UI.
For supported apps, Qiln can integrate more deeply.
For ComfyUI, the goal is to make model management, custom node work, output browsing, snapshots, file operations, GPU access, and route management feel native instead of like infrastructure chores.
If you need occasional cheap GPU-hours, disposable experiments, or extremely spiky batch jobs, raw GPU clouds may be the right answer.
RunPod, Vast.ai, Lambda, Modal, and similar platforms can be good choices when persistence does not matter.
Qiln is for the cases where persistence does matter:
- private models
- custom nodes
- stable routes
- real folders
- repeatable workflows
- snapshots
- team access
- workflow endpoints
- predictable infrastructure ownership or reserved capacity
They rent GPUs. Qiln gives you a persistent AI workspace around GPU capacity.
Kubernetes, Run:ai, Kubeflow, Slurm, and related systems can make sense for large GPU fleets, formal cluster operations, and teams with platform engineers.
Qiln is for the studio, lab, founder, or small team before that point.
It is the answer when you have one serious GPU server, real users, real workflows, and no desire to become a platform team just to run ComfyUI, Jupyter, vLLM, Ollama, or workflow APIs.
Qiln is not trying to host every app.
Good blueprint fits include:
- ComfyUI
- VS Code / code-server
- JupyterLab
- n8n
- FastAPI workflow wrappers
- vLLM
- Ollama
- Open WebUI
- TensorBoard
- Kohya
- model and file tools
Bad fits include:
- WordPress
- CRM systems
- billing systems
- generic SaaS hosting
- arbitrary production microservices
- managed databases
Qiln should make AI workspaces better, not become a generic cloud platform.
Qiln can run tools like Open WebUI, Ollama, ComfyUI, JupyterLab, and vLLM.
Those are apps inside the workspace.
Qiln is the workspace layer that provisions, isolates, routes, snapshots, and attaches storage and GPU capacity to those apps.
The alpha is aimed at trusted teams, studios, labs, founders, and homelabs.
Do not expose untrusted public tenants to Qiln until the public security model has been hardened and documented.
v0.1.0 is focused on one machine.
The first public alpha is expected to support:
- one bare-metal GPU server
- NVIDIA-first GPU support
- Incus for workspace isolation
- ZFS for vaults, clones, quotas, and snapshots
- Caddy for HTTPS routing
- NATS for internal eventing
- Postgres for state
- Fastify/tRPC APIs
- Vue 3 browser UI
- browser-based admin workflows
- workspace and vault management
- ComfyUI-first blueprint support
- companion blueprints for common AI workloads
The first release is single-node by design.
Multi-node support may come later, but Qiln is being built to be excellent on one serious GPU server before becoming a cluster product.
Qiln is not recommended for production or untrusted multi-tenant environments until the public alpha has been hardened.
When the source opens, this repository is expected to contain:
apps/
qiln/ # runnable Qiln server, API, admin UI, and user UI
packages/
core/ # shared schemas, events, permissions, NATS/RPC utilities
engine/ # orchestration engine, Incus/ZFS/GPU/route services
ui/ # shared UI components
blueprints/
comfyui/
vscode/
jupyterlab/
n8n/
fastapi/
vllm/
ollama/
open-webui/
docs/
concepts/
install/
blueprints/
guides/
reference/This may change before v0.1.0.
The goal is a clean public tree, not a source dump.
Public documentation will live on the Qiln website.
This repository will link to the docs when v0.1.0 opens.
The documentation set is expected to cover:
- concepts
- architecture
- installation
- ComfyUI workspace setup
- model vaults
- dataset vaults
- output folders
- file browser usage
- blueprint authoring
- GPU lease behavior
- snapshots and rollback
- HTTPS routes
- endpoint patterns
- backup and restore planning
- security model and limitations
- reference material
Qiln is built with:
- Incus
- ZFS
- Caddy
- NATS
- Postgres
- Fastify
- tRPC
- Vue 3
The stack is chosen to make single-node GPU workspaces practical: isolated environments, real filesystems, copy-on-write storage, simple HTTPS routing, evented operations, and a browser UI.
Qiln core will be released under Apache 2.0.
The LICENSE file is included now to make the intended public-source license explicit before the source lands.
No CLA.
IonSignal may offer managed workspaces, appliance deployments, commercial modules, integrations, and support separately.
The open-source core is intended to remain useful on its own for self-managed, single-node AI workspaces on GPU hardware you own.
Qiln is pre-release infrastructure and should be treated accordingly.
Qiln Community alpha is for trusted users and controlled environments.
Do not expose untrusted public tenants to Qiln until the security model has been hardened and documented.
Security reporting details are provided in SECURITY.md.
