Skip to content

v0.1.0-rc.0

Pre-release
Pre-release

Choose a tag to compare

@negz negz released this 16 Jun 17:50
· 293 commits to main since this release
95d06a3

The first release candidate for Modelplane v0.1.0, and the first published Modelplane package.

At this point Modelplane can provision GKE or EKS clusters, or run on existing ones via a supplied kubeconfig. It serves models with any OpenAI-compatible engine (vLLM tested), running either single-node or multi-node across a LeaderWorkerSet gang, and schedules replicas across a multi-cluster GPU fleet using Dynamic Resource Allocation to bind GPUs. A ModelDeployment scales via spec.replicas, and a ModelService routes traffic across replicas behind one unified OpenAI-compatible endpoint. It can stage models onto a per-cluster cache ahead of serving, and supports disaggregated prefill/decode serving.

Install with the Crossplane CLI:

crossplane xpkg install configuration xpkg.upbound.io/modelplane/modelplane:v0.1.0-rc.0

See the getting started guide to deploy Modelplane and serve a model.