v0.1.0-rc.0
Pre-releaseThe first release candidate for Modelplane v0.1.0, and the first published Modelplane package.
At this point Modelplane can provision GKE or EKS clusters, or run on existing ones via a supplied kubeconfig. It serves models with any OpenAI-compatible engine (vLLM tested), running either single-node or multi-node across a LeaderWorkerSet gang, and schedules replicas across a multi-cluster GPU fleet using Dynamic Resource Allocation to bind GPUs. A ModelDeployment scales via spec.replicas, and a ModelService routes traffic across replicas behind one unified OpenAI-compatible endpoint. It can stage models onto a per-cluster cache ahead of serving, and supports disaggregated prefill/decode serving.
Install with the Crossplane CLI:
crossplane xpkg install configuration xpkg.upbound.io/modelplane/modelplane:v0.1.0-rc.0
See the getting started guide to deploy Modelplane and serve a model.