v0.2.0
Production Stack v0.2.0
Container image
ghcr.io/kaito-project/gpu-node-mocker:0.2.0
Helm charts
Add the chart repository (once):
helm repo add production-stack https://kaito-project.github.io/production-stack/charts/kaito-project
helm repo update production-stack
The following charts are published from this release (versions taken
from each chart's Chart.yaml at this tag):
production-stack/gpu-node-mockerproduction-stack/modeldeploymentproduction-stack/modelharness
See README.md for installation steps.
What's Changed
- fix release step by @rambohe-ch in #54
- test(e2e): probe EPP pod for netpol deny assertions by @tnsimon in #53
- e2e: add provider switch (upstream/azure), bump keda-kaito-scaler to v0.5.1, instrument timings by @rambohe-ch in #50
- fix nightly e2e workflow by @rambohe-ch in #48
- chore: Surface error when docker or podman is not installed by @techworldhello in #56
- chore: add workflow to push helm charts to mcr by @t0rr3sp3dr0 in #63
- Replace model-not-found Service with Envoy direct_response EnvoyFilter by @rambohe-ch in #60
- feat: separate image preparation from cluster setup for accurate E2E timing by @rambohe-ch in #61
- feat: Garbage collector for GPU mocker by @techworldhello in #58
- feat: upgrade readme according to the latest helm chart by @rambohe-ch in #64
- Refactor E2E install around the productionstack umbrella chart by @rambohe-ch in #67
- Generate E2E coverage report by @techworldhello in #73
- Improve networkpolicy harness by @tnsimon in #57
- Pin BBR and keda-kaito-scaler versions in productionstack chart by @rambohe-ch in #69
- Make nightly env setup consistent to e2e by @techworldhello in #75
- test(e2e/netpol): dump canary pod sidecar state on enforcement precheck failure by @tnsimon in #77
- feat(productionstack): add llm-gateway-apikey as OCI Helm dependency by @tnsimon in #82
- Proposal: add End-to-End Error Handling Across Cluster, Modelharness, and Modeldeployment Levels by @rambohe-ch in #76
- fix label filters for report data by @techworldhello in #81
- fix(modelharness): scope NetworkPolicy selector to production-stack pods (#83) by @tnsimon in #84
- docs: recommend Cilium dataplane; update NetworkPolicy → CiliumNetworkPolicy references by @tnsimon in #85
- chore: bump llm-gateway-apikey to 0.0.10-alpha by @tnsimon in #93
- add highly-available requirements for llm-gateway-auth and bbr by @rambohe-ch in #86
- feat(bbr): harden body-based-router for HA and add e2e coverage (#89) by @rambohe-ch in #95
- chore: bump llm-gateway-apikey to 0.0.11-alpha; per-namespace ext_authz EnvoyFilter by @tnsimon in #97
- feat(charts/productionstack): fail-closed cluster filters + unified outage local_reply by @rambohe-ch in #94
- release: bump charts to 0.2.0 and publish productionstack by @rambohe-ch in #98
New Contributors
- @t0rr3sp3dr0 made their first contribution in #63
Full Changelog: v0.1.0...v0.2.0