Releases: kaito-project/production-stack

v0.2.1

16 Jun 06:49
c6a4220

Production Stack v0.2.1

Container image

ghcr.io/kaito-project/gpu-node-mocker:0.2.1

Helm charts

Add the chart repository (once):

helm repo add production-stack https://kaito-project.github.io/production-stack/charts/kaito-project
helm repo update production-stack

The following charts are published from this release (versions taken
from each chart's Chart.yaml at this tag):

  • production-stack/gpu-node-mocker
  • production-stack/modeldeployment
  • production-stack/modelharness

See README.md for installation steps.
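
As a rough sketch of how one of these charts might be installed at a pinned version, the script below assembles a `helm install` command for `production-stack/gpu-node-mocker`. The release name, the namespace, and the assumption that the chart version equals the release tag (0.2.1) are illustrative and not confirmed by these notes; README.md remains the authoritative source for installation steps. The script only prints the command so it can be reviewed before it is run.

```shell
#!/bin/sh
# Sketch only: builds (does not execute) a pinned helm install command.
# Assumption: the chart version matches the release tag; check Chart.yaml.
CHART_VERSION="0.2.1"
# Hypothetical placeholders, not taken from the release notes.
RELEASE_NAME="gpu-node-mocker"
NAMESPACE="production-stack"

# Chart reference uses the repository alias added with `helm repo add` above.
CHART_REF="production-stack/gpu-node-mocker"

# --version pins the chart version; --create-namespace creates the
# namespace if it does not exist yet.
INSTALL_CMD="helm install ${RELEASE_NAME} ${CHART_REF} --version ${CHART_VERSION} --namespace ${NAMESPACE} --create-namespace"

# Print the command for review; run it manually once it looks right.
echo "${INSTALL_CMD}"
```

Pinning with `--version` keeps the install reproducible even after later chart releases are published to the same repository.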


What's Changed

  • feat(charts): add gateway error-mapping, ownership labels, and namespace management by @rambohe-ch in #99
  • feat(charts): consolidate whole-path outage local_reply with component-first attribution by @rambohe-ch in #100
  • release: bump charts to 0.2.1 by @rambohe-ch in #101

Full Changelog: v0.2.0...v0.2.1

v0.2.0

11 Jun 04:42
bdca618

Production Stack v0.2.0

Container image

ghcr.io/kaito-project/gpu-node-mocker:0.2.0

Helm charts

Add the chart repository (once):

helm repo add production-stack https://kaito-project.github.io/production-stack/charts/kaito-project
helm repo update production-stack

The following charts are published from this release (versions taken
from each chart's Chart.yaml at this tag):

  • production-stack/gpu-node-mocker
  • production-stack/modeldeployment
  • production-stack/modelharness

See README.md for installation steps.


What's Changed

  • fix release step by @rambohe-ch in #54
  • test(e2e): probe EPP pod for netpol deny assertions by @tnsimon in #53
  • e2e: add provider switch (upstream/azure), bump keda-kaito-scaler to v0.5.1, instrument timings by @rambohe-ch in #50
  • fix nightly e2e workflow by @rambohe-ch in #48
  • chore: Surface error when docker or podman is not installed by @techworldhello in #56
  • chore: add workflow to push helm charts to mcr by @t0rr3sp3dr0 in #63
  • Replace model-not-found Service with Envoy direct_response EnvoyFilter by @rambohe-ch in #60
  • feat: separate image preparation from cluster setup for accurate E2E timing by @rambohe-ch in #61
  • feat: Garbage collector for GPU mocker by @techworldhello in #58
  • feat: upgrade readme according to the latest helm chart by @rambohe-ch in #64
  • Refactor E2E install around the productionstack umbrella chart by @rambohe-ch in #67
  • Generate E2E coverage report by @techworldhello in #73
  • Improve networkpolicy harness by @tnsimon in #57
  • Pin BBR and keda-kaito-scaler versions in productionstack chart by @rambohe-ch in #69
  • Make nightly env setup consistent to e2e by @techworldhello in #75
  • test(e2e/netpol): dump canary pod sidecar state on enforcement precheck failure by @tnsimon in #77
  • feat(productionstack): add llm-gateway-apikey as OCI Helm dependency by @tnsimon in #82
  • Proposal: add End-to-End Error Handling Across Cluster, Modelharness, and Modeldeployment Levels by @rambohe-ch in #76
  • fix label filters for report data by @techworldhello in #81
  • fix(modelharness): scope NetworkPolicy selector to production-stack pods (#83) by @tnsimon in #84
  • docs: recommend Cilium dataplane; update NetworkPolicy → CiliumNetworkPolicy references by @tnsimon in #85
  • chore: bump llm-gateway-apikey to 0.0.10-alpha by @tnsimon in #93
  • add highly-available requirements for llm-gateway-auth and bbr by @rambohe-ch in #86
  • feat(bbr): harden body-based-router for HA and add e2e coverage (#89) by @rambohe-ch in #95
  • chore: bump llm-gateway-apikey to 0.0.11-alpha; per-namespace ext_authz EnvoyFilter by @tnsimon in #97
  • feat(charts/productionstack): fail-closed cluster filters + unified outage local_reply by @rambohe-ch in #94
  • release: bump charts to 0.2.0 and publish productionstack by @rambohe-ch in #98

Full Changelog: v0.1.0...v0.2.0

v0.1.0

11 May 12:34
b597706

Production Stack v0.1.0

Container image

ghcr.io/kaito-project/gpu-node-mocker:0.1.0

Helm charts

Add the chart repository (once):

helm repo add production-stack https://kaito-project.github.io/production-stack/charts/kaito-project
helm repo update production-stack

The following charts are published from this release (versions taken
from each chart's Chart.yaml at this tag):

  • production-stack/gpu-node-mocker
  • production-stack/modeldeployment
  • production-stack/modelharness

See README.md for installation steps.


Full Changelog: https://github.com/kaito-project/production-stack/commits/v0.1.0