Skip to content

v0.15.0

Latest

Choose a tag to compare

@github-actions github-actions released this 15 Jun 19:54
· 82 commits to main since this release
Immutable release. Only release title and notes can be modified.
v0.15.0
915ed66

This release focuses on recipe health scoring, improved deployment validation, improved snapshot/discovery, and extending software supply chain capabilities for enterprise users.

Highlights

Recipe Structural Health - New pkg/health engine computes per-recipe health signals (chart_pinned, constraints_wellformed, declared_coverage) and rolls them up into a recipe-health matrix. aicr recipe list surfaces structural-health columns (with a --no-health opt-out), a tools/health generator and weekly recipe-health-refresh workflow keep the matrix current, and a lint guard now requires healthCheck.assertFile.

Improved Deployment Validation - The chainsaw deployment-phase runner is now an in-process executor rather than a shelled-out binary. aicr validate runs all phases by default with a --fail-fast opt-in, fails closed on evaluator errors, and is nil-safe across health checks.

Snapshot/Discovery - The collector now discovers GPU SKUs without nvidia-smi, removing the CUDA base image dependency and matching SKUs on token boundaries instead of substrings.

Closed Supply Chain - Signing and verification now work end-to-end in air-gapped and enterprise environments. aicr bundle supports KMS-backed signing (--signing-key) and private Sigstore deployments (--fulcio-url, --rekor-url); aicr verify --key validates bundles against a KMS or public key; and aicr evidence publish signs recipe evidence off-network. The recipe catalog itself now ships signed provenance for the V1 closed supply chain, and keyless signing warns before publishing identity to the public transparency log.

New Recipes & Overlays

  • A100 training Kubeflow overlay chains for EKS, AKS, GKE COS, and OKE
  • GB300 concrete EKS service-bound overlays
  • OKE GB200 and AKS H100 Dynamo performance checks

CLI & Bundling

  • aicr recipe list subcommand for catalog enumeration
  • Gatekeeper added as an optional component

Inference Performance & Validation

  • Inference-performance validation enhanced and tuned; gated on all worker services Ready
  • nccl-all-reduce-bw gates wired for EKS + H200; GKE NCCL node selector made dynamic
  • Bounded absent-resource retries in deployment-phase health checks

Thanks to @atif1996, @cdesiniotis, @dims, @haarchri, @JaydipGabani, @lalitadithya, @lockwobr, @njhensley, @pdmack, @pedjak, @rsd-darshan, @sttts, @xdu31, @yuanchen8911, and @mchmarny.

Changelog

New Features

Bug Fixes

Other Tasks