v0.7.0

@Syspretor Syspretor released this 04 Jun 11:31
· 9 commits to main since this release
01df6fc

What's Changed

  • fix(stateful): skip correct unhealthy instances by @JasonHe-WQ in #169
  • feat(kep): add new kep to refine configmap content and configuration by @JasonHe-WQ in #134
  • refactor(rbg): add api v1alpha2 by @Syspretor in #167
  • fix(e2e): Fixed sporadic end-to-end (e2e) failures caused by slow pod… by @Syspretor in #170
  • refactor(helm): support crd upgrader by @Syspretor in #175
  • refactor(rbgs): init rbgs apiVersion v1alpha2 by @Syspretor in #173
  • fix(pod group): fix instance workload calculation bug by @JasonHe-WQ in #174
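  • Fix the issue where, in PD-disaggregated deployments, patio fails to correctly notify the router to delete routes when the prefill and decode replicas are manually reduced. by @zhaoweimiao in #177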
  • chore(coordinated-policy): init coordinated policy crd with apiversio… by @Syspretor in #183
  • chore(rbg): update v1alpha2 constants by @Syspretor in #184
  • fix(role-instance): restart policy does not take effect by @Syspretor in #185
  • chore(role-instance): init apiversion v1alpha2 for roleInstance and r… by @Syspretor in #186
  • chore(scalingadapter): init scalingAdapter apiversion v1alpha2 by @Syspretor in #189
  • chore(engine-runtime): init engine runtime profile apiversion v1alpha2 by @Syspretor in #190
  • chore(role-instance): implement roleInstance and roleInstanceSet as reconcile version by @Syspretor in #191
  • chore(rbg): implement v1alpha2 as storage version by @Syspretor in #188
  • fix(ut): fix base cases ut failure by @Syspretor in #192
  • refactor(rbg): support customComponentsPatterns in v1alpha2 version by @Syspretor in #193
  • feature(rbg): support role instance index env by @Syspretor in #194
  • fix(instance): fix updated generation by @JasonHe-WQ in #197
  • chore(role-instance): add roleinstance related rbac by @Syspretor in #199
  • fix(rbg): fix inject env related ut by @Syspretor in #200
  • fix(rbg): fix reconciler related ut by @Syspretor in #201
  • chore(coordinated-policies): add rbac for coordinatedpolicies crd by @Syspretor in #202
  • Fix(e2e): add crd check by @Syspretor in #198
  • fix(rbg): solve the problem of patch race by @Syspretor in #203
  • feature(rbg): support gang scheduling in v1alpha2 by @Syspretor in #195
  • Feature(role-instanceset): support create pod in parallel by @Syspretor in #204
  • chore(role-instanceset): change spec podManagement field type by @Syspretor in #207
  • Refactor/unify constants by @Syspretor in #208
  • chore(makefile): support to generate rbac in make manifests by @Syspretor in #209
  • refactor(coordinated-policy): change strategies api field by @Syspretor in #205
  • refactor(rbgs): change spec api field by @Syspretor in #206
  • Ensure the inference service can correctly re-register after the router is restarted through patio heartbeat by @zhaoweimiao in #187
  • fix(env): fix missing env by @JasonHe-WQ in #214
  • Change version of Makefile to 0.7.0 by @cheyang in #215
  • fix(coordinated-policy): fix progression field enum validation by @Syspretor in #217
  • fix(rbg): fix rbg delete failed in foreground deletion by @Syspretor in #219
  • [PATCH] feat: add arm64 multi-arch docker publishing by @cheyang in #218
  • fix: generated manifests missing latest CRD by @Syspretor in #222
  • fix(role-instance): fix update role instance status conflict by @Syspretor in #220
  • Build: update rbg helm chart 0.7.0-alpha.1 by @cheyang in #224
  • chore: add copyright 2026 by @Syspretor in #229
  • Check copyright by @cheyang in #230
  • fix(rbgs): missing re-generate client code for rbgs by @Syspretor in #236
  • chore: add check for generated client-code by @Syspretor in #237
  • refactor(coordinated-policy): add name field in policy by @Syspretor in #235
  • chore: add check for yaml lint by @Syspretor in #238
  • Controllerrevisions default by @JasonHe-WQ in #234
  • update aiconfigurator dependency by @diw-zw in #228
  • Update xxx-copyright-to-pr-go-files.sh - ignore vendor/ by @diw-zw in #239
  • Cli implement by @diw-zw in #182
  • fix: some typo and wrong kubebuilder comments by @Syspretor in #244
  • fix(rbgsa-controller): use time.Second for RequeueAfter durations by @sebest in #240
  • feat(scheduler): inherit podGroup annotations by @JasonHe-WQ in #233
  • chore: update copyright header by @Syspretor in #245
  • fix(rbg-controller): watch owned RoleBasedGroupScalingAdapter resources by @sebest in #241
  • feat: pod port allocator (kep 171) by @NoobDream2568 in #210
  • fix(port-allocator): add startup args to charts by @NoobDream2568 in #249
  • feat(rbg): support enabling portAllocator on demand by @Syspretor in #250
  • Build: update rbg helm chart 0.7.0-alpha.2 by @cheyang in #253
  • Chore(rbg): add v1alpha2 related examples by @Syspretor in #231
  • chore: use ARCH and remove TARGETARCH fallback by @NoobDream2568 in #256
  • Security issue: unsafe json string in pulling job's annotations; unsa… by @diw-zw in #251
  • chore(community): add examples for dynamo by @Syspretor in #255
  • build: support patio arm64 image by @TrafalgarZZZ in #258
  • Deal with codeQL alert: Uncontrolled data used in path expression by @diw-zw in #257
  • feat(scaling-adapter): add Labels field for user-specified RBGSA labels by @sebest in #242
  • Enhance service reconciler by @lx1036 in #252
  • fix(rbgsa-controller): use Warning event type for failure events by @sebest in #262
  • fix(readiness): re-evaluate pod Ready condition in removeNotReadyKey by @sebest in #246
  • feat: support user-defined model configuration by @diw-zw in #259
  • chore: disable unparam linter by @Syspretor in #266
  • feat(rbg): support use templateRef in lwp by @Syspretor in #254
  • feat(scaling-adapter): add readyReplicas to RBGSA with sole-writer pattern by @sebest in #243
  • v1alpha2 conversion webhook by @diw-zw in #225
  • chore: add examples for mooncake integration with v1alpha2 api by @Syspretor in #272
  • rbgcli: adds multi-node LLM inference serving support by @diw-zw in #265
  • build(deps): bump github.com/moby/spdystream from 0.5.0 to 0.5.1 by @dependabot[bot] in #278
  • fix(rbgsa): api conversion failed caused by fields missing by @Syspretor in #283
  • [CLI] Secure OSS secrets in cluster, fix engine port args, and add Qwen3.6 model by @diw-zw in #279
  • refactor: remove workload field in v1alpha2 by @Syspretor in #281
  • fix: deal with injectMetadataSave properly by @diw-zw in #286
  • fix(cli): extractRBGStatus wrong by @diw-zw in #290
  • chore: add cases for LWP env by @Syspretor in #287
  • feat(cli): enhance llm svc run with two new override flags by @diw-zw in #289
  • fix(rbgsa-controller): preserve and initialize readyReplicas by @sebest in #280
  • fix(roleinstance): nil pointer dereference in in-place update by @sebest in #274
  • Copilot/fix env vars order by @cheyang in #282
  • chore(rbgs): update to 0.7.0-alpha.3 by @cheyang in #291
  • docs: update README with latest releases and PD disaggregation example by @Syspretor in #292
  • feat(cli): allow customizing all fields without a pre-built model config by @diw-zw in #293
  • chore: skip e2e test for cli changes by @Syspretor in #294
  • docs: update documentation for v1alpha2 API by @Syspretor in #295
  • Chore/update readme by @Syspretor in #297
  • docs: update documents for rbg cli by @diw-zw in #296
  • chore: update readme by @Syspretor in #298
  • chore: skip e2e tests for non-code file changes by @Syspretor in #299
  • feat: add --dev flag for local controller debugging by @Syspretor in #288
  • docs: fix Deployment Patterns, Ecosystem descriptions and Mooncake fi… by @Syspretor in #300
  • chore: rename run-dev to run-local and update developer guide by @Syspretor in #301
  • chore(doc): update install ways in readme by @Syspretor in #311
  • ci: add manifest-based E2E test for deploy changes by @Syspretor in #312
  • chore(doc): update readme for cli by @Syspretor in #313
  • ci: use server-side apply for manifest E2E and remove image build by @Syspretor in #314
  • feat(cli): get storage/source/engine details by @diw-zw in #303
  • feat(cli): add command 'llm svc model-configs' to list builtin/custom… by @diw-zw in #318
  • Fix helm generate issue by @cheyang in #305
  • fix(e2e): enable portallocator in e2e-manifest by @Syspretor in #321
  • fix(controller): prevent RestartInProgress condition loss due to stal… by @Syspretor in #324
  • fix(e2e): ensure manifests contain conversion webhook configuration by @Syspretor in #323
  • fix: prevent RestartInProgress condition loss during RBG restart by @Syspretor in #325
  • feat(cli): auto-benchmark ui by @diw-zw in #320
  • feat(cli): new command - auto-benchmark by @diw-zw in #319
  • feat(cli): enhancements to the auto-benchmark search algorithms by @diw-zw in #327
  • feat(ui): enhances the benchmark viewer UI according to #327 by @diw-zw in #328
  • cli: add --memory and --cpu flag to 'llm model pull' by @diw-zw in #326
  • Remove cli by @diw-zw in #330
  • fix(rbg): handle inactive pod states (Evicted/Failed) for RBG by @Syspretor in #329
  • chore(rbg): deprecate and remove RecreateRBGOnPodRestart restart policy by @Syspretor in #340
  • fix(rbg): handle inactive pod recreation and clean up dead code by @Syspretor in #337
  • fix(rbg): use status.Replicas instead of spec.Replicas for RBG role s… by @Syspretor in #338
  • fix(controller): fix stateful RoleInstanceSet surge rolling update by @NoobDream2568 in #333
  • fix(controller): clean up stale RoleInstances on stateful/stateless pattern switch by @NoobDream2568 in #334
  • feat: change default restartPolicy to RecreateRoleInstanceOnPodRestart by @Syspretor in #331
  • fix(e2e): use RetryOnConflict for LWP update in revision test by @Syspretor in #344
  • feat(rbg): enhanced customComponentsPattern lifecycle management by @Syspretor in #316
  • feat(rbg): add leader only service by @JasonHe-WQ in #273
  • feat: enhance networking to support per replica service by @JasonHe-WQ in #276
  • feat: add annotation to exclude components from triggering restart po… by @Syspretor in #332
  • feat: add kube-api-qps and kube-api-burst flags for API client tuning by @Syspretor in #347
  • feat: add --enable-pprof flag for runtime profiling by @Syspretor in #346
  • test(e2e): add v1alpha2 stability and update coverage by @Syspretor in #345
  • chore(rbgs): update Helm chart and default image versions to 0.7.0-al… by @cheyang in #350
  • fix: prevent shouldRecreateInstance from triggering during updates by @Syspretor in #349
  • chore: sync deploy/kubectl/manifests.yaml during make manifests by @Syspretor in #355
  • test: add comprehensive envtest and unit tests for restart policy by @Syspretor in #354
  • refactor: move restartPolicy from RoleSpec to pattern types by @Syspretor in #357
  • fix: update restart policy tests to match pattern-level RestartPolicy API by @Syspretor in #360
  • chore(rbgs): update Helm chart and default image versions to 0.7.0 by @cheyang in #362

New Contributors

Full Changelog: v0.6.0...v0.7.0