What's Changed
- fix(stateful): skip correct unhealth instances by @JasonHe-WQ in #169
- feat(kep): add new kep to refine configmap content and configuration by @JasonHe-WQ in #134
- refactor(rbg): add api v1alpha2 by @Syspretor in #167
- fix(e2e): Fixed sporadic end-to-end (e2e) failures caused by slow pod… by @Syspretor in #170
- refactor(helm): support crd upgrader by @Syspretor in #175
- refactor(rbgs): init rbgs apiVersion v1alpha2 by @Syspretor in #173
- fix(pod group): fix instance workload calculation bug by @JasonHe-WQ in #174
- 修复PD分离部署中,手动减少prefill和decode的replicas的时候,patio无法正确通知router删除路由的问题。 by @zhaoweimiao in #177
- chore(coordinated-policy): init coordinated policy crd with apiversio… by @Syspretor in #183
- chore(rbg): update v1alpha2 constants by @Syspretor in #184
- fix(role-instance): restart policy does not take effect by @Syspretor in #185
- chore(role-instance): init apiversion v1alpha2 for roleInstance and r… by @Syspretor in #186
- chore(scalingadapter): init scalingAdapter apiversion v1alpha2 by @Syspretor in #189
- chore(engine-runtime): init engine runtime profile apiversion v1alpha2 by @Syspretor in #190
- chore(role-instance): implement roleInstance and roleInstanceSet as reconcile version by @Syspretor in #191
- chore(rbg): implement v1alpha2 as storage version by @Syspretor in #188
- fix(ut): fix base cases ut failure by @Syspretor in #192
- refactor(rbg): support customComponentsPatterns in v1alpha2 version by @Syspretor in #193
- feature(rbg): support role instance index env by @Syspretor in #194
- fix(instance): fix updated generation by @JasonHe-WQ in #197
- chore(role-instance): add roleinstance related rbac by @Syspretor in #199
- fix(rbg): fix inject env related ut by @Syspretor in #200
- fix(rbg): fix reconciler related ut by @Syspretor in #201
- chore(coordinated-policies): add rbac for coordinatedpolicies crd by @Syspretor in #202
- Fix(e2e): add crd check by @Syspretor in #198
- fix(rbg): solve the problem of patch race by @Syspretor in #203
- feature(rbg): support gang scheduling in v1alpha2 by @Syspretor in #195
- Feature(role-instanceset): support create pod in parallel by @Syspretor in #204
- chore(role-instanceset): change spec podManagement field type by @Syspretor in #207
- Refactor/unify constants by @Syspretor in #208
- chore(makefile): support to generate rbac in make manifests by @Syspretor in #209
- refactor(coordinated-policy): change strategies api field by @Syspretor in #205
- refactor(rbgs): change spec api field by @Syspretor in #206
- Ensure the inference service can be correctly re-register after the router is restarted through patio heartbeat by @zhaoweimiao in #187
- fix(env): fix missing env by @JasonHe-WQ in #214
- Change version of Makefile to 0.7.0 by @cheyang in #215
- fix(coordinated-policy): fix progression filed enum validation by @Syspretor in #217
- fix(rbg): fix rbg delete failed in foreground deletion by @Syspretor in #219
- [PATCH] feat: add arm64 multi-arch docker publishing by @cheyang in #218
- fix: generated manifests missing latest CRD by @Syspretor in #222
- fix(role-instance): fix update role instance status conflict by @Syspretor in #220
- Build: update rbg helm chart 0.7.0-alpha.1 by @cheyang in #224
- chore: add copyright 2026 by @Syspretor in #229
- Check copyright by @cheyang in #230
- fix(rbgs): missing re-generate client code for rbgs by @Syspretor in #236
- chore: add check for generated client-code by @Syspretor in #237
- refactor(coordinated-policy): add name field in policy by @Syspretor in #235
- chore: add check for yaml lint by @Syspretor in #238
- Controllerrevisions default by @JasonHe-WQ in #234
- update aiconfigurator dependency by @diw-zw in #228
- Update xxx-copyright-to-pr-go-files.sh - ignore vendor/ by @diw-zw in #239
- Cli implement by @diw-zw in #182
- fix: some typo and wrong kubebuilder comments by @Syspretor in #244
- fix(rbgsa-controller): use time.Second for RequeueAfter durations by @sebest in #240
- feat(scheduler): inherit podGroup annotations by @JasonHe-WQ in #233
- chore: update copyright header by @Syspretor in #245
- fix(rbg-controller): watch owned RoleBasedGroupScalingAdapter resources by @sebest in #241
- feat: pod port allocator (kep 171) by @NoobDream2568 in #210
- fix(port-allocator): add startup args to charts by @NoobDream2568 in #249
- feat(rbg): support enabling portAllocator on demand by @Syspretor in #250
- Build: update rbg helm chart 0.7.0-alpha.2 by @cheyang in #253
- Chore(rbg): add v1alpha2 related examples by @Syspretor in #231
- chore: use ARCH and remove TARGETARCH fallback by @NoobDream2568 in #256
- Security issue: unsafe json string in pulling job's annotations; unsa… by @diw-zw in #251
- chore(community): add examples for dynamo by @Syspretor in #255
- build: support patio arm64 image by @TrafalgarZZZ in #258
- Deal with codeQL alert: Uncontrolled data used in path expression by @diw-zw in #257
- feat(scaling-adapter): add Labels field for user-specified RBGSA labels by @sebest in #242
- Enhance service reconciler by @lx1036 in #252
- fix(rbgsa-controller): use Warning event type for failure events by @sebest in #262
- fix(readiness): re-evaluate pod Ready condition in removeNotReadyKey by @sebest in #246
- feat: support user-defined model configuration by @diw-zw in #259
- chore: disable unparam linter by @Syspretor in #266
- feat(rbg): support use templateRef in lwp by @Syspretor in #254
- feat(scaling-adapter): add readyReplicas to RBGSA with sole-writer pattern by @sebest in #243
- v1alpha2 conversion webhook by @diw-zw in #225
- chore: add examples for mooncake integration with v1alpha2 api by @Syspretor in #272
- rbgcli: adds multi-node LLM inference serving support by @diw-zw in #265
- build(deps): bump github.com/moby/spdystream from 0.5.0 to 0.5.1 by @dependabot[bot] in #278
- fix(rbgsa): api conversion failed caused by fields missing by @Syspretor in #283
- [CLI] Secure OSS secrets in cluster, fix engine port args, and add Qwen3.6 model by @diw-zw in #279
- refactor: remove workload field in v1alpha2 by @Syspretor in #281
- fix: deal with injectMetadataSave properly by @diw-zw in #286
- fix(cli): extractRBGStatus wrong by @diw-zw in #290
- chore: add cases for LWP env by @Syspretor in #287
- feat(cli): enhance llm svc run with two new override flags by @diw-zw in #289
- fix(rbgsa-controller): preserve and initialize readyReplicas by @sebest in #280
- fix(roleinstance): nil pointer dereference in in-place update by @sebest in #274
- Copilot/fix env vars order by @cheyang in #282
- chore(rbgs): update to 0.7.0-alpha.3 by @cheyang in #291
- docs: update README with latest releases and PD disaggregation example by @Syspretor in #292
- feat(cli): allow custom all fields without pre-build model config by @diw-zw in #293
- chore: skip e2e test for cli changes by @Syspretor in #294
- docs: update documentation for v1alpha2 API by @Syspretor in #295
- Chore/update readme by @Syspretor in #297
- docs: update documents for rbg cli by @diw-zw in #296
- chore: update readme by @Syspretor in #298
- chore: skip e2e tests for non-code file changes by @Syspretor in #299
- feat: add --dev flag for local controller debugging by @Syspretor in #288
- docs: fix Deployment Patterns, Ecosystem descriptions and Mooncake fi… by @Syspretor in #300
- chore: rename run-dev to run-local and update developer guide by @Syspretor in #301
- chore(doc): update install ways in readme by @Syspretor in #311
- ci: add manifest-based E2E test for deploy changes by @Syspretor in #312
- chore(doc): update readme for cli by @Syspretor in #313
- ci: use server-side apply for manifest E2E and remove image build by @Syspretor in #314
- feat(cli): get storage/source/engine details by @diw-zw in #303
- feat(cli): add command 'llm svc model-configs' to list builtin/custom… by @diw-zw in #318
- Fix helm generate issue by @cheyang in #305
- fix(e2e): enable portallocator in e2e-manifest by @Syspretor in #321
- fix(controller): prevent RestartInProgress condition loss due to stal… by @Syspretor in #324
- fix(e2e): ensure manifests contain conversion webhook configuration by @Syspretor in #323
- fix: prevent RestartInProgress condition loss during RBG restart by @Syspretor in #325
- feat(cli): auto-benchmark ui by @diw-zw in #320
- feat(cli): new command - auto-benchmark by @diw-zw in #319
- feat(cli): enhancements to the auto-benchmark search algorithms by @diw-zw in #327
- feat(ui): enhances the benchmark viewer UI according to #327 by @diw-zw in #328
- cli: add --memory and --cpu flag to 'llm model pull' by @diw-zw in #326
- Remove cli by @diw-zw in #330
- fix(rbg): handle inactive pod states (Evicted/Failed) for RBG by @Syspretor in #329
- chore(rbg): deprecate and remove RecreateRBGOnPodRestart restart policy by @Syspretor in #340
- fix(rbg): handle inactive pod recreation and clean up dead code by @Syspretor in #337
- fix(rbg): use status.Replicas instead of spec.Replicas for RBG role s… by @Syspretor in #338
- fix(controller): fix stateful RoleInstanceSet surge rolling update by @NoobDream2568 in #333
- fix(controller): clean up stale RoleInstances on stateful/stateless pattern switch by @NoobDream2568 in #334
- feat: change default restartPolicy to RecreateRoleInstanceOnPodRestart by @Syspretor in #331
- fix(e2e): use RetryOnConflict for LWP update in revision test by @Syspretor in #344
- feat(rbg): enhanced customComponentsPatten lifcycle management by @Syspretor in #316
- feat(rbg): add leader only service by @JasonHe-WQ in #273
- feat: enhance networking to support per replica service by @JasonHe-WQ in #276
- feat: add annotation to exclude components from triggering restart po… by @Syspretor in #332
- feat: add kube-api-qps and kube-api-burst flags for API client tuning by @Syspretor in #347
- feat: add --enable-pprof flag for runtime profiling by @Syspretor in #346
- test(e2e): add v1alpha2 stability and update coverage by @Syspretor in #345
- chore(rbgs): update Helm chart and default image versions to 0.7.0-al… by @cheyang in #350
- fix: prevent shouldRecreateInstance from triggering during updates by @Syspretor in #349
- chore: sync deploy/kubectl/manifests.yaml during make manifests by @Syspretor in #355
- test: add comprehensive envtest and unit tests for restart policy by @Syspretor in #354
- refactor: move restartPolicy from RoleSpec to pattern types by @Syspretor in #357
- fix: update restart policy tests to match pattern-level RestartPolicy API by @Syspretor in #360
- chore(rbgs): update Helm chart and default image versions to 0.7.0 by @cheyang in #362
New Contributors
- @JasonHe-WQ made their first contribution in #169
- @diw-zw made their first contribution in #228
- @sebest made their first contribution in #240
- @NoobDream2568 made their first contribution in #210
- @lx1036 made their first contribution in #252
- @dependabot[bot] made their first contribution in #278
Full Changelog: v0.6.0...v0.7.0