feat: move x-k8s to apix and add v1 InferencePool to api/v1 #1116

capri-xiyue · 2025-07-07T20:21:14Z

move inference.networking.x-k8s.io resources from /api to /apix
add inference.networking.x-k8s.io resources in /api

k8s-ci-robot · 2025-07-07T20:21:23Z

Hi @capri-xiyue. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

netlify · 2025-07-07T20:21:25Z

✅ Deploy Preview for gateway-api-inference-extension ready!

Name	Link
🔨 Latest commit	`70de454`
🔍 Latest deploy log	https://app.netlify.com/projects/gateway-api-inference-extension/deploys/686c2c3cd35e6f000869b255
😎 Deploy Preview	https://deploy-preview-1116--gateway-api-inference-extension.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

capri-xiyue · 2025-07-07T20:21:43Z

/assign @robscott

capri-xiyue · 2025-07-07T20:34:09Z

It's ready for review. I added DO-NOT-MERGE because this PR should be merged in v1 branch

capri-xiyue · 2025-07-07T21:07:25Z

It's ready for review. I added DO-NOT-MERGE because this PR should be merged in v1 branch

Change the base to be kubernetes-sigs:release-1.0

kfswain · 2025-07-07T21:10:18Z

/hold

Adding this label until we have confirmation that we can merge this PR without sig-net approval (or getting sig-net approval)

kfswain · 2025-07-07T23:14:56Z

/ok-to-test

capri-xiyue · 2025-07-07T23:33:50Z

/retest

capri-xiyue · 2025-07-08T00:20:12Z

/retest

capri-xiyue · 2025-07-08T18:07:48Z

/retest

capri-xiyue · 2025-07-08T19:23:38Z

/hold

Adding this label until we have confirmation that we can merge this PR without sig-net approval (or getting sig-net approval)

Currently I added the annotation api-approved.kubernetes.io=unapproved, experimental-only which matches the criteria defined in https://github.com/kubernetes/enhancements/pull/1111/files, I guess it should be ok to get the PR merged.

capri-xiyue · 2025-07-08T20:50:26Z

/assign @zetxqx

k8s-ci-robot · 2025-07-08T20:50:30Z

@capri-xiyue: GitHub didn't allow me to assign the following users: zetxqx.

Note that only kubernetes-sigs members with read permissions, repo collaborators and people who have commented on this issue/PR can be assigned. Additionally, issues/PRs can only have 10 assignees at the same time.
For more information please see the contributor guide

In response to this:

/assign @zetxqx

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

zetxqx

looks good to me! just one comment

config/crd/kustomization.yaml

k8s-ci-robot · 2025-07-09T18:32:16Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: capri-xiyue, zetxqx
Once this PR has been reviewed and has the lgtm label, please assign terrytangyuan for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

capri-xiyue · 2025-07-10T16:10:01Z

I followed https://github.com/kubernetes-sigs/gateway-api pattern to have api-x and api directory, otherwise make generate failed as it seems like it can't support different group CRD with the same name.

robscott

Thanks @capri-xiyue!

api/v1/inferencepool_types.go

robscott · 2025-07-11T16:58:36Z

apix/config/v1alpha1/endpointpickerconfig_types.go

These should be updated to use aliases pointing to the v1 types. See https://github.com/kubernetes-sigs/gateway-api/blob/093f6538409c071e906856bd1ce4072204fb3f08/apis/v1beta1/gateway_types.go#L48-L55 for an example.

The existing make generate can't work with such alias usage, it will throw non-local type error. I guess it is because gateway-api use https://github.com/kubernetes-sigs/gateway-api/blob/main/hack/update-clientset.sh while this repo uses kube-codegen natively. I'm wondering should we move to gateway-api self-written codegen to support alias or we are fine with native kube-codegen with non-alias

change from kube-codegen to self-written codegen like gateway api to support alias usage like type EndpointPickerConfig = v1.EndpointPickerConfig

pros: avoid type duplication

cons: need to maintain self-written codegen

keep existing kube-codegen usage but need to duplicate the InferencePool types in alpha and v1

pros: use kube-codegen and can get the latest code-gen feature without self-development

con: duplicate the v1 and alpha type for now

cc @kfswain

I'm not familiar with GW API's custom code gen tool so I'm not quite sure what tradeoffs we would be making. Generally speaking, we do use a pretty slimmed down version of kubebuilders codegen tooling.

And to share more context: we might have need to define a controller for the evolution multi-tenant APIs. So IGW and GW-API differ in that regard where GW API is primarily a shared API surface w/ conformance spec. IGW has the EPP and potentially an additional controller, so we might have different code gen needs.

I understand that isn't a clear decision, just sharing additional context to help with decision making. Ultimately, I think you @capri-xiyue and @robscott would know the tradeoffs between the two tools best. I can help more if needed, but would need a tl;dr for what each choice entails.

@kfswain
Here are the trade-off I think(also commented above)

change from kube-codegen to self-written codegen like gateway api to support alias usage like type EndpointPickerConfig = v1.EndpointPickerConfig

pros: avoid type duplication, can use alias like type EndpointPickerConfig = v1.EndpointPickerConfig

cons: need to maintain self-written codegen

keep existing kube-codegen usage but need to duplicate the InferencePool types in alpha and v1

pros: use kube-codegen and can get the latest code-gen feature without self-development

con: duplicate the v1 and alpha type for now

Do you need additional info? For me, I think if gateway-api-inference-extension and gate-way-api may have different code gen needs, I prefer to keep using the existing slimmed down version of kubebuilders codegen tooling with which we can easily get the kubebuilder codegen latest feature easily without self development, the only cons is that we duplicate the CRD golang type in different versions, I think it is fine as I've seen such pattern commonly used in other k8s extension OSS repo, see events/v1 and events/v1beta1

Discussed offline, @capri-xiyue and I agree that keeping the existing kubebuilder codegen tool is a better path forward, at the cost of duplicating the InferencePool spec into 2 different places.

Thanks @capri-xiyue!

@robscott suggested that we won't need deepcopy code like api/v1alpha2/zz_generated.deepcopy.go, therefore we can still use alias like type EndpointPickerConfig = v1.EndpointPickerConfig. As without deepcopy code api/v1alpha2/zz_generated.deepcopy.go, we won't hit the limitation of k8s codegen. I tried it, but still no luck as our repo uses the deepcopy code, see https://prow.k8s.io/view/gs/kubernetes-ci-logs/pr-logs/pull/kubernetes-sigs_gateway-api-inference-extension/1116/pull-gateway-api-inference-extension-test-e2e-main/1943761093448962048. Will continue the discussion with Rob.

At the same time I have filed an issue in kubernetes/code-generator#187 to get the confirmation from the code-generator side to see whether they have such support or not. I'm wondering for GA, can we tolerate the non-alias now and see later we can achieve using k8s-codegen natively and at the same time using alias? I searched the existing usage of k8s code generator. I have not found the usage of such alias. See https://github.com/kubernetes/api/blob/master/events/v1/types.go, https://github.com/kubernetes/api/blob/master/events/v1beta1/types.go. I guess the code-gen probably doesn't have such alias support right now.

Spent a few hours working together with Rob to see whether we can use existing kube codegen in this repo and at the same time with GW api types definition like type InferencePool v1.InferencePool. However, it will eventually lose alpha InferencePool CRD. Rob suggested v1.0 exclusively includes v1 InferencePool with no alpha version of InferencePool type or CRD, release channels come in to play later and will raise it in next OSS meeting.

For now, I will duplicate the alpha golang types for InferencePool to unblock @zetxqx GA efforts.

Co-authored-by: Rob Scott <rob.scott87@gmail.com>

capri-xiyue · 2025-07-11T19:00:32Z

Currently it failed to pass the test because of #1116 (comment).

capri-xiyue · 2025-07-12T00:43:32Z

Currently it failed to pass the test because of #1116 (comment).

Revert it back to make it work.

rename x-k8s to apix and add v1 InferencePool to api/v1

70de454

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Jul 7, 2025

k8s-ci-robot requested review from kfswain and nirrozenbaum July 7, 2025 20:21

k8s-ci-robot added the needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. label Jul 7, 2025

k8s-ci-robot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Jul 7, 2025

k8s-ci-robot assigned robscott Jul 7, 2025

capri-xiyue changed the title ~~feat: move x-k8s to apix and add v1 InferencePool to api/v1~~ [WIP] feat: move x-k8s to apix and add v1 InferencePool to api/v1 Jul 7, 2025

k8s-ci-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jul 7, 2025

capri-xiyue changed the base branch from main to release-1.0 July 7, 2025 21:06

capri-xiyue changed the title ~~[WIP] feat: move x-k8s to apix and add v1 InferencePool to api/v1~~ feat: move x-k8s to apix and add v1 InferencePool to api/v1 Jul 7, 2025

k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jul 7, 2025

kfswain added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jul 7, 2025

capri-xiyue mentioned this pull request Jul 7, 2025

feat: Change epp and conformance to use v1 type InferencePool #1118

Open

k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Jul 7, 2025

updated docker file

a4d5ed8

fixed naming collision

3e0ff2a

capri-xiyue added 2 commits July 8, 2025 10:06

added fake approved annotation

61fafbf

try to fix unit tests

51115df

capri-xiyue added 3 commits July 8, 2025 11:13

fixed typo

6f4941e

added annotation

739bb6b

fixed annotation

5096e70

zetxqx approved these changes Jul 9, 2025

View reviewed changes

config/crd/kustomization.yaml Show resolved Hide resolved

robscott reviewed Jul 11, 2025

View reviewed changes

capri-xiyue and others added 8 commits July 11, 2025 10:00

Update api/v1/inferencepool_types.go

a85056d

Co-authored-by: Rob Scott <rob.scott87@gmail.com>

Apply suggestions from code review

8029db4

Co-authored-by: Rob Scott <rob.scott87@gmail.com>

use v1 type in v1alpha2

19ddbf6

clean unused depdendency

c882c9c

fixed code generator error

87bbe4d

revert back

def4a8d

merge bob's change

274bfe3

upadted code-gen

5c46368

remove deep copy needs

b216c9a

robscott mentioned this pull request Jul 11, 2025

V1 inference pool #1145

Closed

capri-xiyue added 4 commits July 11, 2025 13:16

use codegen

5879f78

revert to workable version

0a9c063

re-run generate

712f800

revert to workable version

c3f3308

capri-xiyue requested a review from robscott July 12, 2025 00:47

feat: move x-k8s to apix and add v1 InferencePool to api/v1 #1116

Are you sure you want to change the base?

feat: move x-k8s to apix and add v1 InferencePool to api/v1 #1116

Conversation

capri-xiyue commented Jul 7, 2025

Uh oh!

k8s-ci-robot commented Jul 7, 2025

Uh oh!

netlify bot commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for gateway-api-inference-extension ready!

Uh oh!

capri-xiyue commented Jul 7, 2025

Uh oh!

capri-xiyue commented Jul 7, 2025

Uh oh!

capri-xiyue commented Jul 7, 2025

Uh oh!

kfswain commented Jul 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kfswain commented Jul 7, 2025

Uh oh!

capri-xiyue commented Jul 7, 2025

Uh oh!

capri-xiyue commented Jul 8, 2025

Uh oh!

capri-xiyue commented Jul 8, 2025

Uh oh!

capri-xiyue commented Jul 8, 2025

Uh oh!

capri-xiyue commented Jul 8, 2025

Uh oh!

k8s-ci-robot commented Jul 8, 2025

Uh oh!

zetxqx left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

k8s-ci-robot commented Jul 9, 2025

Uh oh!

capri-xiyue commented Jul 10, 2025

Uh oh!

robscott left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

robscott Jul 11, 2025

Choose a reason for hiding this comment

Uh oh!

capri-xiyue Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kfswain Jul 11, 2025

Choose a reason for hiding this comment

Uh oh!

capri-xiyue Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kfswain Jul 11, 2025

Choose a reason for hiding this comment

Uh oh!

capri-xiyue Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

capri-xiyue Jul 12, 2025

Choose a reason for hiding this comment

Uh oh!

capri-xiyue commented Jul 11, 2025

netlify bot commented Jul 7, 2025 •

edited

Loading

kfswain commented Jul 7, 2025 •

edited

Loading

capri-xiyue Jul 11, 2025 •

edited

Loading

capri-xiyue Jul 11, 2025 •

edited

Loading

capri-xiyue Jul 11, 2025 •

edited

Loading