updating CRD causes connection loss with active watches of custom resources #113966

l1b0k · 2022-11-17T05:36:55Z

What happened?

update CRD 's description fields
the client watching for the cr resource will lost watch Terminate custom resource watches when storage is destroyed #78029

What did you expect to happen?

As i only chang the CRD's descriptions , this is not necessory to notify all client about the change.
This increase signifient pressure for kube-apiserver in large cluster.

How can we reproduce it (as minimally and precisely as possible)?

update CRD fields, and you can see watch connection is lost

Anything else we need to know?

No response

Kubernetes version

$ kubectl version
Server Version: version.Info{Major:"1", Minor:"22+", GitVersion:"v1.22.15-aliyun.1", GitCommit:"707e514954f0f3ba8ce36face7cf7058403057bc", GitTreeState:"clean", BuildDate:"2022-09-22T03:45:47Z", GoVersion:"go1.16.15", Compiler:"gc", Platform:"linux/amd64"}

/sig api-machinery

l1b0k · 2022-11-17T07:12:54Z

/sig api-machinery

Ritikaa96 · 2022-11-17T08:26:53Z

Maybe we can retitle it as:
updating CRD causes connection loss with active watches of custom resources.
for a better understanding.

Ritikaa96 · 2022-11-17T08:28:36Z

/area api-server
/area custom-resources

k8s-ci-robot · 2022-11-17T08:28:39Z

@Ritikaa96: The label(s) area/api-server cannot be applied, because the repository doesn't have them.

In response to this:

/area api-server
/area custom-resources

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Ritikaa96 · 2022-11-17T08:30:01Z

Whoops , my bad!
/area apiserver

Sakuralbj · 2022-11-17T12:11:13Z

Now crdHandler compare the spec and acceptedNames of crd to decide if crd update should be ignored.
So if some description info such as spec.versions.additionalPrinterColumns.description was changed,
kube-apiserver will destory storage and close watches. Can we make a more precise comparison here to reduce unnecessary connection close. In large-cluster the reconnection will bring huge load to apiserver. https://github.com/kubernetes/kubernetes/blob/master/staging/src/k8s.io/apiextensions-apiserver/pkg/apiserver/customresource_handler.go#L499

Sakuralbj · 2022-11-17T15:45:35Z

Maybe spec and acceptedNames here is not correct enough to compare against if a change is made on a CRD. Whether we can build a new struct to save necessary info which can be used to judge if storage need change.

leilajal · 2022-11-17T22:33:32Z

/triage accepted

k8s-triage-robot · 2024-01-19T08:58:47Z

This issue has not been updated in over 1 year, and should be re-triaged.

You can:

Confirm that this issue is still relevant with /triage accepted (org members only)
Close this issue with /close

For more details on the triage process, see https://www.kubernetes.dev/docs/guide/issue-triage/

/remove-triage accepted

Ritikaa96 · 2024-01-25T11:52:19Z

hi @l1b0k does the error still exist on your side?

jiahuif · 2024-01-25T17:50:13Z

/triage accepted

l1b0k added the kind/bug Categorizes issue or PR as related to a bug. label Nov 17, 2022

k8s-ci-robot added needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Nov 17, 2022

k8s-ci-robot added sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Nov 17, 2022

k8s-ci-robot added the area/custom-resources label Nov 17, 2022

k8s-ci-robot added the area/apiserver label Nov 17, 2022

l1b0k changed the title ~~Update CRD cause apiserver~~ updating CRD causes connection loss with active watches of custom resources Nov 17, 2022

k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Nov 17, 2022

k8s-ci-robot added needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. and removed triage/accepted Indicates an issue or PR is ready to be actively worked on. labels Jan 19, 2024

k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Jan 25, 2024

k8s-ci-robot assigned likakuli Jan 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

updating CRD causes connection loss with active watches of custom resources #113966

updating CRD causes connection loss with active watches of custom resources #113966

l1b0k commented Nov 17, 2022 •

edited

l1b0k commented Nov 17, 2022

Ritikaa96 commented Nov 17, 2022

Ritikaa96 commented Nov 17, 2022

k8s-ci-robot commented Nov 17, 2022

Ritikaa96 commented Nov 17, 2022 •

edited

Sakuralbj commented Nov 17, 2022 •

edited

Sakuralbj commented Nov 17, 2022

leilajal commented Nov 17, 2022

k8s-triage-robot commented Jan 19, 2024

Ritikaa96 commented Jan 25, 2024

jiahuif commented Jan 25, 2024

updating CRD causes connection loss with active watches of custom resources #113966

updating CRD causes connection loss with active watches of custom resources #113966

Comments

l1b0k commented Nov 17, 2022 • edited

What happened?

What did you expect to happen?

How can we reproduce it (as minimally and precisely as possible)?

Anything else we need to know?

Kubernetes version

l1b0k commented Nov 17, 2022

Ritikaa96 commented Nov 17, 2022

Ritikaa96 commented Nov 17, 2022

k8s-ci-robot commented Nov 17, 2022

Ritikaa96 commented Nov 17, 2022 • edited

Sakuralbj commented Nov 17, 2022 • edited

Sakuralbj commented Nov 17, 2022

leilajal commented Nov 17, 2022

k8s-triage-robot commented Jan 19, 2024

Ritikaa96 commented Jan 25, 2024

jiahuif commented Jan 25, 2024

l1b0k commented Nov 17, 2022 •

edited

Ritikaa96 commented Nov 17, 2022 •

edited

Sakuralbj commented Nov 17, 2022 •

edited