[sig-network] EndpointSlice should create and delete Endpoints and EndpointSlices for a Service with a selector specified #92776

RobertKielty · 2020-07-03T10:06:02Z

Which jobs are flaking:
gce-ubuntu-master-default

Which test(s) are flaking:
[sig-network] EndpointSlice should create and delete Endpoints and EndpointSlices for a Service with a selector specified

Testgrid link:
https://testgrid.k8s.io/sig-release-master-informing#gce-ubuntu-master-default&sort-by-flakiness=&embed=

Reason for failure:
from Spyglass for job 1278952987569426433

we have

I0703 07:42:24.519 Jul 3 07:42:20.620: FAIL: EndpointSlice resource not deleted after Service endpointslice-3248/example-empty-selector was deleted: timed out waiting for the condition

following thru to the corresponding gubernator logs/ci-kubernetes-e2e-ubuntu-gce/1278952987569426433/

we see lot of occurrences of

W0703 07:45:10.958376 8 request.go:161] Auditing failed of request: encoding failed: /v1, Kind=DeleteOptions is unstructured and is not suitable for converting to "cloud.google.com/v1beta1"

Anything else we need to know:

links to go.k8s.io/triage appreciated

The text was updated successfully, but these errors were encountered:

RobertKielty · 2020-07-03T10:06:28Z

/sig network

athenabot · 2020-07-03T10:12:12Z

/triage unresolved

Comment /remove-triage unresolved when the issue is assessed and confirmed.

🤖 I am a bot run by vllry. 👩‍🔬

gaurimadhok · 2020-07-09T21:13:57Z

/assign @robscott

robscott · 2020-07-09T21:22:07Z

Thanks for filing this bug @RobertKielty! The encoding failed: /v1, Kind=DeleteOptions is a very strange error message, not sure what's happening there. Am I correct that this test has only failed once? Are there other test failures elsewhere?

athenabot · 2020-07-16T22:12:12Z

@robscott
If this issue has been triaged, please comment /remove-triage unresolved.

If you aren't able to handle this issue, consider unassigning yourself and/or adding the help-wanted label.

🤖 I am a bot run by vllry. 👩‍🔬

robscott · 2020-07-23T16:44:05Z

@RobertKielty do you think we can close this bug now? I haven't seen this fail recently.

RobertKielty · 2020-07-23T21:06:58Z

Hi @robscott thanks for pinging me on this issue.

I'm still seeing EndpointSlice resource not deleted errors (admittedly on other jobs)

https://storage.googleapis.com/k8s-gubernator/triage/index.html?pr=1&text=EndpointSlice%20resource%20not%20deleted%20after%20Service

EndpointSlice resource not deleted after Service endpointslice-3248/example-empty-selector was deleted: timed out waiting for the condition

cc @hasheddan What do you think we should do here?

robscott · 2020-07-23T23:53:49Z

Hey @RobertKielty, thanks for letting me know about the other failures! This is a tricky one since it's a timeout on garbage collection which could take a variable amount of time. I've opened a new PR that increases the timeout by 50%, hopefully that will be sufficient: #93402

RobertKielty · 2020-07-24T09:15:30Z

Thanks Rob !

liggitt · 2020-07-26T06:16:12Z

still showing up - see https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/pull/93264/pull-kubernetes-e2e-gce/1287128021647495169

garbage collection is susceptible to interruption if API groups are being added/removed frequently (which happens all the time in e2e test runs, see investigation of a prior GC timeout issue in #87668 (comment))

bridgetkromhout · 2020-08-06T21:15:13Z

@robscott to follow up - this is failing because garbage collection is taking too long

athenabot · 2020-08-15T22:12:16Z

@robscott
If this issue has been triaged, please comment /remove-triage unresolved.

If you aren't able to handle this issue, consider unassigning yourself and/or adding the help-wanted label.

🤖 I am a bot run by vllry. 👩‍🔬

athenabot · 2020-09-14T23:12:19Z

@robscott
If this issue has been triaged, please comment /remove-triage unresolved.

If you aren't able to handle this issue, consider unassigning yourself and/or adding the help-wanted label.

🤖 I am a bot run by vllry. 👩‍🔬

fejta-bot · 2021-02-14T22:39:04Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

robscott · 2021-02-14T22:57:55Z

Looks like this is still happening, just less frequently: https://storage.googleapis.com/k8s-gubernator/triage/index.html?test=EndpointSlice%20should%20create%20and%20delete%20Endpoints%20and%20EndpointSlices%20for%20a%20Service%20with%20a%20selector%20specified.

Some recent examples:

https://prow.k8s.io/view/gcs/kubernetes-jenkins/logs/ci-kubernetes-e2e-gce-cos-k8sstable1-default/1360211909957128192
https://prow.k8s.io/view/gcs/kubernetes-jenkins/logs/ci-kubernetes-e2e-gce-cos-k8sstable1-default/1359877020594475008

In both cases it's specifically timing out waiting for the EndpointSlice to be deleted.
/remove-lifecycle stale

liggitt · 2021-03-11T04:14:08Z

no flakes in last two weeks
https://testgrid.k8s.io/sig-release-master-informing#gce-ubuntu-master-default&sort-by-flakiness=&embed=&width=5&show-stale-tests=&include-filter-by-regex=EndpointSlice

/close

k8s-ci-robot · 2021-03-11T04:14:23Z

@liggitt: Closing this issue.

In response to this:

no flakes in last two weeks
https://testgrid.k8s.io/sig-release-master-informing#gce-ubuntu-master-default&sort-by-flakiness=&embed=&width=5&show-stale-tests=&include-filter-by-regex=EndpointSlice

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

RobertKielty added the kind/flake Categorizes issue or PR as related to a flaky test. label Jul 3, 2020

k8s-ci-robot added the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Jul 3, 2020

k8s-ci-robot added sig/network Categorizes an issue or PR as relevant to SIG Network. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Jul 3, 2020

k8s-ci-robot added the triage/unresolved Indicates an issue that can not or will not be resolved. label Jul 3, 2020

k8s-ci-robot assigned robscott Jul 9, 2020

robscott mentioned this issue Jul 23, 2020

Updating EndpointSlice e2e tests to be less flaky and easier to debug #93402

Merged

robscott mentioned this issue Sep 14, 2020

Increasing acceptable timeout for EndpointSlice garbage collection #94785

Merged

thockin added needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. and removed triage/unresolved Indicates an issue that can not or will not be resolved. labels Oct 15, 2020

thockin added triage/accepted Indicates an issue or PR is ready to be actively worked on. area/deflake Issues or PRs related to deflaking kubernetes tests area/test and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Nov 16, 2020

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 14, 2021

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 14, 2021

k8s-ci-robot closed this as completed Mar 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[sig-network] EndpointSlice should create and delete Endpoints and EndpointSlices for a Service with a selector specified #92776

[sig-network] EndpointSlice should create and delete Endpoints and EndpointSlices for a Service with a selector specified #92776

RobertKielty commented Jul 3, 2020

RobertKielty commented Jul 3, 2020

athenabot commented Jul 3, 2020

gaurimadhok commented Jul 9, 2020

robscott commented Jul 9, 2020 •

edited

Loading

athenabot commented Jul 16, 2020

robscott commented Jul 23, 2020

RobertKielty commented Jul 23, 2020 •

edited

Loading

robscott commented Jul 23, 2020

RobertKielty commented Jul 24, 2020

liggitt commented Jul 26, 2020

bridgetkromhout commented Aug 6, 2020

athenabot commented Aug 15, 2020

athenabot commented Sep 14, 2020

fejta-bot commented Feb 14, 2021

robscott commented Feb 14, 2021

liggitt commented Mar 11, 2021

k8s-ci-robot commented Mar 11, 2021

[sig-network] EndpointSlice should create and delete Endpoints and EndpointSlices for a Service with a selector specified #92776

[sig-network] EndpointSlice should create and delete Endpoints and EndpointSlices for a Service with a selector specified #92776

Comments

RobertKielty commented Jul 3, 2020

RobertKielty commented Jul 3, 2020

athenabot commented Jul 3, 2020

gaurimadhok commented Jul 9, 2020

robscott commented Jul 9, 2020 • edited Loading

athenabot commented Jul 16, 2020

robscott commented Jul 23, 2020

RobertKielty commented Jul 23, 2020 • edited Loading

robscott commented Jul 23, 2020

RobertKielty commented Jul 24, 2020

liggitt commented Jul 26, 2020

bridgetkromhout commented Aug 6, 2020

athenabot commented Aug 15, 2020

athenabot commented Sep 14, 2020

fejta-bot commented Feb 14, 2021

robscott commented Feb 14, 2021

liggitt commented Mar 11, 2021

k8s-ci-robot commented Mar 11, 2021

robscott commented Jul 9, 2020 •

edited

Loading

RobertKielty commented Jul 23, 2020 •

edited

Loading