Flake when SCCs are not associated yet #26225

ravisantoshgudimetla · 2021-06-14T14:36:55Z

https://bugzilla.redhat.com/show_bug.cgi?id=1961204 was punted
to 4.9 release, however occasionally we see multus not
getting SCCs in time causing auth issues. This commit
causes the test to flake in such scenarios instead of
failing. This is a temporary measure to get better
signal for 4.8 upgrades. We need to remove this
when the associated bug has been fixed

cc @vrutkovs @dcbw

openshift-ci · 2021-06-14T14:37:05Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: ravisantoshgudimetla
To complete the pull request process, please assign bparees after the PR has been reviewed.
You can assign the PR to them by writing /assign @bparees in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

pkg/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

deads2k · 2021-06-14T15:56:34Z

pkg/synthetictests/networking.go

@@ -42,6 +42,10 @@ func testPodSandboxCreation(events monitorapi.Intervals) []*ginkgo.JUnitTestCase
 			flakes = append(flakes, fmt.Sprintf("%v - multus is unable to get pods due to LB disruption https://bugzilla.redhat.com/show_bug.cgi?id=1927264 - %v", event.Locator, event.Message))
 			continue
 		}
+		if strings.Contains(event.Message, "Multus") && strings.Contains(event.Message, "error getting pod") && strings.Contains(event.Message, "is forbidden: User \"system:serviceaccount:openshift-multus:multus\"") {
+			flakes = append(flakes, fmt.Sprintf("%v - multus is unable to get pods due to not getting required SCCs https://bugzilla.redhat.com/show_bug.cgi?id=1961204 - %v", event.Locator, event.Message))


a GET request doesn't ever use SCC

deads2k · 2021-06-14T15:56:45Z

the fix and bug dont' appear related

/hold

https://bugzilla.redhat.com/show_bug.cgi?id=1961204 was punted to 4.9 release, however occasionally we see multus not getting SCCs in time causing auth issues. This commit causes the test to flake in such scenarios instead of failing. This is a temporary measure to get better signal for 4.8 upgrades. We need to remove this when the associated bug has been fixed

deads2k · 2021-06-15T19:50:09Z

one possibility is that there is no ready kube-apiserver. Another possibility is that there is a ready kube-apiserver, but for some reason the LB isn't detecting it.

openshift/kubernetes#807 should show the reality of being ready at any given time. I'm not sure how to collect the list of endpoints for a load balancer. Perhaps the SPLAT team knows how to do this?

openshift-ci · 2021-06-19T01:46:20Z

@ravisantoshgudimetla: PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

openshift-ci · 2021-06-25T15:23:33Z

@ravisantoshgudimetla: The following tests failed, say /retest to rerun all failed tests:

Test name	Commit	Details	Rerun command
ci/prow/e2e-gcp-csi	`595740d`	link	`/test e2e-gcp-csi`
ci/prow/e2e-aws-csi	`595740d`	link	`/test e2e-aws-csi`
ci/prow/e2e-agnostic-cmd	`595740d`	link	`/test e2e-agnostic-cmd`
ci/prow/e2e-gcp	`595740d`	link	`/test e2e-gcp`
ci/prow/e2e-aws-disruptive	`595740d`	link	`/test e2e-aws-disruptive`
ci/prow/e2e-gcp-upgrade	`595740d`	link	`/test e2e-gcp-upgrade`
ci/prow/e2e-gcp-disruptive	`595740d`	link	`/test e2e-gcp-disruptive`
ci/prow/e2e-aws-jenkins	`595740d`	link	`/test e2e-aws-jenkins`

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

openshift-bot · 2021-09-23T20:47:43Z

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

openshift-bot · 2021-10-23T21:16:47Z

Stale issues rot after 30d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

openshift-bot · 2021-11-22T21:47:41Z

Rotten issues close after 30d of inactivity.

Reopen the issue by commenting /reopen.
Mark the issue as fresh by commenting /remove-lifecycle rotten.
Exclude this issue from closing again by commenting /lifecycle frozen.

/close

openshift-ci · 2021-11-22T21:51:19Z

@openshift-bot: Closed this PR.

In response to this:

Rotten issues close after 30d of inactivity.

Reopen the issue by commenting /reopen.
Mark the issue as fresh by commenting /remove-lifecycle rotten.
Exclude this issue from closing again by commenting /lifecycle frozen.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

openshift-ci bot requested review from mfojtik and spadgett June 14, 2021 14:37

deads2k reviewed Jun 14, 2021

View reviewed changes

openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jun 14, 2021

ravisantoshgudimetla force-pushed the fix-scc branch from 73af412 to 595740d Compare June 14, 2021 16:00

openshift-ci bot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jun 19, 2021

openshift-ci bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 23, 2021

openshift-ci bot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Oct 23, 2021

openshift-ci bot closed this Nov 22, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flake when SCCs are not associated yet #26225

Flake when SCCs are not associated yet #26225

ravisantoshgudimetla commented Jun 14, 2021

openshift-ci bot commented Jun 14, 2021

deads2k Jun 14, 2021

deads2k commented Jun 14, 2021

deads2k commented Jun 15, 2021

openshift-ci bot commented Jun 19, 2021

openshift-ci bot commented Jun 25, 2021

openshift-bot commented Sep 23, 2021

openshift-bot commented Oct 23, 2021

openshift-bot commented Nov 22, 2021

openshift-ci bot commented Nov 22, 2021

Flake when SCCs are not associated yet #26225

Flake when SCCs are not associated yet #26225

Conversation

ravisantoshgudimetla commented Jun 14, 2021

openshift-ci bot commented Jun 14, 2021

deads2k Jun 14, 2021

Choose a reason for hiding this comment

deads2k commented Jun 14, 2021

deads2k commented Jun 15, 2021

openshift-ci bot commented Jun 19, 2021

openshift-ci bot commented Jun 25, 2021

openshift-bot commented Sep 23, 2021

openshift-bot commented Oct 23, 2021

openshift-bot commented Nov 22, 2021

openshift-ci bot commented Nov 22, 2021