Bug 1865998: Tolerate multiple package manifests with the same name #6225

Merged

Conversation

spadgett
Member

@spadgett spadgett commented Aug 5, 2020

Generate a unique key for package manifests in our k8s reducer when name and namespace aren't unique.
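
For illustration, here is a minimal sketch of that idea in TypeScript, assuming the catalog source fields on the PackageManifest status are what distinguish manifests that share a name and namespace; the interface and helper names are illustrative, not the PR's actual reducer code.

```typescript
// Hypothetical sketch only -- not the PR's actual reducer code.
interface PackageManifestLike {
  metadata: { name: string; namespace?: string };
  status?: { catalogSource?: string; catalogSourceNamespace?: string };
}

// Build a reducer key that stays unique even when two package manifests
// share the same name and namespace, by folding in the catalog source.
const packageManifestKey = (obj: PackageManifestLike): string =>
  [
    obj.metadata.namespace,
    obj.metadata.name,
    obj.status?.catalogSource,
    obj.status?.catalogSourceNamespace,
  ]
    .filter(Boolean)
    .join('~');
```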

@openshift-ci-robot openshift-ci-robot added the bugzilla/severity-urgent Referenced Bugzilla bug's severity is urgent for the branch this PR is targeting. label Aug 5, 2020
@openshift-ci-robot
Contributor

@spadgett: This pull request references Bugzilla bug 1865998, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.6.0) matches configured target release for branch (4.6.0)
  • bug is in the state ASSIGNED, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST)

In response to this:

Bug 1865998: Tolerate multiple package manifests with the same name

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci-robot openshift-ci-robot added the bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. label Aug 5, 2020
@openshift-ci-robot openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 5, 2020
@spadgett
Member Author

spadgett commented Aug 5, 2020

Note this is only a partial fix. Links to package manifest details pages will still be ambiguous since we rely on name + namespace to be unique in the URL.

@spadgett
Member Author

spadgett commented Aug 5, 2020

/cherry-pick release-4.5

@openshift-cherrypick-robot

@spadgett: once the present PR merges, I will cherry-pick it on top of release-4.5 in a new PR and assign it to you.

In response to this:

/cherry-pick release-4.5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@spadgett
Member Author

spadgett commented Aug 5, 2020

/assign @TheRealJon @andrewballantyne

This change partially works around upstream OLM bug:

https://bugzilla.redhat.com/show_bug.cgi?id=1814822

Generate a unique key for package manifests in our k8s reducer when name and namespace aren't unique.
@spadgett
Member Author

spadgett commented Aug 5, 2020

/retest

@andrewballantyne
Contributor

/lgtm

If the tests pass I think we are good to go. It appears to have addressed the issue.

@openshift-ci-robot openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Aug 5, 2020
@openshift-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: andrewballantyne, spadgett

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@spadgett
Member Author

spadgett commented Aug 5, 2020

It got past the original failure but failed a later test. I'll dig into this once the artifacts are captured. It might be a different flake.

80) Interacting with an `AllNamespaces` install mode Operator (Jaeger)
   ✔ displays subscription creation form for selected Operator
   ✔ selects all namespaces for Operator subscription
   ✔ displays Operator as subscribed in OperatorHub
   ✔ displays Operator in "Cluster Service Versions" view for "test-hjnyy" namespace
   ✔ creates Operator `Deployment`
   ✔ displays metadata about Operator in the "Overview" section
   ✔ displays empty message in the "Jaeger" section
   ✔ displays form editor for creating a new `Jaeger` instance
   ✔ displays new `Jaeger` that was created from YAML editor
   ✔ displays metadata about the created `Jaeger` in its "Overview" section
A Jasmine spec timed out. Resetting the WebDriver Control Flow.
A Jasmine spec timed out. Resetting the WebDriver Control Flow.
A Jasmine spec timed out. Resetting the WebDriver Control Flow.
   ✖ displays the raw YAML for the `Jaeger` (3 failures)

@spadgett
Member Author

spadgett commented Aug 5, 2020

/retest

@andrewballantyne
Contributor

andrewballantyne commented Aug 5, 2020

/retest

Hoping it's an unrelated flake?

@spadgett
Member Author

spadgett commented Aug 5, 2020

I think it's an unrelated flake. I'm testing locally now.

@spadgett
Member Author

spadgett commented Aug 5, 2020

It works locally for me (both testing manually and running the protractor tests). I have a suspicion the resource updated in the background while the test was running, which prevented the test from saving the YAML. But I'm not sure.

@andrewballantyne
Contributor

It works locally for me (both testing manually and running the protractor tests). I have a suspicion the resource updated in the background while the test was running, which prevented the test from saving the YAML. But I'm not sure.

I'm not an expert on this, but I thought you've referenced screenshots before when tests fail? Like the page as-is when the test failed.

@openshift-bot
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@spadgett
Member Author

spadgett commented Aug 5, 2020

I'm not an expert on this, but I thought you've referenced screenshots before when tests fail? Like the page as-is when the test failed.

Yeah, screenshot is here:

https://storage.googleapis.com/origin-ci-test/pr-logs/pull/openshift_console/6225/pull-ci-openshift-console-master-e2e-gcp-console/1291048652772478976/artifacts/e2e-gcp-console/gui_test_screenshots/bc627daf4d44611dd42152266e071756.png

It's under Artifacts -> e2e-gcp-console -> gui_test_screenshots from the test details page.

@openshift-merge-robot openshift-merge-robot merged commit cd87130 into openshift:master Aug 5, 2020
@openshift-ci-robot
Contributor

@spadgett: All pull requests linked via external trackers have merged: openshift/console#6225. Bugzilla bug 1865998 has been moved to the MODIFIED state.

In response to this:

Bug 1865998: Tolerate multiple package manifests with the same name

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-cherrypick-robot

@spadgett: new pull request created: #6237

In response to this:

/cherry-pick release-4.5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@spadgett spadgett deleted the package-manifest-qn branch August 5, 2020 20:42
@spadgett
Member Author

spadgett commented Aug 6, 2020

Digging into the YAML flake more, I've confirmed the resource was updated in the background:

[SEVERE] https://console-openshift-console.apps.ci-op-tmbri0v5-75d12.origin-ci-int-gce.dev.openshift.com/api/kubernetes/apis/jaegertracing.io/v1/namespaces/test-hjnyy/jaegers/jaeger-all-in-one-inmemory - Failed to load resource: the server responded with a status of 409 (Conflict)

I'm tempted to remove the lines that save the YAML editor in the OLM tests. That's already well-covered by the CRUD tests and doesn't seem specific to OLM.

https://github.com/openshift/console/blob/master/frontend/packages/operator-lifecycle-manager/integration-tests/scenarios/global-installmode.scenario.ts#L201-L206

I'm surprised there isn't an error message in the screenshot, though.
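
As background on why that save fails, Kubernetes uses optimistic concurrency: a PUT that carries a stale metadata.resourceVersion is rejected with 409 Conflict, which matches the log line above. A rough sketch of the sequence, with a hypothetical API endpoint and token (not the console's actual client code):

```typescript
// Rough sketch with a hypothetical endpoint and token -- not the console's client code.
const base = 'https://api.cluster.example.com:6443'; // hypothetical
const path =
  '/apis/jaegertracing.io/v1/namespaces/test-hjnyy/jaegers/jaeger-all-in-one-inmemory';

async function saveStaleYaml(token: string): Promise<void> {
  // 1. Read the object; its metadata.resourceVersion reflects the state at read time.
  const current = await fetch(base + path, {
    headers: { Authorization: `Bearer ${token}` },
  }).then((res) => res.json());

  // 2. Suppose the operator (or another browser tab) updates the object here,
  //    bumping resourceVersion on the server.

  // 3. A PUT that still carries the old resourceVersion is rejected.
  const res = await fetch(base + path, {
    method: 'PUT',
    headers: {
      Authorization: `Bearer ${token}`,
      'Content-Type': 'application/json',
    },
    body: JSON.stringify(current), // stale resourceVersion in metadata
  });
  console.log(res.status); // 409 when the object changed in between
}
```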

@andrewballantyne
Contributor

Digging into the YAML flake more, I've confirmed the resource was updated in the background:

[SEVERE] https://console-openshift-console.apps.ci-op-tmbri0v5-75d12.origin-ci-int-gce.dev.openshift.com/api/kubernetes/apis/jaegertracing.io/v1/namespaces/test-hjnyy/jaegers/jaeger-all-in-one-inmemory - Failed to load resource: the server responded with a status of 409 (Conflict)

I'm tempted to remove the lines that save the YAML editor in the OLM tests. That's already well-covered by the CRUD tests and doesn't seem specific to OLM.

Sounds like overlap that adds to the flakes. I don't think we need tests for this YAML save ... not sure it adds any quality to our tests.

https://github.com/openshift/console/blob/master/frontend/packages/operator-lifecycle-manager/integration-tests/scenarios/global-installmode.scenario.ts#L201-L206

I'm surprised there isn't an error message in the screenshot, though.

I was too - figured maybe the error got replaced with the info or something haha.

@spadgett
Member Author

spadgett commented Aug 6, 2020

I was too - figured maybe the error got replaced with the info or something haha.

That's exactly it! Here's how to reproduce manually:

  1. Start editing resource YAML in browser tab 1.
  2. Commit some changes to the same resource in tab 2.
  3. Click save in tab 1. (You get a conflict error.)
  4. Commit some other changes in tab 2.

The error goes away in tab 1. I'm almost certain that's what happened in the tests, in which case we just got unlucky and things are working as expected. I'm not sure there's a good way to fix this other than removing the save from the test.

Arguably it's a bug that we clear the error on background updates, though.

@spadgett
Member Author

spadgett commented Aug 6, 2020

I guess an alternate fix is to always click Reload before Save if we want to keep the test.
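
If we kept the test, the sequence could look roughly like this in Protractor; the data-test selectors below are hypothetical stand-ins, not the console's actual test IDs:

```typescript
import { $, browser, ExpectedConditions as until } from 'protractor';

// Hedged sketch: selectors are hypothetical, not the console's actual test IDs.
export const reloadThenSave = async (): Promise<void> => {
  const reloadButton = $('[data-test="reload-object"]');
  const saveButton = $('[data-test="save-changes"]');

  // Reload pulls in the latest resourceVersion so the subsequent save is less
  // likely to hit a 409 Conflict from a background update.
  await browser.wait(until.elementToBeClickable(reloadButton), 10000);
  await reloadButton.click();

  await browser.wait(until.elementToBeClickable(saveButton), 10000);
  await saveButton.click();
};
```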

@andrewballantyne
Contributor

I guess an alternate fix is to always click Reload before Save if we want to keep the test.

I think that will just reduce the flake count... not eliminate it. I don't think there's anything about a resource's spec that says it can't be updated back to back whenever some criteria it deems necessary are met.

@andrewballantyne
Contributor

Arguably it's a bug that we clear the error on background updates, though.

Imo, definitely a bug :) Clearing errors isn't something I think we should do until another submit is triggered. Submit errors should stick around even if the data reloads, since the user is the only one who really has control over how quickly they consume and understand that error message... so programmatically removing it feels like bad UX to me.

@spadgett
Member Author

spadgett commented Aug 6, 2020

I opened https://bugzilla.redhat.com/show_bug.cgi?id=1866875 for the error message getting cleared.

@spadgett spadgett added this to the v4.6 milestone Aug 11, 2020