New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug 1999556: annotate rendered config with OCP version #2918
Bug 1999556: annotate rendered config with OCP version #2918
Conversation
d7fb906
to
f8a5118
Compare
@kikisdeliveryservice: This pull request references Bugzilla bug 1999556, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 3 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Bugzilla (rioliu@redhat.com), skipping review request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@kikisdeliveryservice: This pull request references Bugzilla bug 1999556, which is valid. 3 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Bugzilla (rioliu@redhat.com), skipping review request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/test e2e-aws-disruptive |
31e860e
to
eabdba8
Compare
@kikisdeliveryservice: This pull request references Bugzilla bug 1999556, which is valid. 3 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Bugzilla (rioliu@redhat.com), skipping review request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
17332ce
to
5ba686b
Compare
@kikisdeliveryservice: This pull request references Bugzilla bug 1999556, which is valid. 3 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Bugzilla (rioliu@redhat.com), skipping review request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
mco/pr looking good, but some unrelatedblocked passing ``multus-validating-config.k8s.io: x509: certificate signed by unknown authority"``` /test e2e-agnostic-upgrade |
aws not doing great |
/retest-required |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall, this looks sane to me and I think we could ship it as is. Some comments below which could be considered all optionals/informational.
A couple of changes are incoming /hold |
f63d071
to
82ed8fd
Compare
Updated PR after doing some testing to implement some of @cgwalters suggestions. This is now ready. 😸 /hold cancel |
Code generally lgtm. I have a question on the BZ: for situations that would trigger the bugzilla error The Looking through CI search (https://search.ci.openshift.org/?search=pool+should+be+updated+before+the+CVO+reports+available+at+the+new+version&maxAge=336h&context=1&type=junit&name=&excludeName=&maxMatches=5&maxBytes=20971520&groupBy=job), there was only 2 hits of this in the past 2 weeks. Looking at the first link, there was a new rendered-master being generated: it seems that the upgrade only contained a diff in Which seems to imply that to trigger this error, a template change would be the only diff between the two OCP release versions...? Or is that due to another clusteroperator change that changed the dynamic variables populating that file? Would this PR be covering this scenario? Sorry, am a bit confused as to how we can trigger this error after your previous fixes, so trying to think this through. |
Controllerconfig also annotated to plumb version to render controller. Also added check to verify that annotations match ocp release image version to prevent MCO reported upgraded before a new rendered cfg is rolled out in cases where there is no new MCO commit or new osImageURL. Related-to: BZ 1999556
Related-to: BZ 1999556
…ersion annotation Related-to: BZ 1999556
82ed8fd
to
c5b9bf0
Compare
@yuqi-zhang Intention for this PR is to fix the known issue of when there is an OCP upgrade that contains no new MCO commit and no new MCO osImageURL. Have had known failures there in the past, so closing the loop on that case via this pr by waiting for a new renderconfig to always roll out on all upgrades. In the run that you pointed to, osimageurl is same for both releases and old rendered config and new rendered config both point to same controller version |
Yeah ok that makes sense, so we don't report before the newest render gets generated. /lgtm To give it a safer chance of making it in |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: cgwalters, kikisdeliveryservice, yuqi-zhang The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/retest-required Please review the full test history for this PR and help us cut down flakes. |
2 similar comments
/retest-required Please review the full test history for this PR and help us cut down flakes. |
/retest-required Please review the full test history for this PR and help us cut down flakes. |
@kikisdeliveryservice: The following tests failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
@kikisdeliveryservice: All pull requests linked via external trackers have merged: Bugzilla bug 1999556 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/cherry-pick release-4.9 |
@kikisdeliveryservice: #2918 failed to apply on top of branch "release-4.9":
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
We need to ensure that all rendered configs were generated in the proper version of the OCP release version before allowing the MCO to report upgraded. Previously, we added checks to verify MCs were generated with the correct MCO commit/version.Hash and verify the osImageURL. However, on rare occasions (and usually just in ci) there will be an OCP release that neither includes a new MCO commit nor a new rhcos image, and because we aren't checking against the OCP version, we prematurely report upgraded before the new MCO/MCC/Rendered MCs are generated.