New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OCPBUGS-7076: Copy Day2 BMH if Agent is installing #5250
OCPBUGS-7076: Copy Day2 BMH if Agent is installing #5250
Conversation
@CrystalChun: This pull request references Jira Issue OCPBUGS-7076, which is invalid:
Comment The bug has been updated to refer to the pull request using the external bug tracker. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
https://issues.redhat.com/browse/OCPBUGS-7076 Previously the BMAC would copy the BMH and Machine CRs of a Day2 worker to the spoke cluster before the Agent began installing. If the Agent doesn't start installing within the two hours these CRs were copied over, the CSRs for these nodes will not be approved by the machine approver pod. To prevent this from happening, only copy the BMH and Machine CRs to the spoke cluster when the Agent has started installing.
Codecov Report
Additional details and impacted files@@ Coverage Diff @@
## master #5250 +/- ##
==========================================
+ Coverage 67.46% 68.91% +1.44%
==========================================
Files 218 221 +3
Lines 32714 35497 +2783
==========================================
+ Hits 22072 24462 +2390
- Misses 8656 8915 +259
- Partials 1986 2120 +134
|
Why not wait until the node is fully installed? What happens if the installation itself takes more than 2-3 hours (for example, a long time to pull images, or the host booted into the wrong disk and it took time to address)? |
@avishayt Thank you for the feedback! I've tested this out by only checking for The spoke cluster's
where Please correct me if I'm wrong, but it seems like copying the BMH & Machine of the day 2 worker node must be done while the Agent is in the installing phase. Otherwise, the CSR will never be approved and the installation will never complete. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Your PR looks good - sorry for the noise. It slipped my mind that we won't set the status to installed until the node is Ready, and that won't happen until the CSR is approved.
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: avishayt, CrystalChun The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
No problem! Thank you again for the review! |
/jira refresh |
@CrystalChun: This pull request references Jira Issue OCPBUGS-7076, which is valid. 3 validation(s) were run on this bug
Requesting review from QA contact: In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@CrystalChun: all tests passed! Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
@CrystalChun: Jira Issue OCPBUGS-7076: All pull requests linked via external trackers have merged: Jira Issue OCPBUGS-7076 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/cherry-pick release-ocm-2.7 |
@CrystalChun: new pull request created: #5283 In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/cherry-pick release-ocm-2.8 |
@CrystalChun: new pull request created: #5286 In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
https://issues.redhat.com/browse/OCPBUGS-7076 Previously the BMAC would copy the BMH and Machine CRs of a Day2 worker to the spoke cluster before the Agent began installing. If the Agent doesn't start installing within the two hours these CRs were copied over, the CSRs for these nodes will not be approved by the machine approver pod. To prevent this from happening, only copy the BMH and Machine CRs to the spoke cluster when the Agent has started installing.
https://issues.redhat.com//browse/MGMT-14871 Backport to ACM 2.7 Cherry pick of openshift#5250 Original description: OCPBUGS-7076 Previously the BMAC would copy the BMH and Machine CRs of a Day2 worker to the spoke cluster before the Agent began installing. If the Agent doesn't start installing within the two hours these CRs were copied over, the CSRs for these nodes will not be approved by the machine approver pod. To prevent this from happening, only copy the BMH and Machine CRs to the spoke cluster when the Agent has started installing.
https://issues.redhat.com//browse/MGMT-14871 Backport to ACM 2.7 Cherry pick of openshift#5250 OCPBUGS-7076: Copy Day2 BMH if Agent is installing (openshift#5250) https://issues.redhat.com/browse/OCPBUGS-7076 Previously the BMAC would copy the BMH and Machine CRs of a Day2 worker to the spoke cluster before the Agent began installing. If the Agent doesn't start installing within the two hours these CRs were copied over, the CSRs for these nodes will not be approved by the machine approver pod. To prevent this from happening, only copy the BMH and Machine CRs to the spoke cluster when the Agent has started installing.
Fix included in accepted release 4.14.0-0.nightly-2023-09-11-201102 |
Fix included in accepted release 4.14.0-0.nightly-2023-09-12-024050 |
Fix included in accepted release 4.14.0-0.nightly-2023-09-15-101929 |
https://issues.redhat.com/browse/OCPBUGS-7076 Previously the BMAC would copy the BMH and Machine CRs of a Day2 worker to the spoke cluster before the Agent began installing. If the Agent doesn't start installing within the two hours these CRs were copied over, the CSRs for these nodes will not be approved by the machine approver pod. To prevent this from happening, only copy the BMH and Machine CRs to the spoke cluster when the Agent has started installing.
OCPBUGS-7076
Previously the BMAC would copy the BMH and
Machine CRs of a Day2 worker to the spoke cluster before
the Agent began installing. If the Agent doesn't
start installing within the two hours these CRs
were copied over, the CSRs for these nodes will
not be approved by the machine approver pod.
To prevent this from happening, only copy
the BMH and Machine CRs to the spoke cluster
when the Agent has started installing.
List all the issues related to this PR
What environments does this code impact?
How was this code tested?
Checklist
docs
, README, etc)Reviewers Checklist
/cc @eranco74