Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AGENT-906: Script to run monitor-add-nodes in cluster #8294

Merged
merged 1 commit into from
May 9, 2024

Conversation

rwsu
Copy link
Contributor

@rwsu rwsu commented Apr 20, 2024

Derived from a similar script by Andrea Fasano to generate the add-nodes ISO.

#8242

This script tweaks it and creates a node-joiner-monitor pod to monitor adding nodes to a cluster.

@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Apr 20, 2024
@openshift-ci-robot
Copy link
Contributor

openshift-ci-robot commented Apr 20, 2024

@rwsu: This pull request references AGENT-863 which is a valid jira issue.

In response to this:

Derived from a similar script by Andrea Fasano to generate the add-nodes ISO.

#8242

This script tweaks it and creates a node-joiner-monitor pod to monitor adding nodes to a cluster.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Apr 20, 2024
Copy link
Contributor

openshift-ci bot commented Apr 20, 2024

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@rwsu rwsu changed the title AGENT-863: Script to run monitor-add-nodes in cluster AGENT-906: Script to run monitor-add-nodes in cluster Apr 29, 2024
@openshift-ci-robot
Copy link
Contributor

openshift-ci-robot commented Apr 29, 2024

@rwsu: This pull request references AGENT-906 which is a valid jira issue.

In response to this:

Derived from a similar script by Andrea Fasano to generate the add-nodes ISO.

#8242

This script tweaks it and creates a node-joiner-monitor pod to monitor adding nodes to a cluster.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@rwsu rwsu marked this pull request as ready for review April 30, 2024 03:05
@openshift-ci openshift-ci bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Apr 30, 2024
@@ -84,7 +84,8 @@ $ ./node-joiner.sh config.yaml
Use the iso image to boot all the nodes listed in the configuration file, and wait for the related
certificate signing requests (CSRs) to appear. When adding a new node to the cluster, two pending CSRs will
be generated, and they must be manually approved by the user.
Use the following command to monitor the pending certificates:

Use the following command or [node-joiner-monitor.sh](./node-joiner-monitor.sh) described below to monitor the pending certificates:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just noticed that below there's a small pre-existing typo, User instead of use


# Monitoring
After the ISO image is booted on a node, progress can be monitored using the node-joiner-monitor.sh script.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: I'd say the opposite, ie After a node is booted using the ISO image

is printed out to stdout.

The monitoring script will print out the CSRs that will need to be approved for the node
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here it could be useful to cite another couple of relevant points, printed by the script:

  • Pre-flight validations. In case the node does not match on or more validations, the installation will not start. The output of the mismatching validations is reported in the stdout to allow the user to fix the problem(s) when required.
  • Installation progress. A progress about the current installation stage is reported, it like the image writing to the disk.


oc wait --for=condition=Ready=true --timeout=300s pod/node-joiner-monitor

oc logs -f -n "${namespace}" node-joiner-monitor
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will be the logs printed in real time? Not sure if the wait prevents the output to be displayed only when everything is completed?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The wait ends after the pod becomes running. The log should be printed in real time after the wait.

Derived from a similar script by Andrea Fasano
to generate the add-nodes ISO.

openshift#8242

This script tweaks it and creates a node-joiner-monitor
pod to monitor adding nodes to a cluster.

Co-authored-by: Andrea Fasano <andfasano@redhat.com>
Copy link
Contributor

openshift-ci bot commented May 8, 2024

@rwsu: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/okd-e2e-agent-compact-ipv4 cc833e9 link false /test okd-e2e-agent-compact-ipv4

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

@andfasano
Copy link
Contributor

/approve

Copy link
Contributor

openshift-ci bot commented May 9, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: andfasano

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 9, 2024
Copy link
Contributor

@pawanpinjarkar pawanpinjarkar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label May 9, 2024
@openshift-merge-bot openshift-merge-bot bot merged commit bbca50f into openshift:master May 9, 2024
6 of 7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. lgtm Indicates that a PR is ready to be merged.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants