introduces a startup monitor #1130

Merged
1 commit merged into openshift:master on Jul 20, 2021

Conversation

p0lyn0mial
Contributor

@p0lyn0mial p0lyn0mial commented Jul 13, 2021

The main purpose of the new app is to provide a fallback mechanism that makes the operand revert to the last healthy revision when a new revision fails.

The fallback mechanism is opt-in per operator: the kube-apiserver and etcd operators (and potentially kcm and ks) have to consult the deployment topology in the infrastructure resource to decide whether to enable it.

Co-authored-by: Dr. Stefan Schimanski <stefan.schimanski@gmail.com>
Co-authored-by: Lukasz Szaszkiewicz <lukasz.szaszkiewicz@gmail.com>

For more details see: openshift/enhancements#833
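
The opt-in decision described above could look roughly like the sketch below. It is not taken from the PR: the `TopologyMode` type, its constants, and the rule that only a single-replica control plane enables the fallback are assumptions made for illustration. A real operator would read the topology from the infrastructure resource rather than from a hard-coded value.

```go
package main

import "fmt"

// TopologyMode is a hypothetical stand-in for the deployment topology that an
// operator reads from the infrastructure resource.
type TopologyMode string

const (
	HighlyAvailable TopologyMode = "HighlyAvailable"
	SingleReplica   TopologyMode = "SingleReplica"
)

// fallbackEnabled reports whether an operator should opt in to the startup
// monitor. Assumption: only a single-replica control plane opts in, because
// there is no other healthy replica to serve traffic if a new revision fails.
func fallbackEnabled(topology TopologyMode) bool {
	return topology == SingleReplica
}

func main() {
	for _, t := range []TopologyMode{HighlyAvailable, SingleReplica} {
		fmt.Printf("%s: startup monitor enabled = %v\n", t, fallbackEnabled(t))
	}
}
```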

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress label Jul 13, 2021
@openshift-ci openshift-ci bot requested review from mfojtik and sttts July 13, 2021 07:54
@p0lyn0mial
Contributor Author

/assign @sttts @marun

terminationGracePeriodSeconds: 5
priorityClassName: system-node-critical
tolerations:
- operator: "Exists"
Contributor Author

do we need it?

Contributor

No idea. Better to remove it than to cargo-cult it :)

- --target-name=kube-apiserver
- --manifests-dir=/etc/kubernetes/manifests
- --resource-dir=/etc/kubernetes/static-pod-resources
- --revision=REVISION
Contributor

Do we really still have REVISION and not {{.Revision}}?

Contributor Author

Yeah, REVISION is replaced by the installer pod.

Contributor Author

At some point we should clean this up :)
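
For readers unfamiliar with the placeholder approach discussed in this thread, here is a minimal sketch of a literal-token substitution. The function name `substituteRevision` is hypothetical and this is not the installer pod's actual code; it only illustrates replacing the `REVISION` token with a concrete revision number.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// substituteRevision replaces every literal REVISION token in the manifest
// text with the given revision number. Hypothetical helper for illustration.
func substituteRevision(manifest string, revision int) string {
	return strings.ReplaceAll(manifest, "REVISION", strconv.Itoa(revision))
}

func main() {
	args := "- --target-name=kube-apiserver\n- --revision=REVISION\n"
	fmt.Print(substituteRevision(args, 7))
}
```

Note that a plain token replacement rewrites every occurrence of `REVISION` in the text, including ones that were not meant as placeholders, which is one argument for a template field such as `{{.Revision}}`.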

}

// step 1: create last known good revision
if err := m.io.Symlink(revisionedTargetManifestPath, m.lastKnownGoodManifestDstPath()); err != nil {
Contributor

Isn't there a force variant for this? I always thought a symlink update was atomic.

@p0lyn0mial p0lyn0mial changed the title from "[WIP] introduces a startup monitor" to "introduces a startup monitor" Jul 16, 2021
@openshift-ci openshift-ci bot removed the do-not-merge/work-in-progress label Jul 16, 2021
@sttts sttts force-pushed the startup-monitor branch 3 times, most recently from d03338f to 0c3516b July 16, 2021 18:19
@marun marun removed their assignment Jul 19, 2021
@p0lyn0mial
Contributor Author

/lgtm

@openshift-ci
Contributor

openshift-ci bot commented Jul 20, 2021

@p0lyn0mial: you cannot LGTM your own PR.

In response to this:

/lgtm


@sttts
Contributor

sttts commented Jul 20, 2021

Needs squashing.

Then lgtm.

/approve

@openshift-ci openshift-ci bot added the approved label Jul 20, 2021
@sttts
Contributor

sttts commented Jul 20, 2021

/lgtm

@openshift-ci openshift-ci bot added the lgtm label Jul 20, 2021
@openshift-ci
Contributor

openshift-ci bot commented Jul 20, 2021

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: p0lyn0mial, sttts


@openshift-merge-robot openshift-merge-robot merged commit f8ed438 into openshift:master Jul 20, 2021