Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug 1963730: kube-apiserver failed to load SNI cert and key #610

Merged
merged 3 commits into from Jun 11, 2021

Conversation

p0lyn0mial
Copy link
Contributor

@p0lyn0mial p0lyn0mial commented Jun 9, 2021

As of today, the dynamic certificates i.e. kube-apiserver-certs are accessed by at least two processes, namely the installer pod and the cert-syncer.
Up until now, there was no coordination between these processes that might have lead to many unexpected errors, like https://bugzilla.redhat.com/show_bug.cgi?id=1963730

This PR writes all certificates in an atomic way by first creating a temporary file, writing the content to it, and then renaming it to the original file.

os.Rename calls syscall.Rename which in turn uses the rename syscall (Linux) which provides atomicity (https://man7.org/linux/man-pages/man2/rename.2.html)

The previous attempt didn't work as it would break the DR scripts - openshift/library-go#1098

@openshift-ci openshift-ci bot requested review from marun and retroflexer June 9, 2021 10:58
@p0lyn0mial
Copy link
Contributor Author

/retest

1 similar comment
@p0lyn0mial
Copy link
Contributor Author

/retest

@p0lyn0mial p0lyn0mial force-pushed the atomic-certs branch 3 times, most recently from 5106fac to d39f518 Compare June 11, 2021 09:54
@hexfusion
Copy link
Contributor

Moved upgrade to GCP where I saw green in 4.8 rehearsal openshift/release#19168

/test e2e-agnostic-upgrade

@p0lyn0mial p0lyn0mial changed the title proof for https://github.com/openshift/library-go/pull/1103 Bug 1963730: kube-apiserver failed to load SNI cert and key Jun 11, 2021
@openshift-ci openshift-ci bot added bugzilla/severity-low Referenced Bugzilla bug's severity is low for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. labels Jun 11, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 11, 2021

@p0lyn0mial: This pull request references Bugzilla bug 1963730, which is valid. The bug has been updated to refer to the pull request using the external bug tracker.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.8.0) matches configured target release for branch (4.8.0)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)

Requesting review from QA contact:
/cc @wangke19

In response to this:

Bug 1963730: kube-apiserver failed to load SNI cert and key

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot requested a review from wangke19 June 11, 2021 10:36
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 11, 2021

@p0lyn0mial: This pull request references Bugzilla bug 1963730, which is valid.

3 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.8.0) matches configured target release for branch (4.8.0)
  • bug is in the state POST, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)

Requesting review from QA contact:
/cc @wangke19

In response to this:

Bug 1963730: kube-apiserver failed to load SNI cert and key

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@hexfusion
Copy link
Contributor

/lgtm
/approve

@openshift-ci openshift-ci bot added lgtm Indicates that a PR is ready to be merged. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Jun 11, 2021
@hexfusion
Copy link
Contributor

skip known disruptive failures

/override ci/prow/e2e-gcp-disruptive-five-control-plane-replicas
/override ci/prow/e2e-aws-disruptive
/override ci/prow/e2e-aws-disruptive-ovn
/override ci/prow/e2e-gcp-disruptive
/override ci/prow/e2e-gcp-disruptive-five-control-plane-replicas
/override ci/prow/e2e-gcp-disruptive-ovn

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 11, 2021

@hexfusion: Overrode contexts on behalf of hexfusion: ci/prow/e2e-aws-disruptive, ci/prow/e2e-aws-disruptive-ovn, ci/prow/e2e-gcp-disruptive, ci/prow/e2e-gcp-disruptive-five-control-plane-replicas, ci/prow/e2e-gcp-disruptive-ovn

In response to this:

skip known disruptive failures

/override ci/prow/e2e-gcp-disruptive-five-control-plane-replicas
/override ci/prow/e2e-aws-disruptive
/override ci/prow/e2e-aws-disruptive-ovn
/override ci/prow/e2e-gcp-disruptive
/override ci/prow/e2e-gcp-disruptive-five-control-plane-replicas
/override ci/prow/e2e-gcp-disruptive-ovn

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Jun 11, 2021
@hexfusion
Copy link
Contributor

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Jun 11, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 11, 2021

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: hexfusion, p0lyn0mial

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@hexfusion
Copy link
Contributor

/override ci/prow/e2e-gcp-disruptive-five-control-plane-replicas
/override ci/prow/e2e-aws-disruptive
/override ci/prow/e2e-aws-disruptive-ovn
/override ci/prow/e2e-gcp-disruptive
/override ci/prow/e2e-gcp-disruptive-five-control-plane-replicas
/override ci/prow/e2e-gcp-disruptive-ovn

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 11, 2021

@hexfusion: Overrode contexts on behalf of hexfusion: ci/prow/e2e-aws-disruptive, ci/prow/e2e-aws-disruptive-ovn, ci/prow/e2e-gcp-disruptive, ci/prow/e2e-gcp-disruptive-five-control-plane-replicas, ci/prow/e2e-gcp-disruptive-ovn

In response to this:

/override ci/prow/e2e-gcp-disruptive-five-control-plane-replicas
/override ci/prow/e2e-aws-disruptive
/override ci/prow/e2e-aws-disruptive-ovn
/override ci/prow/e2e-gcp-disruptive
/override ci/prow/e2e-gcp-disruptive-five-control-plane-replicas
/override ci/prow/e2e-gcp-disruptive-ovn

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-bot
Copy link
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 11, 2021

@p0lyn0mial: The following tests failed, say /retest to rerun all failed tests:

Test name Commit Details Rerun command
ci/prow/e2e-gcp-disruptive-ovn 33fa8a2 link /test e2e-gcp-disruptive-ovn
ci/prow/e2e-gcp-disruptive-five-control-plane-replicas 33fa8a2 link /test e2e-gcp-disruptive-five-control-plane-replicas
ci/prow/e2e-gcp-disruptive 33fa8a2 link /test e2e-gcp-disruptive

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@openshift-merge-robot openshift-merge-robot merged commit 300bdf3 into openshift:master Jun 11, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jun 11, 2021

@p0lyn0mial: Some pull requests linked via external trackers have merged:

The following pull requests linked via external trackers have not merged:

These pull request must merge or be unlinked from the Bugzilla bug in order for it to move to the next state. Once unlinked, request a bug refresh with /bugzilla refresh.

Bugzilla bug 1963730 has not been moved to the MODIFIED state.

In response to this:

Bug 1963730: kube-apiserver failed to load SNI cert and key

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. bugzilla/severity-low Referenced Bugzilla bug's severity is low for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. lgtm Indicates that a PR is ready to be merged.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants