Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug 2021097: Set Upgradeable: false when HA workloads are incorrectly spread #1472

Merged
merged 2 commits into from Nov 18, 2021

Conversation

jan--f
Copy link
Contributor

@jan--f jan--f commented Nov 8, 2021

… spread (#1431)

Bug 1995924: Set Upgradeable: false when HA workloads are incorrectly spread

  • pkg/rebalancer: sort resources for deletion

The resources that are marked for deletion are sorted by their PVC
creation timestamp, from the newest to the oldest to make the deletion
consistent.

Signed-off-by: Damien Grisonnet dgrisonn@redhat.com

  • pkg: improve rebalancer logging

Signed-off-by: Damien Grisonnet dgrisonn@redhat.com

  • pkg/rebalancer: sort based on pod names

Sort PVCs by their creation timestamps, from the newest to the oldest to
make sure that the oldest PVC is retained in case all of them are
annotated. If some PVCs have the same creation timestamp, they will be
sorted based on their pod name.

Signed-off-by: Damien Grisonnet dgrisonn@redhat.com

  • pkg/rebalancer: split annotation removal

Add EnsurePVCsAreNotAnnoted function that makes sure that none of the
PVCs of the given workload have the
openshift.io/cluster-monitoring-drop-pvc annotation after the
rebalancing is done. In case one of the PVC has the annotation, it will
be removed to prevent deleting the PVC in a future cycle.

Signed-off-by: Damien Grisonnet dgrisonn@redhat.com

  • pkg/rebalancer: sort resources to delete by age

Signed-off-by: Damien Grisonnet dgrisonn@redhat.com

  • test/e2e: fix framework

Signed-off-by: Damien Grisonnet dgrisonn@redhat.com

Co-authored-by: OpenShift Merge Robot openshift-merge-robot@users.noreply.github.com
(cherry picked from commit 8500e0f)

  • I added CHANGELOG entry for this change.
  • No user facing changes, so no entry in CHANGELOG was needed.

@openshift-ci openshift-ci bot added bugzilla/severity-high Referenced Bugzilla bug's severity is high for the branch this PR is targeting. bugzilla/invalid-bug Indicates that a referenced Bugzilla bug is invalid for the branch this PR is targeting. labels Nov 8, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Nov 8, 2021

@jan--f: This pull request references Bugzilla bug 2021097, which is invalid:

  • expected dependent Bugzilla bug 1995924 to be in one of the following states: VERIFIED, RELEASE_PENDING, CLOSED (ERRATA), CLOSED (CURRENTRELEASE), but it is ON_QA instead

Comment /bugzilla refresh to re-evaluate validity if changes to the Bugzilla bug are made, or edit the title of this pull request to link to a different bug.

In response to this:

Bug 2021097: Set Upgradeable: false when HA workloads are incorrectly spread

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 8, 2021
@jan--f
Copy link
Contributor Author

jan--f commented Nov 9, 2021

/hold
Investigating a CI issue with the master PR and still resolving backport issues.

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 9, 2021
@jan--f jan--f force-pushed the bp-1995924-4.9 branch 2 times, most recently from eb827f5 to 5ff3f13 Compare November 15, 2021 10:36
@jan--f
Copy link
Contributor Author

jan--f commented Nov 15, 2021

/retest

2 similar comments
@jan--f
Copy link
Contributor Author

jan--f commented Nov 15, 2021

/retest

@jan--f
Copy link
Contributor Author

jan--f commented Nov 16, 2021

/retest

dgrisonnet and others added 2 commits November 16, 2021 11:13
… spread (openshift#1431)

* Merge pull request openshift#1330 from dgrisonnet/ha-upgradeable

Bug 1995924: Set `Upgradeable: false` when HA workloads are incorrectly spread

* pkg/rebalancer: sort resources for deletion

The resources that are marked for deletion are sorted by their PVC
creation timestamp, from the newest to the oldest to make the deletion
consistent.

Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>

* pkg: improve rebalancer logging

Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>

* pkg/rebalancer: sort based on pod names

Sort PVCs by their creation timestamps, from the newest to the oldest to
make sure that the oldest PVC is retained in case all of them are
annotated. If some PVCs have the same creation timestamp, they will be
sorted based on their pod name.

Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>

* pkg/rebalancer: split annotation removal

Add EnsurePVCsAreNotAnnoted function that makes sure that none of the
PVCs of the given workload have the
openshift.io/cluster-monitoring-drop-pvc annotation after the
rebalancing is done. In case one of the PVC has the annotation, it will
be removed to prevent deleting the PVC in a future cycle.

Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>

* pkg/rebalancer: sort resources to delete by age

Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>

* test/e2e: fix framework

Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>

Co-authored-by: OpenShift Merge Robot <openshift-merge-robot@users.noreply.github.com>
(cherry picked from commit 8500e0f)
On master this was added in d3c1e5a.
This can be considered a partial backport.

Signed-off-by: Jan Fajerski <jfajersk@redhat.com>
@jan--f
Copy link
Contributor Author

jan--f commented Nov 16, 2021

/retest

1 similar comment
@jan--f
Copy link
Contributor Author

jan--f commented Nov 17, 2021

/retest

@jan--f
Copy link
Contributor Author

jan--f commented Nov 17, 2021

/bugzilla refresh

@openshift-ci openshift-ci bot added bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. and removed bugzilla/invalid-bug Indicates that a referenced Bugzilla bug is invalid for the branch this PR is targeting. labels Nov 17, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Nov 17, 2021

@jan--f: This pull request references Bugzilla bug 2021097, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

6 validation(s) were run on this bug
  • bug is open, matching expected state (open)
  • bug target release (4.9.z) matches configured target release for branch (4.9.z)
  • bug is in the state ASSIGNED, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)
  • dependent bug Bugzilla bug 1995924 is in the state VERIFIED, which is one of the valid states (VERIFIED, RELEASE_PENDING, CLOSED (ERRATA), CLOSED (CURRENTRELEASE))
  • dependent Bugzilla bug 1995924 targets the "4.10.0" release, which is one of the valid target releases: 4.10.0
  • bug has dependents

Requesting review from QA contact:
/cc @juzhao

In response to this:

/bugzilla refresh

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot requested a review from juzhao November 17, 2021 13:38
@jan--f
Copy link
Contributor Author

jan--f commented Nov 17, 2021

/unhold
I think this is ready to review and merge.

@openshift-ci openshift-ci bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 17, 2021
@simonpasquier
Copy link
Contributor

I'll add this at the top of my review list :)

@simonpasquier
Copy link
Contributor

/lgtm

@simonpasquier
Copy link
Contributor

/retest

@simonpasquier
Copy link
Contributor

/skip

@simonpasquier
Copy link
Contributor

/label backport-risk-assessed

@openshift-ci openshift-ci bot added the backport-risk-assessed Indicates a PR to a release branch has been evaluated and considered safe to accept. label Nov 17, 2021
@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Nov 17, 2021
@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

8 similar comments
@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Nov 17, 2021

@jan--f: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/versions b445978 link false /test versions

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

4 similar comments
@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Copy link
Contributor

/retest-required

Please review the full test history for this PR and help us cut down flakes.

@juzhao
Copy link

juzhao commented Nov 18, 2021

tested with PR, bound PVs for prometheus, and schedule prometheus pods to one same node, Upgradeable is False now
/lgtm

@juzhao
Copy link

juzhao commented Nov 18, 2021

/label qe-approved

@openshift-ci openshift-ci bot added the qe-approved Signifies that QE has signed off on this PR label Nov 18, 2021
@juzhao
Copy link

juzhao commented Nov 18, 2021

/label cherry-pick-approved

@openshift-ci openshift-ci bot added the cherry-pick-approved Indicates a cherry-pick PR into a release branch has been approved by the release branch manager. label Nov 18, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Nov 18, 2021

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jan--f, juzhao, simonpasquier

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [jan--f,simonpasquier]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-merge-robot openshift-merge-robot merged commit ae7b0a3 into openshift:release-4.9 Nov 18, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Nov 18, 2021

@jan--f: All pull requests linked via external trackers have merged:

Bugzilla bug 2021097 has been moved to the MODIFIED state.

In response to this:

Bug 2021097: Set Upgradeable: false when HA workloads are incorrectly spread

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. backport-risk-assessed Indicates a PR to a release branch has been evaluated and considered safe to accept. bugzilla/severity-high Referenced Bugzilla bug's severity is high for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. cherry-pick-approved Indicates a cherry-pick PR into a release branch has been approved by the release branch manager. lgtm Indicates that a PR is ready to be merged. qe-approved Signifies that QE has signed off on this PR
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

6 participants