New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug 1840531: Adding rollbackcopy subcommand #372
Bug 1840531: Adding rollbackcopy subcommand #372
Conversation
898c3b8
to
0e0c3ba
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looking good @retroflexer , few notes.
Although we definitely want the backup logic to eventually be in golang having both means we need to support both in 4.4. Just something to consider for bugs/logical changes etc.
Try to make the logical flow as simple as possible, adding unit tests will make review easier in general.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a quick first pass, there's a lot more here to discuss.
860da5f
to
4e7c15d
Compare
@retroflexer: This pull request references Bugzilla bug 1840531, which is invalid:
Comment In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@retroflexer: This pull request references Bugzilla bug 1846025, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 3 validation(s) were run on this bug
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/bugzilla refresh |
@retroflexer: This pull request references Bugzilla bug 1846025, which is valid. 3 validation(s) were run on this bug
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
3abeeef
to
1a28818
Compare
bbe17cf
to
4c1a082
Compare
4c1a082
to
293c03d
Compare
/test e2e-metal-ipi |
/retest |
/test e2e-aws |
1 similar comment
/test e2e-aws |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: hexfusion, retroflexer The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Verified the sanity of the backup taken by the rollbackcopy container by comparing it with the tar backup obtained from cluster-backup.sh script.
Applied cluster-restore.sh to restore the pod to the database saved by the rollbackcopy. It successfully restored and a single node cluster came up:
And the cluster came up with single node cluster successfully! |
This is a targeted bug fix, instead of committing to master/4.6 and reverting we would like to merge directly in 4.5 with the net as a 4.6 removal. cc @mfojtik once merged into 4.5 we will verify with QE tonight. If there is a problem we will revert. But this has been extensively tested in CI and manually. |
/refresh |
@retroflexer: The following tests failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/retest Please review the full test history for this PR and help us cut down flakes. |
@retroflexer: All pull requests linked via external trackers have merged: . Bugzilla bug 1840531 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/cherry-pick release-4.4 |
@hexfusion: new pull request created: #379 In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
This work provides a container within the etcd pod to take periodic backups.
The immediate need for automated backups is to provide a path to upgrade from 4.4 to 4.5, with the knowledge that if the upgrade fails for some reason, there is no easy path to rollback, as the etcd version 3.4.x (used in OCP 4.5) is incompatible with the etcd version of 3.3.x (used in 4.4.x). Since restoring a backup is the safest way to rollback, it is important to have automated periodical backups to protect the users from unexpected data loss in such scenarios.