
mgr: Allow more than two mgrs #12895

Merged
merged 1 commit into from
Sep 14, 2023

Conversation

travisn
Member

@travisn travisn commented Sep 12, 2023

Description of your changes:
While two mgrs are considered sufficient, more mgr daemons are now allowed in case the admin wants even more fault tolerance against multiple mgrs going down at once. Up to five mgr pods are allowed. All mgrs are in standby mode except the one active mgr. Each mgr pod has a sidecar that updates its own pod spec with the active or passive label.
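The sidecar labeling described above can be sketched as a small helper: compare this pod's mgr daemon ID against the active mgr reported by Ceph and pick the label value. The function name and structure here are assumptions for illustration, not Rook's actual implementation:

```go
package main

import "fmt"

// mgrRoleLabel returns the value a labeling sidecar would set on its own
// pod, given this pod's mgr daemon ID and the currently active mgr reported
// by Ceph. Hypothetical helper; the label values follow the PR description
// ("active" or "passive"), not necessarily Rook's exact label scheme.
func mgrRoleLabel(daemonID, activeMgr string) string {
	if daemonID == activeMgr {
		return "active"
	}
	return "passive"
}

func main() {
	fmt.Println(mgrRoleLabel("a", "a")) // active
	fmt.Println(mgrRoleLabel("b", "a")) // passive
}
```

Services that route traffic to the active mgr can then select on this label, so a failover only requires the sidecars to re-label their pods.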

Which issue is resolved by this Pull Request:
Resolves #12812

Checklist:

  • Commit Message Formatting: Commit titles and messages follow guidelines in the developer guide.
  • Reviewed the developer guide on Submitting a Pull Request
  • Pending release notes updated with breaking and/or notable changes for the next minor release.
  • Documentation has been updated, if necessary.
  • Unit tests have been added, if necessary.
  • Integration tests have been added, if necessary.

Contributor

@subhamkrai subhamkrai left a comment


lgtm, also restarted a couple of the CI jobs

// +kubebuilder:validation:Minimum=0
// +kubebuilder:validation:Maximum=2
// +kubebuilder:validation:Maximum=5
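The diff above raises the kubebuilder-enforced maximum from 2 to 5. In context, these markers sit on the mgr count field of the CRD spec; a minimal sketch, assuming simplified type and field names rather than Rook's exact definitions:

```go
package main

import "fmt"

// MgrSpec is a hypothetical sketch of the CRD type carrying the mgr count.
// The kubebuilder markers below generate OpenAPI validation in the CRD
// schema, so the API server rejects out-of-range values before the
// operator ever sees them.
type MgrSpec struct {
	// Count is the number of mgr daemons to run.
	// +kubebuilder:validation:Minimum=0
	// +kubebuilder:validation:Maximum=5
	Count int `json:"count,omitempty"`
}

func main() {
	spec := MgrSpec{Count: 3}
	fmt.Println("mgr count:", spec.Count)
}
```

With this marker regenerated into the CRD, a `kubectl apply` of a cluster asking for more than five mgrs fails schema validation at admission time.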
Member


If we have validation by the CRD, why do we need secondary validation in the code?

Member Author


The code validation was there before the CRD schema validation was added. If you're referring to the maxMgrCount in code, that's still needed for deleting extra mgrs. But I'm going to look at the implementation again to see if there's a better way to remove extras without duplicating that constant in code.

Member Author


Now the maxMgrCount is removed from the code, and the reconcile queries for the existing mgrs to see if there are any extras to remove.

While two mgrs are considered sufficient, more mgr daemons are
now allowed in case the admin wants even more fault tolerance
against multiple mgrs going down at once.
Up to five mgr pods are allowed. All mgrs are in
standby mode except the one active mgr. Each mgr pod has a sidecar
that updates its own pod spec with the active
or passive label.

Signed-off-by: travisn <tnielsen@redhat.com>
@travisn travisn merged commit 8da6dea into rook:master Sep 14, 2023
48 of 50 checks passed
@travisn travisn deleted the multiple-mgr branch September 14, 2023 21:02
mergify bot added a commit that referenced this pull request Sep 14, 2023
logger.Warningf("failed to check for extra mgrs. %v", err)
return
}
if len(mgrDeployments.Items) == len(daemonIDs) {
Member


What if there are fewer deployments than daemons?

Member Author


If there are fewer deployments, the loop below just won't find any to remove. But I don't expect that to happen, because this is called right after all the expected mgr daemons are reconciled.
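The behavior described here, filtering existing mgr deployments against the expected daemon IDs, can be sketched as a pure function. Names and types are simplified assumptions, not the operator's exact code:

```go
package main

import "fmt"

// extraMgrs returns the deployment daemon IDs that are not in the expected
// set, i.e. the mgr deployments a reconcile would remove. If there are
// fewer deployments than expected daemons, nothing is returned, matching
// the "loop just won't find any to remove" behavior described above.
func extraMgrs(deploymentIDs, daemonIDs []string) []string {
	expected := make(map[string]bool, len(daemonIDs))
	for _, id := range daemonIDs {
		expected[id] = true
	}
	var extras []string
	for _, id := range deploymentIDs {
		if !expected[id] {
			extras = append(extras, id)
		}
	}
	return extras
}

func main() {
	fmt.Println(extraMgrs([]string{"a", "b", "c"}, []string{"a", "b"})) // [c]
	fmt.Println(extraMgrs([]string{"a"}, []string{"a", "b"}))          // []
}
```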

continue
}
found := false
for _, daemonID := range daemonIDs {
Member


What if one of the mgrs is deleted?

For example: list of mgrs is {a,b,c}.
b gets deleted, new list is {a,c}.
Now a new mgr d will be added to keep up the replica count, giving {a,c,d}, but as the daemonIDs array is implemented it will still contain {a,b,c}.
Later, if the mgr replica count becomes 4, the new list is {a,c,d,e} while the daemonIDs list would be {a,b,c,d}.

So the removal would compare the mgr list {a,c,d,e} against the daemon list {a,b,c,d}, by which this algorithm would remove e unintentionally.

I can discuss more on this

Member


PS: No, it won't be a problem, sorry for the confusion. That concern would have applied to the mon algorithm.

Successfully merging this pull request may close these issues.

Allow more than 2 mgr instances
3 participants