Populate list of clusters in the controller at startup. #364

alexeyklyukin · 2018-08-07T15:48:38Z

Assign the list of clusters in the controller with the up-to-date list
of Postgres manifests on Kubernetes during the startup.

Node migration routines launched asynchronously to the cluster
processing rely on an up-to-date list of clusters in the controller to
detect clusters affected by the migration of the node and lock them
when doing migration of master pods. Without the initial list the
operator was subject to race conditions like the one described at
#363

Restructure the code to decouple list cluster function required by the
postgresql informer from the one that emits cluster sync events. No
extra work is introduced, since cluster sync already runs in a separate
goroutine (clusterResync).

Introduce explicit initial cluster sync at the end of
acquireInitialListOfClusters instead of relying on an implicit one
coming from list function of the PostgreSQL informer.

Some minor refactoring.

Assign the list of clusters in the controller with the up-to-date list of Postgres manifests on Kubernetes during the startup. Node migration routines launched asynchronously to the cluster processing rely on an up-to-date list of clusters in the controller to detect clusters affected by the migration of the node and lock them when doing migration of master pods. Without the initial list the operator was subject to race conditions like the one described at #363 Restructure the code to decouple list cluster function required by the postgresql informer from the one that emits cluster sync events. No extra work is introduced, since cluster sync already runs in a separate goroutine (clusterResync). Introduce explicit initial cluster sync at the end of acquireInitialListOfClusters instead of relying on an implicit one coming from list function of the PostgreSQL informer. Some minor refactoring.

coveralls · 2018-08-07T15:52:30Z

Coverage remained the same at 4.606% when pulling 5da4951 on wip/restartable_node_migration into 1405058 on master.

sdudoladov · 2018-08-07T16:43:57Z

pkg/controller/controller.go

 func (c *Controller) Run(stopCh <-chan struct{}, wg *sync.WaitGroup) {
 	c.initController()

+	// start workers reading from the events queue to prevent the initial sync from blocking on it.


that basically means that workers must always start before acquireInitialListOfClusters executes since it queues the 1st Sync event, right ?

Yes, and if there are no consumers of that queue acquireInitialListOfClusters function will block forever...

alexeyklyukin requested review from erthalion and sdudoladov as code owners August 7, 2018 15:48

sdudoladov approved these changes Aug 7, 2018

View reviewed changes

alexeyklyukin merged commit 199aa65 into master Aug 8, 2018

alexeyklyukin deleted the wip/restartable_node_migration branch August 8, 2018 09:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Populate list of clusters in the controller at startup. #364

Populate list of clusters in the controller at startup. #364

Uh oh!

alexeyklyukin commented Aug 7, 2018

Uh oh!

coveralls commented Aug 7, 2018

Uh oh!

sdudoladov Aug 7, 2018

Uh oh!

alexeyklyukin Aug 8, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Populate list of clusters in the controller at startup. #364

Populate list of clusters in the controller at startup. #364

Uh oh!

Conversation

alexeyklyukin commented Aug 7, 2018

Uh oh!

coveralls commented Aug 7, 2018

Uh oh!

sdudoladov Aug 7, 2018

Choose a reason for hiding this comment

Uh oh!

alexeyklyukin Aug 8, 2018

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants