[Network] refactor lifecycle manager #1031

synzhu · 2021-07-26T21:56:06Z

This PR refactors the lifecycle management code in Network into a separate struct.

This struct can be used in the future to refactor many existing ReadyDoneAware modules: https://github.com/dapperlabs/flow-go/issues/5702

closes https://github.com/dapperlabs/flow-go/issues/5704

codecov-commenter · 2021-07-26T22:12:53Z

Codecov Report

Merging #1031 (93b1f8e) into master (2d496c1) will increase coverage by 0.14%.
The diff coverage is 68.29%.

@@            Coverage Diff             @@
##           master    #1031      +/-   ##
==========================================
+ Coverage   53.27%   53.42%   +0.14%     
==========================================
  Files         318      318              
  Lines       21515    21521       +6     
==========================================
+ Hits        11462    11497      +35     
+ Misses       8487     8455      -32     
- Partials     1566     1569       +3

| Flag | Coverage Δ | |
|---|---|---|
| unittests | `53.42% <68.29%> (+0.14%)` | ⬆️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ | |
|---|---|---|
| network/p2p/network.go | `0.00% <0.00%> (ø)` | |
| module/lifecycle/lifecycle.go | `100.00% <100.00%> (ø)` | |
| cmd/util/ledger/migrations/storage_v4.go | `41.56% <0.00%> (-0.61%)` | ⬇️ |
| ...sus/approvals/assignment_collector_statemachine.go | `50.00% <0.00%> (+7.69%)` | ⬆️ |

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2d496c1...93b1f8e.

synzhu · 2021-07-27T19:21:11Z

utils/unittest/unittest.go

+func RequireClosed(t *testing.T, ch <-chan struct{}, message string) {
+	select {
+	case <-ch:
+	default:
+		require.Fail(t, "channel is not closed: "+message)
+	}
+}


Note: this is guaranteed to select from the channel if it is closed, and will only run the default case if nothing is available to be read from the channel: https://stackoverflow.com/questions/45580151/priority-of-case-versus-default-in-golang-select-statements
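
To illustrate the note above, here is a minimal standalone sketch of the same `select`/`default` check. The helper name `isClosed` is hypothetical and not part of this PR; it assumes the channel is only ever closed, never sent on.

```go
package main

import "fmt"

// isClosed reports whether a receive on ch can proceed right now.
// For a chan struct{} that is only ever closed (never sent on),
// that is equivalent to "the channel is closed".
// Hypothetical helper for illustration; not part of the PR.
func isClosed(ch <-chan struct{}) bool {
	select {
	case <-ch:
		// A receive from a closed channel never blocks, so this case is ready.
		return true
	default:
		// Chosen only when no other case can proceed.
		return false
	}
}

func main() {
	ch := make(chan struct{})
	fmt.Println(isClosed(ch)) // false: channel is open and empty
	close(ch)
	fmt.Println(isClosed(ch)) // true: receive on a closed channel succeeds immediately
}
```

Because `default` runs only when no other case is ready, the check cannot report "not closed" for a channel that was closed before the `select` started.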

synzhu · 2021-07-27T19:21:23Z

utils/unittest/unittest.go

+func RequireNotClosed(t *testing.T, ch <-chan struct{}, message string) {
+	select {
+	case <-ch:
+		require.Fail(t, "channel is closed: "+message)
+	default:
+	}
+}


yhassanzadeh13

Looks great 👍

yhassanzadeh13 · 2021-07-29T22:35:51Z

module/lifecycle/lifecycle.go

+	lm.stateTransition.Lock()
+	if lm.shutdownCommenced || lm.startupCommenced {
+		lm.stateTransition.Unlock()
+		return
+	}
+	lm.startupCommenced = true
+	lm.stateTransition.Unlock()


Suggested change

-	lm.stateTransition.Lock()
-	if lm.shutdownCommenced || lm.startupCommenced {
-		lm.stateTransition.Unlock()
-		return
-	}
-	lm.startupCommenced = true
-	lm.stateTransition.Unlock()
+	func() {
+		lm.stateTransition.Lock()
+		defer lm.stateTransition.Unlock()
+		if lm.shutdownCommenced || lm.startupCommenced {
+			return
+		}
+		lm.startupCommenced = true
+	}()

Using the defer pattern is safer in this scenario.

Thanks! In this particular case, I think the suggested change may introduce a bit more messiness than the current code, because when we return on line 79 we actually want to return from the enclosing function.

If we wrap the logic in a nested function, the return only returns from the nested function, so we would probably have to add some if / else logic and return a boolean from the nested function instead.

I think that may not be necessary here, since the locking code is very short and contained.
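
For reference, the trade-off discussed above could look roughly like the following sketch. It uses a small helper method rather than an inline closure. The `manager` type and `tryBeginStartup` are hypothetical stand-ins: only the field names come from the quoted diff, and the real `OnStart` in this PR may do more (for example, signalling readiness).

```go
package main

import (
	"fmt"
	"sync"
)

// manager is a hypothetical stand-in for the lifecycle manager;
// only the field names are taken from the quoted diff.
type manager struct {
	stateTransition   sync.Mutex
	startupCommenced  bool
	shutdownCommenced bool
}

// tryBeginStartup marks startup as commenced and reports whether the
// caller should proceed. The deferred Unlock covers every return path.
func (lm *manager) tryBeginStartup() bool {
	lm.stateTransition.Lock()
	defer lm.stateTransition.Unlock()
	if lm.shutdownCommenced || lm.startupCommenced {
		return false
	}
	lm.startupCommenced = true
	return true
}

// OnStart runs startupFn at most once, and never after shutdown has begun.
func (lm *manager) OnStart(startupFn func()) {
	if !lm.tryBeginStartup() {
		return // this return leaves OnStart itself, not just the helper
	}
	startupFn()
}

func main() {
	lm := &manager{}
	n := 0
	lm.OnStart(func() { n++ })
	lm.OnStart(func() { n++ }) // ignored: startup already commenced
	fmt.Println(n)             // 1
}
```

The boolean return is the extra plumbing mentioned in the reply: it buys a deferred unlock at the cost of one more function and one more branch.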

yhassanzadeh13 · 2021-07-29T22:38:40Z

module/lifecycle/lifecycle.go

+	lm.stateTransition.Lock()
+	if lm.shutdownCommenced {
+		lm.stateTransition.Unlock()
+		return
+	}
+	lm.shutdownCommenced = true
+	lm.stateTransition.Unlock()


The same defer pattern is suggested here.

yhassanzadeh13 · 2021-07-29T22:44:50Z

module/lifecycle/lifecycle_test.go

+
+// TestConsecutiveStart tests that calling OnStart multiple times concurrently only
+// results in startup being performed once
+func (suite *LifecycleManagerSuite) TestConsecutiveStart() {


The test scenario looks concurrent rather than consecutive.

yhassanzadeh13 · 2021-07-29T22:44:56Z

module/lifecycle/lifecycle_test.go

+
+// TestConsecutiveStop tests that calling OnStop multiple times concurrently only
+// results in shutdown being performed once
+func (suite *LifecycleManagerSuite) TestConsecutiveStop() {


The test scenario looks concurrent rather than consecutive.

yhassanzadeh13 · 2021-07-29T23:38:01Z

module/lifecycle/lifecycle_test.go

+
+	suite.lm.OnStart(func() {
+		// simulate startup processing
+		time.Sleep(3 * time.Second)


**General comment:** Sleep durations in this PR are on the order of seconds, which slows down CI builds. It would be great if you could reduce them to milliseconds where possible. In most cases, 100 milliseconds is a preferable sleep duration.

Or you could use a channel here as a gate. After the call to suite.lm.OnStop(), you can close the channel (a.k.a. open the gate). Then we won't have to rely on a sleep.
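
A minimal sketch of the gate idea, assuming the simulated startup work simply blocks on a channel instead of sleeping. The function `runGated` is hypothetical and only illustrates the pattern; it is not code from this PR.

```go
package main

import (
	"fmt"
	"time"
)

// runGated starts simulated work in a goroutine that blocks until gate is
// closed, and returns a channel that is closed once the work has finished.
// Hypothetical helper for illustration only.
func runGated(gate <-chan struct{}) <-chan struct{} {
	done := make(chan struct{})
	go func() {
		defer close(done)
		<-gate // replaces time.Sleep: wait until the test opens the gate
	}()
	return done
}

func main() {
	gate := make(chan struct{})
	done := runGated(gate)

	// ... the test would exercise the code under test here (e.g. call OnStop) ...

	close(gate) // open the gate: the simulated startup may now finish

	select {
	case <-done:
		fmt.Println("finished")
	case <-time.After(time.Second):
		fmt.Println("timed out")
	}
}
```

With a gate, the test controls exactly when the simulated work completes, so its running time no longer depends on a fixed sleep duration.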

vishalchangrani · 2021-07-30T00:15:02Z

module/lifecycle/lifecycle_test.go

+	unittest.RequireCloseBefore(suite.T(), suite.lm.Started(), time.Second, "timed out waiting for startup")
+	time.Sleep(100 * time.Millisecond) // wait for potential race conditions to occur
+
+	suite.Assert().EqualValues(1, numStarts)


We can use assert.Eventuallyf here to factor in the sleep.
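
As a rough illustration of the polling idea behind this suggestion, here is a standard-library-only sketch. The `eventually` helper is hypothetical and only approximates what testify's assert.Eventuallyf does (poll a condition at a fixed interval until it holds or a timeout elapses); in the actual test one would call the testify function instead.

```go
package main

import (
	"fmt"
	"sync/atomic"
	"time"
)

// eventually polls cond every tick until it returns true or waitFor elapses.
// Hypothetical helper that approximates the polling behaviour of
// testify's assert.Eventually / assert.Eventuallyf.
func eventually(cond func() bool, waitFor, tick time.Duration) bool {
	deadline := time.Now().Add(waitFor)
	for {
		if cond() {
			return true
		}
		if time.Now().After(deadline) {
			return false
		}
		time.Sleep(tick)
	}
}

func main() {
	var numStarts int32

	// Simulate a startup that completes a little later.
	go func() {
		time.Sleep(10 * time.Millisecond)
		atomic.AddInt32(&numStarts, 1)
	}()

	ok := eventually(func() bool {
		return atomic.LoadInt32(&numStarts) == 1
	}, time.Second, 5*time.Millisecond)
	fmt.Println("condition met:", ok)
}
```

Polling returns as soon as the condition holds, so the common case finishes faster than a fixed sleep while still tolerating slow CI machines up to the timeout.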

vishalchangrani · 2021-07-30T00:22:28Z

network/p2p/network.go

-	}()
-
-	return n.done
+	n.cancel()


vishalchangrani

lgtm

refactor lifecycle manager

89283ef

synzhu requested review from vishalchangrani, jordanschalm and yhassanzadeh13 July 26, 2021 21:56

synzhu mentioned this pull request Jul 26, 2021

[Network] Sychronize ready and done #1026

Merged

Merge branch 'master' into smnzhu/synchronize-network

e727a2f

Merge branch 'master' into smnzhu/synchronize-network

47361e9

synzhu requested a review from huitseeker July 27, 2021 07:40

synzhu assigned yhassanzadeh13 Jul 27, 2021

synzhu added 4 commits July 27, 2021 10:32

Added teset

c0912cd

Merge branch 'master' into smnzhu/synchronize-network

242212a

Add tests

770c16f

Update lifecycle_test.go

6ae227d

synzhu commented Jul 27, 2021

synzhu added 3 commits July 27, 2021 12:24

Merge branch 'master' into smnzhu/synchronize-network

3fc3446

Update lifecycle_test.go

06c83da

Merge branch 'smnzhu/synchronize-network' of https://github.com/onflow/flow-go into smnzhu/synchronize-network

3e40a7c

synzhu requested a review from AlexHentschel July 28, 2021 08:23

Merge branch 'master' into smnzhu/synchronize-network

93b1f8e

synzhu mentioned this pull request Jul 29, 2021

Vishal/smnzhu/unstaked an #1051

Closed

yhassanzadeh13 approved these changes Jul 29, 2021

vishalchangrani reviewed Jul 30, 2021

network/p2p/network.go

❤️

vishalchangrani approved these changes Jul 30, 2021

synzhu added 3 commits July 30, 2021 16:11

rename test methods

bd0a93f

update sleep durations

dcf5511

Merge branch 'master' into smnzhu/synchronize-network

52d9966

synzhu merged commit c9f6e0e into master Jul 30, 2021

synzhu deleted the smnzhu/synchronize-network branch July 30, 2021 23:57

This was referenced Aug 2, 2021

[Synchronization engine] Split into separate request and response processing components. #1068

Merged

[hotstuff] replace SingleRunner with LifecycleManager #1078

Closed
