Add jitter to DKG start and upon processing phase 1 broadcast messages #1303
Conversation
Codecov Report
@@            Coverage Diff             @@
##           master    #1303      +/-   ##
==========================================
- Coverage   55.39%   55.39%   -0.01%
==========================================
  Files         510      510
  Lines       31857    31892      +35
==========================================
+ Hits        17648    17667      +19
- Misses      11834    11851      +17
+ Partials     2375     2374       -1
Nice 👌🏼
I have a comment and a question:
- I know 500µs is a default duration and won't be used in the real networks. Shouldn't we also set the mainnet and testnet config somewhere as part of this PR?
- I suggest making the phase duration dependent on the new config, or at least having a sanity check that the phase duration is at least the max delay plus some constant buffer. Can that be added to this PR?
module/dkg/controller.go
// We would like to target spreading this cost over a 20 minute period in the
// average case.
//
// 500µs results in per-broadcast delays of max=11.25s, ave=5.625s.
What is the 500µs? Or are you talking about the 500ms that is used as `DefaultBaseHandleBroadcastDelay`?
- // 500µs results in per-broadcast delays of max=11.25s, ave=5.625s.
+ // 500ms results in per-broadcast delays of max=11.25s, ave=5.625s.
`DefaultBaseHandleBroadcastDelay` is 500 microseconds, not 500 milliseconds (code)
dkgSize = 0
}

// m=b*n^2
Don't quite understand this math. Could you explain?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maximum_delay = base_delay * number_of_dkg_nodes ^ 2
Then we select the actual delay from [0, maximum_delay]
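For concreteness, here is a rough sketch of that computation in Go (the function name and structure are illustrative only, not the actual controller code):

```go
package main

import (
	"fmt"
	"math/rand"
	"time"
)

// preHandleDelay is an illustrative helper: maximum_delay = base_delay * n^2,
// and the returned delay is sampled uniformly from [0, maximum_delay].
func preHandleDelay(baseDelay time.Duration, dkgSize int) time.Duration {
	if dkgSize <= 0 || baseDelay <= 0 {
		return 0
	}
	maxDelay := time.Duration(dkgSize*dkgSize) * baseDelay // m = b * n^2
	return time.Duration(rand.Int63n(int64(maxDelay) + 1)) // uniform in [0, m]
}

func main() {
	// b = 500µs, n = 150  =>  max = 0.0005s * 150^2 = 11.25s, average ≈ 5.625s
	fmt.Println(preHandleDelay(500*time.Microsecond, 150))
}
```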
So the reason for `* number_of_dkg_nodes ^ 2` is because DKG takes O(N^2) messages? So an average per-message delay `t` would result in a total delay of `t * n * n`?
In your example, a delay of 500 microseconds would result in `0.0005 * 150 * 150` = 11.25 seconds total delay?
I don't get the `n^2` part in the math. To me, there are n^2 messages to exchange, because each node is sending `n-1` messages to other nodes. However, I think each node is sending their messages concurrently, so a delay `d` added to send each message would only result in a total delay of `n * d` rather than `n^2 * d`. No?
What this PR is trying to do is spread out the CPU-intensive operations each node must perform in the DKG, not to spread out the actual broadcast messages. The broadcast messages all get sent at more or less the same time. Each node will receive `n-1` broadcast messages, then will stagger the processing of them to minimize the impact on the rest of the system.
The reason we use `n^2` is because the cost for a node to process an individual broadcast message scales quadratically with the number of nodes, and we want the aggregate amount of jitter introduced to be proportional to the aggregate amount of time spent processing broadcast messages.
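As a rough illustration of that staggering (hypothetical names and types, not the controller's actual code), each node sleeps for a freshly sampled delay before processing each received broadcast:

```go
package main

import (
	"math/rand"
	"time"
)

// dkgBroadcast stands in for a phase 1 broadcast message (hypothetical type).
type dkgBroadcast struct{ origin int }

// processBroadcasts sketches the staggering described above: broadcasts all
// arrive at roughly the same time, but the node sleeps for a fresh random
// delay before processing each one, so the CPU-heavy processing is spread
// out over the phase rather than happening in one burst.
func processBroadcasts(msgs []dkgBroadcast, maxDelay time.Duration, process func(dkgBroadcast)) {
	for _, msg := range msgs {
		delay := time.Duration(rand.Int63n(int64(maxDelay) + 1))
		time.Sleep(delay) // jitter before the expensive processing step
		process(msg)
	}
}

func main() {
	msgs := make([]dkgBroadcast, 5)
	processBroadcasts(msgs, 50*time.Millisecond, func(dkgBroadcast) { /* expensive work */ })
}
```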
The duration is specified in the smart contract config, so it is tricky to add reliable sanity checks. We could for instance add a sanity check to the […].
What I would like to do instead, though I feel it's out of scope for this PR and for the upcoming spork, is to adjust the delays introduced based on the expected number of messages, the number of processed messages, and the number of views until the end of the phase. In other words, make the DKG participant software respond to the DKG parameters it is given as best it can under the relevant timing constraints, rather than trying to enforce that the DKG parameters are consistent with hard-coded delay values.
My intention was actually to use this value across all networks. This is the coefficient of the delay computation, not the actual delay value (i.e. 500µs is `b` in `m = b*n^2`).
500µs gives a max delay of 11.25s, which means processing the broadcasts would be spread out over 11.25s. To get a max delay of 20min, we would need a base […].
We are introducing this delay before each message. We don't have one delay that is on average 20min -- we have many delays which in aggregate average to a total delay of ~20min. From the comments: "We would like to target spreading this cost over a 20 minute period in the average case."
Now I understand, I was thinking the delays are concurrent and are sampled in [0, 20min]. The issue with the sequential small delays is that we lose the uniformity of delays: consider for instance incoming message number 2. Each node would process it after the sum of two random delays, and the sum of uniform delays is not uniform.
My suggestion would be to remove the dependency between the sleeps by running them concurrently (one thread for each sleep). In that case, the delay should be sampled in [0, max_delay] where max_delay is something like 20min. What do you think?
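A sketch of that concurrent variant (hypothetical names; this illustrates the suggestion above, not what the PR implements): each message gets an independently sampled delay and sleeps in its own goroutine, so the delays don't accumulate across messages.

```go
package main

import (
	"math/rand"
	"sync"
	"time"
)

// dkgBroadcast stands in for a phase 1 broadcast message (hypothetical type).
type dkgBroadcast struct{ origin int }

// processConcurrently sketches the suggested alternative: each message gets an
// independent delay sampled uniformly from [0, maxDelay] (e.g. ~20min) and
// sleeps in its own goroutine before being processed.
func processConcurrently(msgs []dkgBroadcast, maxDelay time.Duration, process func(dkgBroadcast)) {
	var wg sync.WaitGroup
	for _, msg := range msgs {
		wg.Add(1)
		go func(m dkgBroadcast) {
			defer wg.Done()
			time.Sleep(time.Duration(rand.Int63n(int64(maxDelay) + 1)))
			process(m)
		}(msg)
	}
	wg.Wait()
}

func main() {
	msgs := make([]dkgBroadcast, 5)
	processConcurrently(msgs, 100*time.Millisecond, func(dkgBroadcast) { /* expensive work */ })
}
```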
The reason I'm cautious about that approach is that it makes a stronger assumption about the timing of broadcast messages. Currently, if a few broadcast messages are sent later than expected in phase 1, we will wait at most a few extra seconds before processing those messages. The added delay is unlikely to cause us to not process them before the end of phase 1. If we select a delay in [0, ~20min] for a message that arrives late in the phase, we might not process it before the end of phase 1.
It is true that the likelihood of processing a specific message is not uniformly distributed over the 20min period. But I'm not sure that's the right thing to measure. We don't care whether node A and node B are processing a particular message M at the same time. We care about whether node A and node B are processing any message at the same time, or how many nodes are processing any message at the same time. Are we optimally spreading the number of nodes processing a message at a given time over the time available during phase 1 using this per-message delay? That I don't know (though probably not!), but it is close enough to mitigate the problem in the benchmarking.
Approving this PR to unblock the spork; the code has been tested on benchnet and the Grafana logs have been checked.
Two improvement items remain and could be implemented before a next spork:
- The current PR implements a delay that is quadratic in the number of nodes. This is not needed with the current jitter heuristic (additive random delays before processing each broadcast); a linear jitter is enough. If we switch to a linear delay, the constant values would need to be adjusted.
- The current delay heuristic leads to multiple nodes being busy at the same time (IMO). This happens only for successive broadcast messages, and it happens because the sum of uniform delays is not uniform. Two other possible heuristics have been discussed:
  - use a blocking delay for each message, sampled uniformly in [0, 30min] (or a larger window), as an async sleep (separate thread) before processing a broadcast.
  - use a random delay before the very first broadcast message, sampled randomly in [0, d] where d is relatively larger than a message processing time (2.5s), then use a constant delay before processing subsequent messages to free the CPU to work on HotStuff. Constant delays would keep the nodes' processing asynchronous, though this only works for broadcasts arriving early (messages arriving late would be processed at the same time by all nodes).
* introduce a uniform random delay once, before processing the first dkg message, to avoid the clustering of processing seen with the previous approach (sum of uniform random delays)
* introduce constant delays between subsequent dkg messages to avoid long continuous stretches of heavy resource usage by the dkg
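A sketch of that adopted scheme (hypothetical names and constants, not the actual controller code): one uniformly random delay before the first message, then a constant delay before each subsequent one.

```go
package main

import (
	"math/rand"
	"time"
)

// dkgBroadcast stands in for a phase 1 broadcast message (hypothetical type).
type dkgBroadcast struct{ origin int }

// handleWithHybridJitter sketches the scheme described above: sleep for a
// single uniformly random delay before processing the first broadcast, then a
// constant delay before each subsequent one, keeping nodes de-synchronised
// without long continuous stretches of heavy work.
func handleWithHybridJitter(msgs []dkgBroadcast, initialMax, subsequent time.Duration, process func(dkgBroadcast)) {
	for i, msg := range msgs {
		if i == 0 {
			time.Sleep(time.Duration(rand.Int63n(int64(initialMax) + 1)))
		} else {
			time.Sleep(subsequent)
		}
		process(msg)
	}
}

func main() {
	msgs := make([]dkgBroadcast, 3)
	handleWithHybridJitter(msgs, 10*time.Second, 2*time.Second, func(dkgBroadcast) { /* expensive processing */ })
}
```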
add test case for default delay base values
Looks good
Co-authored-by: Leo Zhang <zhangchiqing@gmail.com>
bors merge
Introduce random delays prior to expensive operations in the DKG:
- `Start` - ~700ms
- `HandleBroadcastMessage` - ~2.5s / message

Results: Link