
[Access node] implement new splitter engine #947

Merged 49 commits into master on Jul 20, 2021

Conversation

@synzhu (Contributor) commented Jul 12, 2021:

Implements a new splitter engine described in https://github.com/dapperlabs/flow-go/issues/5669.

We also create a new splitter network implementation that can be passed into other engines to allow multiple engines to register for messages on the same network channel.

closes dapperlabs/flow-go#5669
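
A hedged usage sketch of the splitter network described above; the NewNetwork constructor and its arguments are assumptions, while the Register signature matches the one quoted later in this conversation:

```go
package example

import (
	"github.com/rs/zerolog"

	splitternet "github.com/onflow/flow-go/engine/common/splitter/network"
	"github.com/onflow/flow-go/network"
)

// wireSharedChannel registers two engines on the same channel through a
// splitter network, so both receive every message arriving on that channel.
func wireSharedChannel(base network.Network, log zerolog.Logger, ch network.Channel, a, b network.Engine) error {
	// NewNetwork's signature is an assumption for illustration.
	sn := splitternet.NewNetwork(base, log)

	// Registering two engines on the same channel is the point of the
	// splitter; a plain network would reject the second registration.
	if _, err := sn.Register(ch, a); err != nil {
		return err
	}
	if _, err := sn.Register(ch, b); err != nil {
		return err
	}
	return nil
}
```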

turbolent and others added 4 commits July 12, 2021 11:33
Co-authored-by: turbolent <turbolent@users.noreply.github.com>
Co-authored-by: Janez Podhostnik <67895329+janezpodhostnik@users.noreply.github.com>
@vishalchangrani changed the base branch from master to smnzhu/network-multi-channel July 13, 2021 00:51
@onflow deleted a comment from codecov-commenter Jul 13, 2021
@AlexHentschel (Member) left a comment:

Not sure whether this suggestion is possible within the scope of this PR:

  • your multiplexer.Engine essentially only implements multiplexing on the Process and Submit methods, but not on SubmitLocal or ProcessLocal:

    • From a high-level, engines are vertices in a data flow graph. The fact that the networking layer can feed data into engines is an auxiliary functionality, but not the primary purpose of engines.
    • Your multiplexer.Engine is primarily an engine, but your implementation is focused on networking purposes only, neglecting other functions that are vital to an engine's interface (i.e. SubmitLocal or ProcessLocal).

    You are putting the channel as the engine's primary focus even though it should not be.

  • I think we have two options:

    1. multiplexer.Engine delegates all calls to the wrapped engines. I think this would be fine, as we can assemble any data flow pattern through multiplexers and engines that filter based on channel.
      • so instead of having one multiplexer that is "channel aware" (playmobil approach), create a multiplexer that only consumes messages for one channel. Thereby, the multiplexer can forward all calls to the engines it wraps. Then it would be truly implementing the Engine interface and multiplex all calls to the wrapped engines.
      • In the multiplexer.network you can create one dedicated multiplexer.Engine per channel (a sketch of this option follows below this list).
    2. We implement a multiplexer for the MessageProcessor interface and remove the Engine interface from the networking layer into the module package (there exists already an engine interface there, which is wrapping network.Engine)
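
A minimal sketch of option 1, assuming the network.Engine interface of the time (SubmitLocal, Submit, ProcessLocal, Process); all names here are illustrative, not the PR's actual code:

```go
package multiplexer

import (
	"sync"

	"github.com/onflow/flow-go/model/flow"
	"github.com/onflow/flow-go/network"
)

// Engine is bound to exactly one channel, so it can forward *every* call of
// the network.Engine interface to its wrapped engines without any channel
// bookkeeping. One such multiplexer would be created per channel.
type Engine struct {
	enginesMu sync.RWMutex     // protects engines
	engines   []network.Engine // the wrapped downstream engines
}

// Add registers another downstream engine with this multiplexer.
func (e *Engine) Add(downstream network.Engine) {
	e.enginesMu.Lock()
	defer e.enginesMu.Unlock()
	e.engines = append(e.engines, downstream)
}

// SubmitLocal is forwarded too: local events are not a networking concern,
// but they are part of the Engine interface and must be multiplexed as well.
func (e *Engine) SubmitLocal(event interface{}) {
	e.enginesMu.RLock()
	defer e.enginesMu.RUnlock()
	for _, eng := range e.engines {
		eng.SubmitLocal(event)
	}
}

// Submit follows the same pattern; ProcessLocal and Process would likewise
// delegate to the corresponding method of every wrapped engine.
func (e *Engine) Submit(channel network.Channel, originID flow.Identifier, event interface{}) {
	e.enginesMu.RLock()
	defer e.enginesMu.RUnlock()
	for _, eng := range e.engines {
		eng.Submit(channel, originID, event)
	}
}
```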

return e, nil
}

func (e *Engine) RegisterEngine(channel network.Channel, engine network.Engine) error {
A Member left a comment:

⚠️⚠️
This requires concurrency handling! With the current implementation, there is no guarantee whatsoever that a registered engine will ever receive any messages.
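
A minimal sketch of concurrency-safe registration, assuming hypothetical enginesMu and enginesByChannel fields:

```go
// RegisterEngine takes the write lock so that registration is safe against
// concurrent message dispatch; without it, an engine added while messages
// are being delivered could be silently dropped from the channel's set.
func (e *Engine) RegisterEngine(channel network.Channel, engine network.Engine) error {
	e.enginesMu.Lock()
	defer e.enginesMu.Unlock()
	e.enginesByChannel[channel] = append(e.enginesByChannel[channel], engine)
	return nil
}
```

Every dispatch path would then take the matching read lock before iterating the registered engines.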

(The Contributor Author's reply in this thread, on engine/common/multiplexer/engine.go, is marked outdated and resolved.)
@huitseeker (Contributor) left a comment:

This in general looks great; I left a few comments.

)

type Engine struct {
mu sync.RWMutex
A Contributor commented:

Nit: something to indicate the coverage of the Mutex might help, e.g. enginesMu


// process calls the given function in parallel for all the engines that have
// registered with this splitter.
func (e *Engine) process(f func(module.Engine) error) {
A Contributor commented:

Nit: f is the one argument I'd really like to track here; giving it a longer, more descriptive name would help.
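
A sketch of what the helper could look like with both nits applied (an enginesMu name and a descriptive parameter name), plus the error collection discussed later in this thread; the slice-based engines field is an assumption:

```go
import (
	"sync"

	"github.com/hashicorp/go-multierror"
	"github.com/onflow/flow-go/module"
)

// process snapshots the registered engines under the read lock, invokes
// processFunc on each in its own goroutine, blocks until all goroutines
// return, and combines their errors into a single multierror.
func (e *Engine) process(processFunc func(module.Engine) error) error {
	e.enginesMu.RLock()
	engines := make([]module.Engine, len(e.engines))
	copy(engines, e.engines)
	e.enginesMu.RUnlock()

	var (
		wg     sync.WaitGroup
		errMu  sync.Mutex // guards result
		result *multierror.Error
	)
	for _, eng := range engines {
		wg.Add(1)
		go func(downstream module.Engine) {
			defer wg.Done()
			if err := processFunc(downstream); err != nil {
				errMu.Lock()
				result = multierror.Append(result, err)
				errMu.Unlock()
			}
		}(eng)
	}
	wg.Wait()
	return result.ErrorOrNil()
}
```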

// Register will subscribe the given engine with the splitter on the given channel, and all registered
// engines will be notified with incoming messages on the channel.
// The returned Conduit can be used to send messages to engines on other nodes subscribed to the same channel.
func (n *Network) Register(channel network.Channel, e network.Engine) (network.Conduit, error) {
@huitseeker (Contributor) commented Jul 19, 2021:

OK, this may be a comment sitting across this PR and #984 but:

I see the appeal in a generic splitter, but I would also appreciate a strong godoc paragraph at the beginning of the engine that lays out where and when the splitter receives its channel processing obligations over time, and how it maintains them.

The topology of the splitter is clear, but to Vishal's point the channel responsibilities over the lifecycle of the component are a bit harder to tease out when channels are per-method parameters.

Another way to do this would be to write a mixed splitter / relay runnable example that would demonstrate this over time (especially with unsubscribes); this is something we can defer to a further PR.

@synzhu (Contributor, Author) commented Jul 19, 2021:

Thanks for the suggestions @AlexHentschel, I knew that the distinction between Process and Submit had to do with synchronous vs asynchronous processing, but I wasn't aware that the convention for Process is that all the logic runs in the calling thread.

I mostly agree with your point about the splitter engine's Submit method and will make those changes. Originally I had implemented it this way because it resulted in more concise code, since a single function implements most of the logic. There is one thing I don't think will be doable:

> SubmitLocal and Submit: any returned errors are logged as fatal.

The Submit method does not return anything (because the convention is that processing is asynchronous and errors are merely logged), so if the splitter engine's Submit calls into the Submit methods of the downstream engines, we will not be able to capture any downstream errors. The alternative is for the splitter engine's Submit to call into the downstream engines' Process methods, but this would require starting a new goroutine for each downstream engine. Ultimately, I don't think any of this is necessary, as convention already states that any caller who wants to know about unexpected problems should call the splitter engine's Process method rather than Submit in the first place.

As for the Process method, however, I'm not sure how I feel about calling each downstream engine sequentially in an unspecified order. I guess the long-term solution is to move towards a non-blocking API for the application layer, but currently, if we are to obey the conventions, there seems to be no way to create an engine that actually calls the processing logic on each downstream engine in parallel (which is arguably what we really want), because the network will always call the splitter's Process method.

I think a possible alternative here is to still start a separate goroutine for each downstream engine, but capture all the errors and return them in a multierror instead of logging them and returning nil. This would allow us to still satisfy the following:

> Therefore, the core business logic can return errors, which we should capture and propagate.

Furthermore, using the WaitGroup allows us to still block the caller until all of the engines are finished executing their processing logic. It seems like these two properties are the crux of what we really care about?
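
Concretely, this could look something like the following sketch (the module.Engine signatures and the e.log field are assumptions based on the interfaces discussed above, and e.process is the fan-out helper sketched earlier in this thread):

```go
// Process blocks the caller: every downstream engine's Process runs in its
// own goroutine, and all resulting errors come back combined as one
// multierror rather than being swallowed or logged piecemeal.
func (e *Engine) Process(channel network.Channel, originID flow.Identifier, event interface{}) error {
	return e.process(func(downstream module.Engine) error {
		return downstream.Process(channel, originID, event)
	})
}

// Submit keeps the asynchronous convention: it returns immediately, and
// since it cannot return an error, downstream failures are only logged
// (as fatal, per the quoted convention).
func (e *Engine) Submit(channel network.Channel, originID flow.Identifier, event interface{}) {
	go func() {
		if err := e.Process(channel, originID, event); err != nil {
			e.log.Fatal().Err(err).Msg("internal error processing event")
		}
	}()
}
```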

I remember this book saying that goroutines are very lightweight, and we normally don't have to worry about creating them since they add minimal overhead. In the case of the splitter engine's Process method, I'd argue the tradeoff between goroutine overhead and the processing time we could save by calling the downstream engines in parallel instead of sequentially is one worth making.

However, I don't have a strong opinion on this, given that we will probably be moving towards the MessageConsumer API eventually anyway.

@codecov-commenter commented Jul 20, 2021:

Codecov Report

Merging #947 (61e30f4) into master (e35d792) will increase coverage by 0.00%.
The diff coverage is 55.44%.


@@           Coverage Diff            @@
##           master     #947    +/-   ##
========================================
  Coverage   54.81%   54.81%            
========================================
  Files         277      279     +2     
  Lines       18548    18649   +101     
========================================
+ Hits        10167    10223    +56     
- Misses       7006     7047    +41     
- Partials     1375     1379     +4     
Flag Coverage Δ
unittests 54.81% <55.44%> (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown.

Impacted Files Coverage Δ
network/p2p/network.go 0.00% <ø> (ø)
engine/common/splitter/network/network.go 50.00% <50.00%> (ø)
engine/common/splitter/engine.go 60.37% <60.37%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update e35d792...61e30f4

@synzhu (Contributor, Author) commented Jul 20, 2021:

@AlexHentschel please take a look at the latest changes and lmk what you think

@huitseeker self-requested a review July 20, 2021 21:10
@AlexHentschel (Member) left a comment:

👍

@huitseeker (Contributor) left a comment:

I thought you could simplify the multierror logic with a multierror.Group (from multierror 1.1), but I'm not happy with the logic in the WaitGroup therein, and your approach is in the end fine.
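
For reference, a sketch of the multierror.Group variant (go-multierror v1.1+), assuming the engines are held in a slice: Group.Go spawns the goroutine, and Group.Wait replaces the hand-rolled WaitGroup and error collection.

```go
// process with multierror.Group: each call runs in its own goroutine, and
// Wait blocks until all goroutines return, combining their errors.
func (e *Engine) process(processFunc func(module.Engine) error) error {
	var group multierror.Group
	for _, eng := range e.engines {
		downstream := eng // capture loop variable for the closure (pre-Go 1.22)
		group.Go(func() error {
			return processFunc(downstream)
		})
	}
	return group.Wait().ErrorOrNil()
}
```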

@synzhu merged commit 62ce19f into master Jul 20, 2021
@synzhu deleted the smnzhu/multiplexer-engine branch July 20, 2021 23:19