
rewrite pull syncer #1451

Closed
19 of 22 tasks
acud opened this issue Jun 11, 2019 · 1 comment · Fixed by #1538

Assignees: acud
Labels: meta, priority, stability

acud commented Jun 11, 2019

The pull syncer has a few edge-case bugs that are very hard to trace and debug.

We sometimes do not get chunks at the expected nodes, and the current syncer infrastructure makes it very difficult to support these debugging efforts.

work outline:

  • submit an initial spec to iterate on top of
  • write the protocol boilerplate for initialising a new protocol over the devp2p network
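
For the protocol boilerplate item, a minimal sketch of how the new protocol could be declared with go-ethereum's p2p package; the protocol name, version and message-code length here are placeholders to be settled by the spec, not the actual values.

```go
package pullsync

import (
	"github.com/ethereum/go-ethereum/p2p"
)

// Protocol returns the devp2p protocol descriptor under which the new
// pull syncer would be registered. The run callback receives the peer
// and its message pipe once the protocol handshake completes.
func Protocol(run func(*p2p.Peer, p2p.MsgReadWriter) error) p2p.Protocol {
	return p2p.Protocol{
		Name:    "pullsync", // hypothetical protocol name
		Version: 1,
		Length:  8, // number of message codes reserved for the protocol
		Run:     run,
	}
}
```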

sync:

  • retrieve stream cursors upon node connection
  • drop cursors when a node moves out of depth
  • establish streams inside the nearest neighbourhood (NN) according to kademlia depth
  • make sure that stream cancellations happen on depth change
  • debounce mechanism for depth changes (sketched below)
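
For the debounce item, a rough sketch of a timer-based debouncer that coalesces bursts of depth-change notifications before streams are re-evaluated; the type and function names are illustrative, not part of any existing API.

```go
package pullsync

import (
	"sync"
	"time"
)

// debouncer coalesces bursts of depth-change notifications so that
// streams are only (re)established once the topology settles.
type debouncer struct {
	mu    sync.Mutex
	timer *time.Timer
	wait  time.Duration
	fn    func() // e.g. re-evaluate cursors and streams for the new depth
}

func newDebouncer(wait time.Duration, fn func()) *debouncer {
	return &debouncer{wait: wait, fn: fn}
}

// trigger restarts the timer; fn runs only after wait has elapsed
// without another trigger arriving.
func (d *debouncer) trigger() {
	d.mu.Lock()
	defer d.mu.Unlock()
	if d.timer != nil {
		d.timer.Stop()
	}
	d.timer = time.AfterFunc(d.wait, d.fn)
}
```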

sync-localstore-intervals:

  • make sure closed intervals are always delivered from localstore pull
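
To illustrate the closed-intervals requirement, a simplified sketch of turning a localstore pull batch into a closed interval; the channel-based subscription and the batch cap are stand-ins for the real localstore API.

```go
package pullsync

// interval is a closed range [From, To] of bin IDs. Only closed intervals
// are persisted, so that syncing can later resume without leaving gaps.
type interval struct {
	From, To uint64
}

// collectBatch drains descriptors from a localstore pull subscription
// (modelled here as a plain channel of bin IDs) and returns the collected
// IDs together with the closed interval they cover. max caps the batch so
// the interval always has a definite end.
func collectBatch(from uint64, binIDs <-chan uint64, max int) ([]uint64, interval) {
	i := interval{From: from, To: from}
	var ids []uint64
	for id := range binIDs {
		ids = append(ids, id)
		if id > i.To {
			i.To = id
		}
		if len(ids) >= max {
			break
		}
	}
	return ids, i
}
```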

get stream < cursor (history priority):

  • test case for continuous intervals (no gaps); see the assertion sketch after this list
  • test case for missing intervals (enclosing interval should still persist)
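
A sketch of the no-gaps assertion referenced in the first item above; intervals are represented as [from, to] pairs recorded for the stream under test.

```go
package pullsync

import (
	"sort"
	"testing"
)

// assertNoGaps fails the test if the recorded closed intervals, once
// sorted by start, do not form a single continuous range.
func assertNoGaps(t *testing.T, intervals [][2]uint64) {
	t.Helper()
	sort.Slice(intervals, func(a, b int) bool { return intervals[a][0] < intervals[b][0] })
	for i := 1; i < len(intervals); i++ {
		if prevTo, from := intervals[i-1][1], intervals[i][0]; from > prevTo+1 {
			t.Fatalf("gap between intervals: previous ends at %d, next starts at %d", prevTo, from)
		}
	}
}
```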

get stream > cursor (live priority):

  • test case for continuous intervals (no gaps)
  • test case for missing intervals (enclosing interval should still persist); difficult to test since intervals are fetched faster than they can be erased to create gaps, so this item is on hold

consistency:

  • test case for no duplicate chunks delivered (see the assertion sketch after this list)
  • check that there is no overlap between historical and live streaming
  • make the "sync bins within depth" feature toggle configurable and write test cases that validate syncing with it both on and off
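
A sketch of the duplicate-chunk assertion referenced above; chunk addresses are plain strings here for brevity, where the real test would use the chunk address type.

```go
package pullsync

import "testing"

// assertNoDuplicates fails if any chunk address was delivered more than once.
func assertNoDuplicates(t *testing.T, delivered []string) {
	t.Helper()
	seen := make(map[string]int, len(delivered))
	for _, addr := range delivered {
		seen[addr]++
		if seen[addr] > 1 {
			t.Fatalf("chunk %s delivered %d times", addr, seen[addr])
		}
	}
}
```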

cluster/snapshot:

  • test that chunks are sent correctly in a star topology (adapt the second test from syncer_test.go in the existing stream package)
  • test that 3 nodes with full connectivity sync between them and that each node's localstore ends up holding the union of all three nodes' localstores (see the assertion sketch after this list)
  • test that chunks are synced correctly in a larger full topology with discovery; this test vector needs more description
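
A sketch of the union assertion for the 3-node test referenced above; modelling chunk sets as address maps is an assumption made only for this sketch.

```go
package pullsync

import "testing"

// assertUnion checks that every node's final chunk set equals the union of
// the chunk sets originally uploaded to all nodes.
func assertUnion(t *testing.T, uploaded, final []map[string]struct{}) {
	t.Helper()
	union := make(map[string]struct{})
	for _, set := range uploaded {
		for addr := range set {
			union[addr] = struct{}{}
		}
	}
	for i, set := range final {
		if len(set) != len(union) {
			t.Fatalf("node %d has %d chunks, want %d", i, len(set), len(union))
		}
		for addr := range union {
			if _, ok := set[addr]; !ok {
				t.Fatalf("node %d is missing chunk %s", i, addr)
			}
		}
	}
}
```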

resilience:

  • guarantee that there is always a live fetch with an unbounded stream
  • check that existing historical and live stream fetchers terminate when the depth changes and the node moves out of depth
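
A sketch of how fetcher termination on depth change could be wired with context cancellation; the fetcher type and the outOfDepth predicate are illustrative names, not existing APIs.

```go
package pullsync

import "context"

// fetcher tracks one running historical or live stream fetch; cancelling
// its context must make the fetch loop return.
type fetcher struct {
	cancel context.CancelFunc
}

// onDepthChange cancels and removes every fetcher whose peer has fallen
// outside the new neighbourhood depth.
func onDepthChange(fetchers map[string]*fetcher, outOfDepth func(peer string) bool) {
	for peer, f := range fetchers {
		if outOfDepth(peer) {
			f.cancel()
			delete(fetchers, peer)
		}
	}
}
```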

optimisations/benchmarking:

  • is it faster to concurrently deliver 3000 chunks in 3000 separate messages between two nodes, or is it more effective to send one message with 3000 chunks? this should be easily measurable (see the benchmark sketch below)
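
A sketch of the shape such a benchmark could take; sendChunks is a placeholder for the real message-sending call, and the real measurement would run against two simulated nodes rather than a stub.

```go
package pullsync

import "testing"

const nChunks = 3000

// sendChunks stands in for the protocol call that delivers a batch of
// chunks to a peer in one message.
func sendChunks(chunks [][]byte) error { return nil }

// BenchmarkOneMessage sends all chunks in a single message per iteration.
func BenchmarkOneMessage(b *testing.B) {
	chunks := makeChunks(nChunks)
	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		if err := sendChunks(chunks); err != nil {
			b.Fatal(err)
		}
	}
}

// BenchmarkManyMessages sends one chunk per message; the real test could
// additionally fan the sends out concurrently, as the question suggests.
func BenchmarkManyMessages(b *testing.B) {
	chunks := makeChunks(nChunks)
	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		for _, c := range chunks {
			if err := sendChunks([][]byte{c}); err != nil {
				b.Fatal(err)
			}
		}
	}
}

func makeChunks(n int) [][]byte {
	chunks := make([][]byte, n)
	for i := range chunks {
		chunks[i] = make([]byte, 4096) // default swarm chunk size
	}
	return chunks
}
```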

tooling:

  • add assert for smoke tests to know when syncing is done
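
A sketch of what such an assertion could look like as a polling helper; doneSyncing is a placeholder for whatever signal the new syncer ends up exposing.

```go
package pullsync

import (
	"fmt"
	"time"
)

// waitSynced polls doneSyncing until it reports true or the timeout expires,
// so smoke tests can assert deterministically that syncing has finished
// instead of sleeping for a fixed duration.
func waitSynced(doneSyncing func() bool, timeout time.Duration) error {
	deadline := time.Now().Add(timeout)
	for time.Now().Before(deadline) {
		if doneSyncing() {
			return nil
		}
		time.Sleep(100 * time.Millisecond)
	}
	return fmt.Errorf("syncing not finished after %s", timeout)
}
```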

logging:

acud self-assigned this Jun 11, 2019

nonsense commented:

Related: ethersphere/user-stories#16
