persist downloaded atx identifiers and download them in background #5553

dshulyak · 2024-02-08T09:29:49Z

in the implementation we are using request to get the set of existing activations known by peer for two purposes:

download all activations during initial synchronization
download the difference with the peer

this request is not cheap, growth linearly with the size of activations and became very problematic with the growth of the number of activations. in many cases the number of such requests was due to the other problematic sync mechanism #5522. but even with that disabled we are running it on every restart for activations known in the last epoch every time node restart. if it fails node can't sync full activations, and if node is restarted it has to be retried all over again.

overview

once we successfully downloaded N sets of activation identitiers, we can persist them in the database. N can be 1, but can be also larger so that we can ask avoid asking poorly synced node. once they are saved we don't have to continuously ask peers on restarts, and can instead download activations from the set saved in the database.

all activations that target current epoch or below have to be downloaded before downloading ballots. but for the ongoing activations, that target next epoch, we can offload them to the background thread. this way they will not block "syncedness" progress on restart and will enable faster rejoin if node was offline for short period.

also in that same background thread we can ask random peer if they learned any more atxs. this can be done rarely e.g every 2 hours.

storing identifiers

we want to avoid asking nodes for possibly invalid identifiers, as it creates trivial dos opportunity. malicious nodes may create false set of activation identifiers, send them out to the network and make everyone ask for them repeatedly.

to prevent that we should track how many times activation was asked for and failed to be downloaded. and for example stop asking for it after we tried to download it 2 times. and reset this counter every time when someone advertises that his node knows about such identifier through get_epoch_info response. and we should prioritize asking peer that advertised such identifier, this is already implemented, but this information will be also lost on restart.

part of: #5553 when requested we ask configured number of peers for epoch info (collection of atxs from that epoch). on a successful response we save known ids, and will ask again only in 30 minutes (configurable). also on restart we check persisted data, and potentially avoiding eager queries, if last query was made close to the epoch end. concurrently with requesting epoch info updates, we will download atxs from peers. download is scheduled in batches, so that we can report progress. if peer advertised invalid atx id, we will evict such id after reaching max number of retries (20 in the pr). to make error checking possible i extended errors emitted by p2p/server and fetcher.

dshulyak added area/sync area/atx labels Feb 8, 2024

dshulyak self-assigned this Feb 8, 2024

dshulyak changed the title ~~persist download atx identifiers and update them in background~~ persist downloaded atx identifiers and update them in background Feb 8, 2024

dshulyak changed the title ~~persist downloaded atx identifiers and update them in background~~ persist downloaded atx identifiers and download them in background Feb 12, 2024

dshulyak removed their assignment Feb 12, 2024

dshulyak mentioned this issue Feb 13, 2024

DB compaction and slow/failing activations sync from genesis on every startup #5415

Closed

dshulyak self-assigned this Feb 23, 2024

This was referenced Feb 24, 2024

[Merged by Bors] - atx syncer that persists results #5599

Closed

[Merged by Bors] - download atxs from current epoch in background without blockinng syncedness #5600

Closed

spacemesh-bors bot closed this as completed in f133cba Mar 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

persist downloaded atx identifiers and download them in background #5553

persist downloaded atx identifiers and download them in background #5553

dshulyak commented Feb 8, 2024 •

edited

Loading

persist downloaded atx identifiers and download them in background #5553

persist downloaded atx identifiers and download them in background #5553

Comments

dshulyak commented Feb 8, 2024 • edited Loading

overview

storing identifiers

dshulyak commented Feb 8, 2024 •

edited

Loading