
Extract warp sync strategy from ChainSync #2467

Merged — 60 commits, Jan 12, 2024
Conversation

@dmitry-markin (Contributor) commented Nov 23, 2023

Extract WarpSync (and StateSync as part of warp sync) from ChainSync as independent syncing strategy called by SyncingEngine. Introduce SyncingStrategy enum as a proxy between SyncingEngine and specific syncing strategies.

Limitations

Gap sync is kept in ChainSync for now because it shares the same set of peers as block syncing implementation in ChainSync. Extraction of a common context responsible for peer management in syncing strategies able to run in parallel is planned for a follow-up PR.

Further improvements

A possibility of converting SyncingStrategy into a trait should be evaluated. The main blocker for this is that different strategies need to communicate different actions to SyncingEngine and respond to different events / provide different APIs (e.g., requesting justifications is only possible via ChainSync and not through WarpSync; the SendWarpProofRequest action is only relevant to WarpSync, etc.)
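The enum-as-proxy dispatch described above can be sketched roughly as follows. This is a simplified illustration, not the real polkadot-sdk definitions: the strategy structs are empty placeholders and the `status` method is hypothetical.

```rust
// Placeholder strategy types standing in for the real `WarpSync` / `ChainSync`
// (hypothetical, simplified — the real types carry client handles, peer state, etc.).
struct WarpSync;
struct ChainSync;

impl WarpSync {
    fn status(&self) -> &'static str { "warp: downloading proofs" }
}
impl ChainSync {
    fn status(&self) -> &'static str { "chain: downloading blocks" }
}

// The proxy enum: one variant per concrete strategy. SyncingEngine only
// ever talks to this enum, which forwards each call to the active strategy.
enum SyncingStrategy {
    WarpSyncStrategy(WarpSync),
    ChainSyncStrategy(ChainSync),
}

impl SyncingStrategy {
    // Every engine-facing call is a match that delegates to the inner strategy.
    fn status(&self) -> &'static str {
        match self {
            SyncingStrategy::WarpSyncStrategy(s) => s.status(),
            SyncingStrategy::ChainSyncStrategy(s) => s.status(),
        }
    }
}

fn main() {
    let strategies = [
        SyncingStrategy::WarpSyncStrategy(WarpSync),
        SyncingStrategy::ChainSyncStrategy(ChainSync),
    ];
    for s in &strategies {
        println!("{}", s.status());
    }
}
```

The enum form (rather than a trait object) is what makes the "different strategies need different actions and APIs" limitation tolerable: each match arm can expose strategy-specific behaviour.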

@dmitry-markin dmitry-markin added the T0-node This PR/Issue is related to the topic “node”. label Nov 23, 2023
@dmitry-markin dmitry-markin marked this pull request as draft November 23, 2023 13:48
@altonen (Contributor) left a comment:

Very nice work! I left a bunch of comments but apart from state sync still partly being in ChainSync and the inability to share BlocksCollection between strategies (which can be addressed in a follow-up), I think it's looking good

Comment on lines +390 to +393:

```rust
let warp_sync_progress = self.gap_sync.as_ref().map(|gap_sync| WarpSyncProgress {
	phase: WarpSyncPhase::DownloadingBlocks(gap_sync.best_queued_number),
	total_bytes: 0,
});
```
Contributor:

I don't think this belongs here, but I understand that in practice it might be harder to refactor out. We need to find a way to extract BlocksCollection/peer download states and share them between strategies. This would be needed to run Sync 1.0 and 2.0 in parallel.

Contributor Author:

No sharing of BlockCollection yet as it's not needed to run GapSync in parallel with ChainSync (it has its own instance of BlockCollection), but here is a draft PR with the initial implementation of sharing of peers between the strategies: #2814.

I don't think time should be invested now into reviewing it as a lot could change, but hopefully it shows the general idea of what a follow-up PR may look like.

Outdated review threads (resolved): substrate/client/network/sync/src/chain_sync.rs, substrate/client/network/sync/src/engine.rs (×2), substrate/client/network/sync/src/warp.rs (×5).
Co-authored-by: Aaro Altonen <48052676+altonen@users.noreply.github.com>
@dmitry-markin (Contributor, Author):

> Very nice work! I left a bunch of comments but apart from state sync still partly being in ChainSync and the inability to share BlocksCollection between strategies (which can be addressed in a follow-up), I think it's looking good

Another thing that should be shared between the strategies running in parallel similarly to BlocksCollection is the set of peers and their states. Unless we accept that it's OK to treat peers independently in different strategies and issue parallel requests to them.

@altonen (Contributor) commented Nov 30, 2023

> Unless we accept that it's OK to treat peers independently in different strategies and issue parallel requests to them.

I don't think it's a good idea. All strategies should have the same view of connected peers. We can't have two simultaneous in-flight requests to any peer, so their states have to be synchronized across strategies.

If we introduce some kind of peer + blocks collection which is shared between strategies using Arc<Mutex<Context>> (and maybe some nice API on top of it), that would probably work.
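A minimal sketch of that shared-context idea, assuming an `Arc<Mutex<..>>` around a peer table; all names here (`SyncContext`, `try_reserve`, `release`) are hypothetical illustrations, not the actual polkadot-sdk API.

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};

// Hypothetical shared syncing context: one table of per-peer request state
// that all strategies consult, so two strategies can never have simultaneous
// in-flight requests to the same peer.
#[derive(Default)]
struct SyncContext {
    // peer id -> whether a request is currently in flight to that peer
    in_flight: HashMap<u64, bool>,
}

impl SyncContext {
    // Reserve a peer for a request; fails if any strategy already reserved it.
    fn try_reserve(&mut self, peer: u64) -> bool {
        let slot = self.in_flight.entry(peer).or_insert(false);
        if *slot {
            false
        } else {
            *slot = true;
            true
        }
    }

    // Called when the response arrives (or the request times out).
    fn release(&mut self, peer: u64) {
        self.in_flight.insert(peer, false);
    }
}

fn main() {
    // Both strategies would hold a clone of this handle.
    let ctx = Arc::new(Mutex::new(SyncContext::default()));

    // Strategy A reserves peer 1; strategy B then cannot until A releases it.
    assert!(ctx.lock().unwrap().try_reserve(1));
    assert!(!ctx.lock().unwrap().try_reserve(1));
    ctx.lock().unwrap().release(1);
    assert!(ctx.lock().unwrap().try_reserve(1));
    println!("peer reservation works");
}
```

A nicer API on top (as suggested) would hide the lock entirely, e.g. handing out an RAII reservation guard that releases the peer on drop.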

Review threads: substrate/client/network/sync/src/engine.rs (outdated, resolved), substrate/client/network/sync/src/strategy.rs (×2), substrate/client/network/test/src/sync.rs.
```rust
#[must_use]
pub fn actions(&mut self) -> Box<dyn Iterator<Item = SyncingAction<B>>> {
	match self {
		SyncingStrategy::WarpSyncStrategy(strategy) =>
```
Contributor:

Nit: Just for readability, maybe a good place for an Into implementation? But just a matter of taste.

Contributor Author:

Good point. I'll do it in the follow-up PR #2814, though, to avoid complicating rebasing.
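The suggested Into conversion might look roughly like this; the action enums below are stripped-down placeholders (the real ones carry peer IDs, request payloads, etc.), so treat this as a sketch of the pattern only.

```rust
// Placeholder strategy-level action (hypothetical, simplified).
#[derive(Debug, PartialEq)]
enum WarpSyncAction {
    SendWarpProofRequest(u32),
}

// Placeholder engine-level action (hypothetical, simplified).
#[derive(Debug, PartialEq)]
enum SyncingAction {
    SendWarpProofRequest(u32),
}

// A From impl centralizes the strategy-action -> engine-action mapping,
// so `actions()` can shrink to `strategy.actions().map(Into::into)`.
impl From<WarpSyncAction> for SyncingAction {
    fn from(action: WarpSyncAction) -> Self {
        match action {
            WarpSyncAction::SendWarpProofRequest(peer) =>
                SyncingAction::SendWarpProofRequest(peer),
        }
    }
}

fn main() {
    let actions = vec![WarpSyncAction::SendWarpProofRequest(7)];
    let engine_actions: Vec<SyncingAction> =
        actions.into_iter().map(Into::into).collect();
    assert_eq!(engine_actions, vec![SyncingAction::SendWarpProofRequest(7)]);
    println!("converted {} action(s)", engine_actions.len());
}
```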

@michalkucharczyk (Contributor):

lgtm, left some nits.

@dmitry-markin dmitry-markin added this pull request to the merge queue Jan 12, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to no response for status checks Jan 12, 2024
@dmitry-markin dmitry-markin added this pull request to the merge queue Jan 12, 2024
Merged via the queue into master with commit 5208bed Jan 12, 2024
120 checks passed
@dmitry-markin dmitry-markin deleted the dm-warp-sync-strategy branch January 12, 2024 16:47
```rust
use warp::{EncodedProof, WarpProofRequest, WarpSync, WarpSyncAction, WarpSyncConfig};

/// Corresponding `ChainSync` mode.
fn chain_sync_mode(sync_mode: SyncMode) -> ChainSyncMode {
```
Contributor:

Is this function really necessary? It reverts paritytech/substrate#14465, and the fact that there are two types again makes it painful to backport paritytech/substrate#14482 (whose goal is to allow combining Substrate's sync with a custom sync mechanism we have in Subspace).

Can this be replaced with a method on SyncMode that returns true for both SyncMode::Full and SyncMode::Warp? Or match the enum against the two possible variants explicitly.
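The reviewer's suggestion can be sketched as a predicate on SyncMode. Both the variant set shown and the method name `downloads_full_blocks` are illustrative assumptions, not the actual polkadot-sdk definitions.

```rust
// Illustrative SyncMode (hypothetical variant set for the sketch).
#[derive(Debug, Clone, Copy, PartialEq)]
enum SyncMode {
    Full,
    LightState,
    Warp,
}

impl SyncMode {
    // One predicate instead of a parallel ChainSyncMode type: true for the
    // modes that end up downloading full blocks (hypothetical method name).
    fn downloads_full_blocks(&self) -> bool {
        matches!(self, SyncMode::Full | SyncMode::Warp)
    }
}

fn main() {
    assert!(SyncMode::Full.downloads_full_blocks());
    assert!(SyncMode::Warp.downloads_full_blocks());
    assert!(!SyncMode::LightState.downloads_full_blocks());
    println!("predicate holds for Full and Warp only");
}
```

The `matches!` macro keeps this a single expression while still being an explicit two-variant match, which is the other option the reviewer mentions.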

Contributor Author:

What this and upcoming PRs aim to achieve is to clearly separate syncing strategies like Warp, State, ChainSync, etc. Introducing an atomic variable accessible by ChainSync that refers to warp sync would be going in the opposite direction.

On the other hand, once the refactoring is over, it should be possible to plug in custom sync as a separate strategy and switch to ChainSync only when the initial sync is over, without the need to pause ChainSync. Maybe this will suit you?

Contributor:

Yes, that sounds like exactly what I need. Not sure if it will be flexible enough, but definitely sounds promising.

bgallois pushed a commit to duniter/duniter-polkadot-sdk that referenced this pull request Mar 25, 2024