[Merged by Bors] - Improve eth1 block sync #2008

paulhauner · 2020-11-29T07:57:00Z

Issue Addressed

NA

Proposed Changes

Log about eth1 whilst waiting for genesis.
For the block and deposit caches, update them after each download instead of when all downloads are complete.
- This prevents the case where a single timeout error can cause us to drop all previously download blocks/deposits.
Set max_log_requests_per_update to avoid timeouts due to very large log counts in a response.
Set max_blocks_per_update to prevent a single update of the block cache to download an unreasonable number of blocks.
- This shouldn't have any affect in normal use, it's just a safe-guard against bugs.
Increase the timeout for eth1 calls from 15s to 60s, as per @pawanjay176's experience with Infura.

Additional Info

NA

michaelsproul · 2020-11-30T05:53:41Z

beacon_node/client/src/notifier.rs

+                        head_info
+                            .genesis_time
+                            .saturating_sub(voting_periods_past * voting_period_duration)


I think this should subtract the follow distance as well, otherwise we expect our cache to contain the block from the start of the (virtual) voting period rather than the latest block ETH1_FOLLOW_DISTANCE behind the start of the virtual voting period

beacon_node/client/src/notifier.rs

pawanjay176 · 2020-11-30T06:42:28Z

beacon_node/eth1/src/service.rs

@@ -27,7 +27,7 @@ pub const DEFAULT_NETWORK_ID: Eth1Id = Eth1Id::Goerli;
 /// Indicates the default eth1 chain id we use for the deposit contract.
 pub const DEFAULT_CHAIN_ID: Eth1Id = Eth1Id::Goerli;

-const STANDARD_TIMEOUT_MILLIS: u64 = 15_000;
+const STANDARD_TIMEOUT_MILLIS: u64 = 60_000;


I think the higher timeout could be applied to just the GET_DEPOSIT_LOG_TIMEOUT_MILLIS.
Other calls are lightweight and them taking > 15 secs could imply bad health of the eth1 nodes.

pawanjay176 · 2020-11-30T07:33:07Z

beacon_node/eth1/src/service.rs

+            //
+            // This prevents the block downloading routine for running for a very long time and
+            // starving the deposit cache updater.
+            if Instant::now().duration_since(start_instant) > max_runtime {


I'm not sure I understand why we can't complete the blocks downloads before going back to deposits.

Even if we go back to updating deposits every 14secs, we usually won't get more than 1-2 new logs every update and this would slow down the block download.

I'm not sure I understand why we can't complete the blocks downloads before going back to deposits.

I suspect we could do this and I had a look at it, but it was sufficiently complicated that I wasn't confident to do it before mainnet :)

I meant we could just remove the
if Instant::now().duration_since(start_instant) > max_runtime
check and let all the block downloads complete. If we get an error somewhere in between, we start from that point in the next update.

I had some reasoning behind this, but it was loosely held. I think I'll just follow your suggestion and leave it out :)

paulhauner · 2020-11-30T10:25:41Z

All comments addressed!

michaelsproul

LGTM!

paulhauner · 2020-11-30T20:28:59Z

Thanks reviewers!

bors r+

@pawanjay176

## Issue Addressed NA ## Proposed Changes - Log about eth1 whilst waiting for genesis. - For the block and deposit caches, update them after each download instead of when *all* downloads are complete. - This prevents the case where a single timeout error can cause us to drop *all* previously download blocks/deposits. - Set `max_log_requests_per_update` to avoid timeouts due to very large log counts in a response. - Set `max_blocks_per_update` to prevent a single update of the block cache to download an unreasonable number of blocks. - This shouldn't have any affect in normal use, it's just a safe-guard against bugs. - Increase the timeout for eth1 calls from 15s to 60s, as per @pawanjay176's experience with Infura. ## Additional Info NA

bors · 2020-11-30T21:45:56Z

Pull request successfully merged into unstable.

Build succeeded:

## Issue Addressed NA ## Proposed Changes - Set version to `v1.0.3` - Run cargo update ## Additional Info - ~~Blocked on #2008~~

paulhauner added the work-in-progress PR is a work-in-progress label Nov 29, 2020

paulhauner added 3 commits November 30, 2020 11:13

Log eth1 prior to genesis, fix block imports

df6ea1a

Change eth1 syncing message

aacf725

Incrementally import deposit logs

0155e1d

paulhauner force-pushed the eth1-sync branch from 2146954 to 0155e1d Compare November 30, 2020 00:19

paulhauner added 2 commits November 30, 2020 11:32

Add comments, extend block download runtime

282999b

Add more comments, tidy

2f04835

paulhauner changed the base branch from stable to unstable November 30, 2020 01:02

Increase timeout on eth1 calls

ad10afd

paulhauner added ready-for-review The code is ready for review and removed work-in-progress PR is a work-in-progress labels Nov 30, 2020

paulhauner marked this pull request as ready for review November 30, 2020 04:36

michaelsproul reviewed Nov 30, 2020

View reviewed changes

beacon_node/client/src/notifier.rs Show resolved Hide resolved

pawanjay176 reviewed Nov 30, 2020

View reviewed changes

paulhauner added 4 commits November 30, 2020 20:28

Address review comments

c3f4679

Tidy, fix compile errors

669ecc7

Add to FAQ

caf64da

Remove max_runtime

1d3ca40

AgeManning mentioned this pull request Nov 30, 2020

ETH1 Deposits timeout when retrieved for mainnet pre genesis #1980

Closed

pawanjay176 approved these changes Nov 30, 2020

View reviewed changes

michaelsproul approved these changes Nov 30, 2020

View reviewed changes

paulhauner mentioned this pull request Nov 30, 2020

[Merged by Bors] - Bump version to v1.0.3 #2024

Closed

bors bot changed the title ~~Improve eth1 block sync~~ [Merged by Bors] - Improve eth1 block sync Nov 30, 2020

bors bot closed this Nov 30, 2020

michaelsproul deleted the eth1-sync branch November 30, 2020 22:35

bors bot pushed a commit that referenced this pull request Nov 30, 2020

Bump version to v1.0.3 (#2024)

65dcdc3

## Issue Addressed NA ## Proposed Changes - Set version to `v1.0.3` - Run cargo update ## Additional Info - ~~Blocked on #2008~~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Merged by Bors] - Improve eth1 block sync #2008

[Merged by Bors] - Improve eth1 block sync #2008

paulhauner commented Nov 29, 2020 •

edited

michaelsproul Nov 30, 2020

pawanjay176 Nov 30, 2020

pawanjay176 Nov 30, 2020

paulhauner Nov 30, 2020

pawanjay176 Nov 30, 2020

paulhauner Nov 30, 2020

paulhauner commented Nov 30, 2020

michaelsproul left a comment

paulhauner commented Nov 30, 2020

bors bot commented Nov 30, 2020

[Merged by Bors] - Improve eth1 block sync #2008

[Merged by Bors] - Improve eth1 block sync #2008

Conversation

paulhauner commented Nov 29, 2020 • edited

Issue Addressed

Proposed Changes

Additional Info

michaelsproul Nov 30, 2020

Choose a reason for hiding this comment

pawanjay176 Nov 30, 2020

Choose a reason for hiding this comment

pawanjay176 Nov 30, 2020

Choose a reason for hiding this comment

paulhauner Nov 30, 2020

Choose a reason for hiding this comment

pawanjay176 Nov 30, 2020

Choose a reason for hiding this comment

paulhauner Nov 30, 2020

Choose a reason for hiding this comment

paulhauner commented Nov 30, 2020

michaelsproul left a comment

Choose a reason for hiding this comment

paulhauner commented Nov 30, 2020

bors bot commented Nov 30, 2020

paulhauner commented Nov 29, 2020 •

edited