Tracking: Staged sync #40

onbjerg · 2022-10-11T12:26:49Z

Stage abstraction

This abstraction should be mostly done, pending changes related to how the database abstractions evolve - e.g. instead of taking a raw MDBX transaction, we will likely receive another type in the future.

Pipeline

Better unwind priorities (@onbjerg): The current unwind priority system is based on Akula's method, but it can and should be simplified to prevent footgunning
Error and skip events (@onbjerg): The pipeline emits events that are currently only used for testing, but may be useful later on for metrics or other things. In some cases Ran and Unwound events are emitted with "special" values that denote that a stage either failed or was skipped. We should just add events for these cases
Commit intervals (@onbjerg): Currently data is committed to the database every time a stage returns from Stage::execute, but realistically this behavior should be tuneable to only commit meaningful progress

Tooling

Benchmarking helpers: We want to benchmark stages, so we will probably end up needing some utilities to make that easier
Profiling: We want insight into what the stages are doing to find paths to optimize. Currently we use tracing to mark out spans and emit events - we might be able to leverage this info in conjunction with e.g. tracing_tracy to be able to use Tracy. However, there may be tools that are better suited for profiling in our case.
Metrics: While not only a thing for staged sync (we need them in general), tools to expose metrics should be provided as well.

Stages

Initially we will use the good learnings from Akula, which is based on good learnings from Silkworm and Erigon, and essentially delineate the stages around the same boundaries as they have. As we progress, we might need more stages than listed here (or fewer).

For the more complex stages I propose we create separate tracking issues that link back to this one.

These stages are generally what I would categorize as indexes, which we may be able to generalize somewhat. ↩ ↩² ↩³ ↩⁴ ↩⁵ ↩⁶ ↩⁷

The text was updated successfully, but these errors were encountered:

gakonst · 2022-10-12T22:02:16Z

Great breakdown, agree on all. @rkrasiuk can you pls open an issue for Headers stage? And let's open them one by one as we pursue them. @rakita opened #39 which is relevant to general eth testing of stages, Dragan do you want to open another specific one for how you're going to be approaching the Executor + Execution Stage?

gakonst · 2022-10-14T06:07:57Z

@onbjerg Noticing there's a bunch of stages in erigon not present here, WDYT? e.g. https://github.com/ledgerwatch/erigon/tree/devel/eth/stagedsync#stage-15-transaction-pool-stage

onbjerg · 2022-10-14T09:25:19Z

It seems we're only missing 2? Transpilation stage, which we can't have because we don't have anything like TEVM, and the txpool stage, but we talked about having block building be a separate part since it's a bit more involved (might be custom, flashbots etc etc) so I don't think having that stage makes sense for us. Instead the block building part will just push down the block elsewhere through the pipeline

rakita · 2022-10-14T12:24:13Z

I added an additional trackng issue and reworded existing one:

Tracking: Eth chain tests #39 I would need mockings of databases and p2p to pass through all stages. I am assuming that there will be some minor modifications to stages (As in header stage to simplify it) but I am not sure atm extent of them. But i like idea of using chain tests to cover all stages.
Tracking: Execution/Validation of blocks #72 It is good to have execution and validation in one place. And there are additional functionalities that this module can give (As in building of blocks and execution of transactions). Utilities/functionalities would be aligned with the needs of stages.

onbjerg · 2022-10-14T12:25:09Z

I think in terms of #72 that would be in the consensus engine mostly, no? Or at least part of it @rakita

rakita · 2022-10-14T13:55:07Z

@onbjerg there are things that are common for all consensus types so that thing can be in reth-executor. I am not sure atm if consensus is going to call execution for additional verification or execution is going to call consensus we can see this later.

onbjerg · 2022-10-18T20:06:31Z

@rakita My point is - should these commonalities not be in a consensus crate (or a consensus-traits crate) instead of the execution crate? From what I've seen from e.g. Akula and Erigon, the stage calls consensus and the VM itself does not

rakita · 2022-10-18T21:44:55Z

I am not sure to be honest, consensus should contain only different consensus engines, for common things I mean block building, roots, execution etc. I would like to separate them into a standalone crate to have them in one place.

I see your point, for the stage side, to not complicate things maybe it is best just to use one trait Consensus and put any function that stages would need there, Consensus can just use whatever it needs internally.

onbjerg · 2023-01-23T15:49:26Z

Closing this as it is out of date - the stages have been shuffled around/merged/split etc. We need a few more stages, but those are handled in separate issues.

onbjerg assigned onbjerg and rkrasiuk and unassigned onbjerg and rkrasiuk Oct 11, 2022

onbjerg added C-tracking-issue An issue that collects information about a broad development initiative A-staged-sync Related to staged sync (pipelines and stages) labels Oct 11, 2022

onbjerg assigned onbjerg and rkrasiuk Oct 24, 2022

rkrasiuk mentioned this issue Nov 9, 2022

Sender Recovery Stage #180

Closed

akirillo mentioned this issue Dec 6, 2022

Tracking: Full sync high-level roadmap #341

Closed

gakonst added this to Reth Tracker Dec 26, 2022

onbjerg moved this to Todo in Reth Tracker Jan 4, 2023

onbjerg moved this from Todo to In Progress in Reth Tracker Jan 4, 2023

onbjerg moved this from In Progress to Tracking in Reth Tracker Jan 4, 2023

onbjerg closed this as completed Jan 23, 2023

github-project-automation bot moved this from Tracking to Done in Reth Tracker Jan 23, 2023

anonymousGiga added a commit to anonymousGiga/reth that referenced this issue Feb 20, 2024

update revm (paradigmxyz#40)

25f5994

anonymousGiga added a commit to anonymousGiga/reth that referenced this issue Feb 20, 2024

update revm (paradigmxyz#40)

3df5ca4

yutianwu pushed a commit to yutianwu/reth that referenced this issue Jul 1, 2024

fix: unwrap failed on fcu_resp (paradigmxyz#40)

8b125a8

AshinGau added a commit to AshinGau/reth that referenced this issue Oct 13, 2024

grevm: update grevm workflows (paradigmxyz#40)

07a71e4

AshinGau added a commit to AshinGau/reth that referenced this issue Oct 13, 2024

grevm: update grevm workflows (paradigmxyz#40)

151c109

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tracking: Staged sync #40

Tracking: Staged sync #40

onbjerg commented Oct 11, 2022 •

edited

Loading

gakonst commented Oct 12, 2022 •

edited

Loading

gakonst commented Oct 14, 2022

onbjerg commented Oct 14, 2022

rakita commented Oct 14, 2022

onbjerg commented Oct 14, 2022

rakita commented Oct 14, 2022

onbjerg commented Oct 18, 2022

rakita commented Oct 18, 2022

onbjerg commented Jan 23, 2023

Tracking: Staged sync #40

Tracking: Staged sync #40

Comments

onbjerg commented Oct 11, 2022 • edited Loading

Stage abstraction

Pipeline

Tooling

Stages

Footnotes

gakonst commented Oct 12, 2022 • edited Loading

gakonst commented Oct 14, 2022

onbjerg commented Oct 14, 2022

rakita commented Oct 14, 2022

onbjerg commented Oct 14, 2022

rakita commented Oct 14, 2022

onbjerg commented Oct 18, 2022

rakita commented Oct 18, 2022

onbjerg commented Jan 23, 2023

onbjerg commented Oct 11, 2022 •

edited

Loading

gakonst commented Oct 12, 2022 •

edited

Loading