Design Documents Modular Besu

Modular Besu

Modularization of Besu - can we make Besu more flexible by factoring it into decoupled components which can be exchanged for alternate implementations?

Goal of this document:

Starting a conversation about modularizing Besu.
Keeping track of the discussions.

Modularity Implementation Approach - how modules are divided into business logic vs. cross-cutting concerns, and a proposed proof-of-concept.
Modular Consensus - a preliminary approach to modularizing consensus (BFT) via the plug-in system.
Besu Software Component Map (DRAFT) - a tour of how the Besu codebase is organized into Gradle projects.

General context

We are getting various signals that the future of blockchain technologies is all about modularity. If L2 chains on top of L1 chains are the future, how can we make an L1 client that can be composed from various implementations of sub-components? We also see evidence of this elsewhere - The Merge separated consensus from execution. MEV actors like Flashbots separate proposing a block from building it. Even within clients, we see teams like Erigon re-writing their client in different languages, and combining the best performing subcomponents regardless of language.

Apart from the general direction of blockchain, software has been trending away from monolithic implementations, in order to maximize developer efficiency and reduce change fatigue. Smaller components can reach stability more easily than large monoliths can.

Goals

The goals of this work are to expand the contexts in which Besu can be valuable to users and operators while reducing tech debt in maintenance of the code base and its release process. New use-cases require client modification (customization of Besu's rules). New use-cases may also want some, but not all, of the functionality that Besu provides. These use-cases may also want an easy way to package and distribute their work with Besu's permissive licensing.

To support flexibility and development of Besu in novel contexts going forward, the client needs a new approach to its existing monolithic architecture. Today, the protocol schedule defines how the monolith operates, with different sets of rules, but is unwieldy and hard to modify. We need a new approach that allows for the evolution of the client without the baggage of the monolithic approach.

Enter modularity.

The modularity work can be largely set against three goals:

Resolution of tech debt - to support today's existing use-cases in Besu, we have an unwieldy monolithic approach. This is becoming hard to manage and should be addressed (with the added benefit of the two below goals).
1. Incremental, mergeable, no big bangs, review cycle to warrant inclusion.
Better Distribution - Tailoring the code-base by customizing a set of modules of "Besu" to provide users exactly what they need in context. I.e. I need a private network distribution of Besu, so I need PoA consensus, but not PoW validation rules. These distributions can also have their own CI/release process to adjust testing definition and quality standards. These can also be for individual modules like the EVM.
Better Client Modification - Tailoring the run-time to suit user/developer needs, allowing for deep customization of the client, its rules, and components. Here we have some approaches:
1. Plug-ins and the plug-in API - Using the existing plug-in API, developers can inject modifications into the Besu code at start up that replaces the "vanilla" rule-sets and functionality. It also allows for new components to interface with Besu via the API, but does not allow for whole-sale swapping of components.
2. Modules - Creating boundaries and interfaces in the code-base so Besu's existing components can stand-alone and/or be replaced with like modules. This opens up the client to flexibility of its modules, like replacing the EVM or consensus mechanisms with novel ones (or a new storage/peering stack, etc.). This can help with tech debt, but is a heavy-handed approach to customization.
3. Remote APIs - It is currently possible to drive a blockchain purely over the rpc-apis, using the Engine API developed to facilitate Proof of Stake. This approach could be expanded to allow for other use cases that want to interact with the blockchain.

The latter two goals are somewhat linked. Distributions can be tackled with any client modification approach. APIs vs. modules, however, should be considered as differing tracks.

Potential Benefits

With the above in mind, we outline some potential benefits:

Releases - finer grained components could have a finer grained release process, speeding up the release cycle.
1. Distributions can be cut more easily for specific use-cases, based on what components and customizations to the base client are needed (Mainnet, ETC, Private nets, standalone EVM).
Customization - Modular components with plug-ins enable customization of the client to fit the needs of different use-cases in a clean way, with well-defined APIs and module boundaries.
1. Linea Rollup definition, using plug-ins and novel components to allow Besu to operate an L2 network. Distributing these changes in a repeatable, reliable way to users and node operators.
Increases pace of innovation - experiments and prototypes become much easier, faster, and lower risk to pursue.
1. New use-cases for Besu can be piloted quickly, without maintaining complex forks of the Besu client.
End User Control - software modularity should lend itself easily to greater customizability for the end user.
1. Client modification can be done easily by swapping or altering components. Altered components are Besu at their core, but the plug-in system changes their behavior. Modular components may be Besu components or completely novel components that work with Besu via documented interfaces (i.e. a Rust EVM).
Reduces cognitive complexity - better defined scope for contributors to target a specific part of the codebase. New developers can focus more narrowly, and get up to speed faster with fewer distractions.

General Concerns and Challenges, Possible Mitigations

Engineering effort around Besu
1. Large engineering effort - we will need to always prefer incremental delivery over greenfield or big-bang approaches.
2. Series of workshops to define the work.
Technical project organization
1. Communication planning
  1. Internal - how do we make sure all Besu contributors can keep their finger on the pulse of this initiative.
  2. External - do we need to convey this to external users or interested parties? If so, how?
2. Multi-stakeholder discussion, federating people around modular Besu. Examples of stakeholders this would benefit:
  1. MEV searchers
  2. Rollup implementers
  3. Infrastructure providers
  4. Developers

Besu Minimum Useful Components

Hypothetical situations that would benefit from component composition:

Isolatable execution engine.
- EVM and state are needed and not the Consensus. Ex: Rollups, Hedera Hashgraph, EVM testing tools.
Transaction pool, transaction validation, and block gossip needed. MEV searchers.
- Possibly EVM needed for gas use analysis.
Use case specific builds.
- All-in-one mainnet client that provides Ethereum proof-of-stake as its only consensus mechanism.
State Sync Testbed, rapid prototyping for data stores which can be populated with state changes from a moving chain.

Potential First Steps

Catalog all components.
Test approach on one or more situations listed above.
Extrapolate out a rough timeline on MVP scope and modules timing vs the catalog.
Scope MVP (minimum viable platform).

Questions

Plug-ins vs. modules - will we expand the plug-in API to be unwieldy and exposing too much?

Modularity Implementation Approach

Context

Making Besu more modular requires thinking about the dimensions along which those modules are divided up and separated from each other. Those separations can be categorized into two types: areas of related business logic, and areas of related software function. The former is the domain of "stuff Ethereum needs" and the latter is the domain of "stuff good Java software needs". When we modularize Besu, we are looking to produce reusable, composable software components within the business logic domain.

Bounded Contexts / Business Logic / Blockchain Domain are all synonyms for the type of modules we want to create. We will know we are doing this right when users can easily create a Besu that combines different modules into the client they need.

Cross Cutting Concerns are a little bit different. These should be invisible to the users, but very helpful to the developers. Any Bounded Context may depend on any mix of Cross Cutting Concerns. This part is ok to be a web of dependencies; they are often external projects/libraries we have selected for use all throughout Besu.

Business Logic	Cross Cutting Concerns
1. Consensus 1. Proof of Work 2. External (or none?): Proof of Stake 1. driven by Engine API 3. IBFT 4. QBFT 2. P2P 1. ETH/66 and prior 2. DevP2P 3. Execution 1. EVM 2. Tracing 4. Transaction Management 1. Tessera integration 2. MEV 3. public and private transaction support 4. Proof of work tx validation 5. Synchronizing 1. Full sync 2. Fast sync 3. Snap sync 4. Checkpoint sync 5. Backwards sync 6. Storage 1. World State 1. Forest 2. Bonsai 3. Snapshot (RocksDB specific) 4. Verkle 2. Blockchain 3. Key-Value specific implementations under each.	1. Cryptography 1. Elliptic Curves 2. Signatures 3. Hashing 4. native implementations 2. Serialization 1. RLP 2. JSON 3. GraphQL 3. APIs 1. RPC 1. HTTP 2. Websockets 2. GraphQL 3. IPC 4. Inversion of Control 1. Dagger 2. Spring 5. Observability 1. Logging 2. Metrics 3. Debugging extras 6. Configuration 1. PicoCLI 2. Genesis state vs. named networks 7. Builds 1. Static code analysis 2. Use-case specific distributions 3. test automation 1. Unit 2. Integration 3. System 4. Fuzz

Goals

Pick relevant modules for abstraction against the goals outlined for Modular Besu. A good consideration is the rule of threes: we should only abstract a module when we have a need for it in three contexts.
Begin work on one abstraction.
1. Document approach.
2. Design implementation of interface or inversion of control.
3. Review software engineering practices with the working group.
4. Review code changes and implementation details with the working group.
Define success criteria for remaining modules.
Template design work from one slice and share learnings.
Determine remaining modules (what needs new abstraction, against our goals).
Create working group plan and discuss division of work.

Proof of Concept

Introduce Inversion of Control to implement a vertical slice of functionality. We need to find a feature that touches on a few cross-cutting concerns, but just one Bounded Context.

Nominees:

Transaction validation stack - touches each consensus mechanism, any L2 network, MEV, block building, and more. Good candidate.
MetricsSystem - Each time we need to expose a metric, we need to get this MetricsSystem from upper layers and transfer it from constructor to constructor.
Configuration management.
Jumpdest caching. Only relevant to the EVM, but requires caching, configuration, observability, and hashing. Plan would be to provide each of these as a dependency.
PoA consensus mechanisms. Impact TBD.
Transaction pool. Impact TBD.
Merge Context. What if the Merge Coordinator and related classes could be a dagger module?
Protocol Schedule. Impact TBD.

Once the question of "will dependency injection make for cleaner composition of modules into an application" is answered, we can then start to incrementally adopt it within each bounded context, and across the cross-cutting concerns.

Modular Consensus

Context

This describes a preliminary approach to modularizing the consensus mechanisms in Besu using the plug-in system and accompanying refactoring. The goals of this modularity are to remove friction in development of Besu and to ensure users can always take the latest updates, regardless of their consensus mechanism.

First Steps

Establish a working group. Review the Modularity Implementation Approach.

A possible approach to starting BFT modularity:

Possible approach to BFT modularity

Some suggested aims/ideals for BFT modularity. Not all of these may be possible, but starting with them in mind will help us identify when we have deviated from them, and make sure appropriate docs etc are written to help users migrate:

An existing Besu user can stop their Besu node, pull the new modular Besu image, and restart their Besu node without resyncing.
The user experience of running a Besu QBFT chain is unchanged.
A Besu QBFT node can be run in a single runtime.
BFT-related metrics continue to be produced without name changes.
Existing BFT-specific configuration is honoured (config or genesis file settings).
No additional ports are required to be opened to run a modular Besu QBFT network.
1. I.e. BFT validator peering logic continues to run over the standard P2P port.

Besu Software Component Map (DRAFT)

Note

This is a draft document attempting to explain the layout of the Besu project at the time of writing. Project structure has since evolved. We attempt to answer the question "why is this organized the way it is?" Software components to be considered include but are not limited to Gradle projects, GitHub repos and sub-modules, and Java packages.

Design Values

There are various trade-offs we make when deciding where a software component fits in our mental model. Those trade-off choices reflect our values in organizing software, and are useful when considering where to put something and how to name it. Having a variety of values also allows us flexibility in our choices, without being constrained by a need for consistency.

Monorepos: We value a single repo to house all submodules, since it is easy to search, and allows for greater reuse and inter-dependency. For instance, our tests may depend on a wide variety of components, depending on how high the testing scope goes. Not everything should be tested via unit tests, and for integration and system scoped testing, it is very likely to depend on many different components.
Single Binary: This is a packaging concern that allows all the various runtime options for Besu to be met by a single executable.

Gradle projects

At the time of writing, Besu was divided into many different gradle projects. Each of the bullet points below could likely be expanded a bit, and perhaps a README.md added next to each build.gradle file explaining it.

acceptance-tests - What scope of testing does this represent? Who/what is doing the accepting implied? Seems to provide a DSL to be used to write tests. Do these tests depend on the besu implementation specifically, or do they run over RPC and could be divorced from the besu implementation code?
metrics - rocksdb is an entire gradle project for only 1 class. This was likely done to isolate the metrics used to observe this db implementation. The core seems to be the metrics reporting scaffold, not besu specific, makes sense as its own module.
consensus - consensus seems a little more important than just a feature. Looks like it is only the PoA stuff. Each consensus mechanism has a means of creating a block, has some corresponding rpc methods, may have some unique network messaging, and means to validate other proposed blocks. The common module has a lot of bft related stuff, probably extended by ibft2 and qbft.
crypto - probably makes sense to package by function instead of by feature here, to help avoid rolling your own crypto. Much easier to audit and upgrade all in one place.
enclave - isolates the code needed to work with key management appliances used during PoA-based private enterprise nets. It's the code for communicating with private p2p enclaves such as Tessera and Orion. It's mainly used for private ethereum networks.
util
config - a single package containing configuration structs, mostly for PoA consensus.
plugins - only contains the rocksdb submodule. This is a pluggable key value store implementation.
nat - figuring out the client's IP address is non trivial in many of the environments we expect it to run.
container-tests - pretty small module, looks like it is to be used by CI/CD in order to test configuration and startup of besu containers.
plugin-api - toolkit for devs who want to build plugins for besu.
besu - this can be thought of as the application module. Contains things the user interacts with, such as the command line interface, export tools (for exporting blocks to the filesystem), import tools (for direct importation of blocks from the filesystem), controllers, and services (implementations of the plugin-api).
privacy-contracts - example contracts to be used on PoA networks.
testutil - same as util package, but for tests. Utility classes to be used when testing enterprise features like Enclave, Orion and Tessera.
services
ethereum - this is where ethereum protocols are implemented. All clients likely have implementations analogous to those found in this module. Are ethstats and evmtool specific to Besu, or are they implementing a spec that is common across clients? What is the scope of retesteth and referencetests?
error-prone - project specific rules to be enforced by the errorprone compiler.
pki - a layer of abstraction up from cryptography. This module handles dealing with keystores and certificates, likely used by PoA networks.

⚠️ This wiki is generated. Pages are published from besu-eth/wiki and edits made here are overwritten on the next publish. To change a page, open a pull request against the source repo instead. See Home for how.

Besu Wiki

Contributing

Development & Testing

Developing & Conventions

Project Process

Defect Prioritization Guide

Governance

Incident Reports

Performance & Stability

Design Documents

Programs & Mentorship

Uh oh!

Design Documents Modular Besu

Modular Besu

On this page

General context

Goals

Potential Benefits

General Concerns and Challenges, Possible Mitigations

Besu Minimum Useful Components

Potential First Steps

Questions

Modularity Implementation Approach

Context

Goals

Proof of Concept

Modular Consensus

Context

First Steps

Besu Software Component Map (DRAFT)

Design Values

Gradle projects

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Besu Wiki

Clone this wiki locally