WIP Spec: Draft for top-level p2p spec #80

jmalicevic · 2023-01-05T12:25:09Z

--
This is the beginning of a high-level p2p spec addressing #20 for discussions. I started top down trying to understand what p2p is supposed to do rather than how p2p should do things. This collects findings from several discussion.

Co-authored-by: Lasaro Camargos <lasaro@informal.systems>

…int into josef/i9573-josef-p2p

spec/p2p/p2p-top-level.md

…ated things I will fix later

lasarojc · 2023-01-25T10:37:30Z

spec/p2p/p2p-mempool-reqs.md

+client of the blockchain but has not yet been committed to the blockchain.
+The mempool is thus fed with client transactions,
+that a priori can be submitted to any node in the network.
+And it is consumed by the consensus protocol, more specifically by validator nodes,


Consensus shouldn't have to know about transactions, requiring only that the mempool return a potential block payload. The fact that consensus knows transactions should be a detail of our implementation.

I'm not sure if I agree. Transactions are values for consensus. Blockchain consensus protocol do batching, a block is a batch of values.

I understand but I don't see the value in that as the application itself can do the batching, as the SDK does, specially with ABCI 1 and the application's ability to redo the whole block. But I guess a better point is that the mempool shouldn't have to know about who is consuming its transactions, only that they may be reaped and become invalid due to the transactions being executed or the app state changing.

spec/p2p/p2p-mempool-reqs.md

spec/p2p/p2p-top-level.md

Co-authored-by: Lasaro <lasaro@informal.systems>

… def)

spec/p2p/p2p-mempool-reqs.md

josef-widder · 2023-02-06T10:46:39Z

spec/p2p/reactor-survey.md

+
+#### [CM-REQ-MEMP-GOSSIP.0]
+
+1. every transaction submitted to a correct full node must eventually be included in the mempool of a correct validator


We should figure out whether we want the same guarantees for transactions submitted to any fullnode, of e.g. for MEV, you submit your transaction to a special full node (or a set of full nodes) and you get better guarantees. Best effort transactions vs. transactions with guarantees.
Might be a question for the mempool spec.

Daniel: Define stable nodes vs. transient nodes is important. We can give guarantees wrt stable nodes, while we allow transient nodes.

08d2

:|

08d2 · 2023-04-05T05:09:57Z

spec/p2p/reactor-survey.md

+on the mempool, e.g., every transaction that is submitted to a full node should 
+eventually be present at the mempool of a validator/proposer. 


No way to guarantee this property.

Thanks for looking at the document and giving feedback. Please note that this is a working document that should describe requirements one may expect from the reactors in order to formalize, in turn, what the reactors expect from the p2p system below.

By describing stronger properties (than those we actually need) here, and deriving properties for p2p from them, we can eventually specify a p2p system that may do a bit more than is needed and thus suffices for what the reactors need in reality. We had noted that on line 104, which says

Note that in this document, we want to capture (an overapproximation of) the requirements the mempool puts on p2p.

We will try to make this clearer in a revision to avoid future confusion.

That's all fine, I think. But it is not the case that "every transaction that is submitted to a full node should eventually be present at the mempool of a validator/proposer".

08d2 · 2023-04-05T05:10:18Z

spec/p2p/reactor-survey.md

+into requirements on the p2p layer that are similar to those of the
+consensus reactor.
+
+- end-to-end requirement: "every transaction submitted should be eventually put into a block"


Not possible to guarantee.

See this comment.

08d2 · 2023-04-05T05:15:48Z

spec/p2p/reactor-survey.md

+- current solution to address this requirement: 
+    - the mempool of correct full nodes is organized as a list. 
+    This list can be understood as a shared data structure with relaxed semantics, that is,
+    not all nodes need to agree on the order of the elements, but we have a property that 
+    approximately says "if at time *t* a transaction tx1 is in the list of **all** nodes, and a 
+    client submits transaction tx2 after time t, then it is never the case the tx2 is before
+    tx1 in a list of a correct validator.


This property is not provided by the current spec or implementation. It is not possible to guarantee in a weakly-connected, eventually-consistent, gossip-based network as is defined by the p2p layer.

Not sure I understand the comment. Are you worried about the "if" part or the "then" part here? My understanding is that the property is actually quite week as the "if" part might be hard to be satisfied, so this condition holds quite easily.

I think this may boil down to the definition of a "correct" node. If "correct" means that a node will reliably receive any message gossiped from some other "correct" node within some time bound, then "correct" is relative to an observer, it has no well-defined value in general, and it can't be used to assert stuff like what is asserted here.

If you want to define "correct" in this way, and use that definition to say things like "it is never the case that tx2 is before tx1 among correct nodes", then you're defining a node that suffers any kind of network error to be incorrect, which is unknowable by the protocol as evaluated at any given node.

It is entirely possible that a transaction arrives in the mempool of only (say) 70% of the validators in a network, and if the proposer is one of those 70% of validators that transaction gets put into a proposed block, and that block is validated and committed, all without that transaction ever entering the mempool of (say) 30% of the validators of that network. Those 30% validators can fail to receive the gossip messages that would put the transaction in their mempool, while at the same time successfully receiving (and approving) the consensus messages required to commit a block which included that transaction to the chain.

So "if at time t1 a transaction tx1 is in the list of all nodes" is never possible to assume, because there is no situation where any given transaction is guaranteed to be known by all validators, unless it is put through consensus (and even then it is only guaranteed to be known to the approving quorum); "and a client submits transaction tx2 after time t1" is not something that can even be evaluated, as times t1 and t2 are not reliably comparable, and even if they were that comparison can only be made by nodes which know about both tx1 and tx2, which are not guaranteed to be a quorum (or even a majority) of nodes at any time; and finally if "it is never the case that tx2 is before tx1 in a list of a correct validator" defines order as time order, which is not necessarily true (app side mempools, and alternative prepare proposal hooks, can re-order mempool tx order by any criteria) even if time is reliable (which it isn't, afaiu).

08d2 · 2023-04-05T05:18:16Z

spec/p2p/reactor-survey.md

+
+#### [CM-REQ-MEMP-GOSSIP.0]
+
+1. every transaction submitted to a correct full node must eventually be included in the mempool of a correct validator


It is definitely not true that every transaction submitted to a correct full node will, or can, be included in the mempool of any other arbitrary "correct" validator.

It is important to point out that gossipped information has no delivery guarantees whatsoever. No message that goes through the p2p layer is, by itself, reliably delivered to any specific destination. Gossip does not guarantee that a message is reliably delivered to any/every node in any time frame. Only the information which goes through consensus can be assumed as true.

See this comment.

Let me add the we need to capture what mempool is doing and what it should do. Just saying there are "no guarantees" in the extreme argument leads to the question why there is a mempool after all? My current understanding is that the design of the gossiping in mempool is trying to achieve a very strong end-to-end semantics, although its actual guarantees depend on the connectivity provided by the p2p, which, in practice seems to be quite strong actually, but I agree with you the connectivity cannot be guaranteed all the time in all cases.

Gossip provides no guarantees about delivery that can be assumed by the protocol at any specific node or time. It guarantees eventual delivery, over an unbounded time window, yes, but that doesn't allow the protocol as executed at any specific node to assert specific assumptions like a GST, or that the mempool will definitely contain any specific set of elements. Gossipping message M1 from node N1 at time T1 does not guarantee that M1 will be received by node N2 at time T2 within any time limit. T2 could be T1 plus 1s, or T1 plus 1h, or T1 plus 1d, or T1 plus 1w.

Let me add the we need to capture what mempool is doing and what it should do. Just saying there are "no guarantees" in the extreme argument leads to the question why there is a mempool after all?

"The mempool" is not some thing that has any specific definition at a network level, it is a set of transactions in each full node which is entirely independent between full nodes, at least from the perspective of the protocol.

Gossiping a message to one node does not guarantee that the message will exist in the mempool of any other node by any specific time bounds. It is likely that the mempools of validators, and probably a lot of full nodes, have transaction sets with a lot of overlap. But you can't assume any specific properties of those transaction sets when making decisions at the protocol level.

I understand that it would be very useful if you could do that! But that desire doesn't change reality 🙃 There are no strong end-to-end semantics in the mempool. Stochastically, at a system level, maybe yes, but those properties can't be treated as invariants by protocol code that runs on individual nodes.

08d2 · 2023-04-05T05:18:35Z

spec/p2p/reactor-survey.md

+#### [CM-REQ-MEMP-GOSSIP.0]
+
+1. every transaction submitted to a correct full node must eventually be included in the mempool of a correct validator
+2. if a transaction is included in the mempool of a correct validator eventually it is included in the mempool of all correct validators 


This is also definitely not true.

See this comment

08d2 · 2023-04-05T05:19:52Z

spec/p2p/reactor-survey.md

+1. An evidence transaction reaches all validator nodes as "fast as possible".
+1. Evidence is not written into a block after it has expired
+
+> With more control over the four points, in principle we might define a real-time property which formalizes "if evidence is submitted at least a day before it expires, then evidence is written into a block before it expires". 


Nothing guarantees that evidence submitted at some wall-clock time X will be written into a block before some later wall-clock time Y.

I agree. The statement also doesn't contradict this.

Does the statement not assert that evidence submitted at wall-clock time X (where X is at least a day before the expiration time of that evidence) will be written into a block before some later wall-clock time Y (where Y is the expiration time of the evidence)"?

josef-widder · 2023-04-26T13:18:40Z

I close as the target location of the text is unclear. I have moved relevant parts into our new knowledge base.

…… (backport cometbft#75) (cometbft#80) * perf: Micro optimization to save one allocation per packet (backport cometbft#3018) (cometbft#3023) This PR is a slight optimization to save one allocation per packet. We have much more worthwhile performance improvements to pursue, just driveby noticed it as I was reading through the code. (Though I am surprised this did add up to 1 second in total -- 4% of the processing time) This re-uses the byte reader's allocation across all ReadMsg's. There is no concurrenct access possible under safe usage (also implied by the reader) This is the cause of the 1s time on the far right: ![image](https://github.com/cometbft/cometbft/assets/6440154/d6c6bfaa-d287-4355-b094-9bdcbc6379c8) --- #### PR checklist - [x] Tests written/updated - covered by existing tests - [x] Changelog entry added in `.changelog` (we use [unclog](https://github.com/informalsystems/unclog) to manage our changelog) - [x] Updated relevant documentation (`docs/` or `spec/`) and code comments - [x] Title follows the [Conventional Commits](https://www.conventionalcommits.org/en/v1.0.0/) spec <hr>This is an automatic backport of pull request cometbft#3018 done by [Mergify](https://mergify.com). --------- Co-authored-by: Dev Ojha <ValarDragon@users.noreply.github.com> Co-authored-by: Anton Kaliaev <anton.kalyaev@gmail.com> (cherry picked from commit ad8f851) * changelog (cherry picked from commit dd19cb0) # Conflicts: # CHANGELOG.md --------- Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Co-authored-by: Dev Ojha <dojha@berkeley.edu> Co-authored-by: PaddyMc <paddymchale@hotmail.com>

josef-widder and others added 16 commits November 17, 2022 11:48

start of a top-level English spec for the p2p layer

a2c7ee9

start of intro paragraphs

eeacdae

intro done?

8158a88

rename

2397f81

to be shared

8ec77d4

some requirements discussion

0461d43

things learned from the SRDS 2021 paper.

6c04c1d

minor changes

128b617

Update spec/p2p/p2p-top-level.md

a0d0856

Co-authored-by: Lasaro Camargos <lasaro@informal.systems>

incorporated comments from the PR discussion

7ef04ee

Update spec/p2p/p2p-top-level.md

46c857c

Co-authored-by: Lasaro Camargos <lasaro@informal.systems>

added (unstructured) nodes after office hour with Bucky

84b521a

Merge branch 'josef/i9573-josef-p2p' of github.com:tendermint/tenderm…

0fbeaaf

…int into josef/i9573-josef-p2p

consensus reactor

e3fd2cf

requirements

2d85387

req

3bc10d1

jmalicevic added p2p spec Specification-related wip Work in progress labels Jan 5, 2023

jmalicevic requested review from a team as code owners January 5, 2023 12:25

jmalicevic marked this pull request as draft January 5, 2023 12:25

jmalicevic assigned josef-widder Jan 9, 2023

cason and others added 6 commits January 16, 2023 11:27

spec/p2p: mempool requirements, intro to mempool

22c99c2

spec/p2p: mempool internal interactions

10e6489

spec/p2p: mempool first formalization attempt

73b2fe2

first draft of mempool section

ee1d351

discussion on evidence reactor

d24f2a8

evidence consensus parameters

5178701

cason reviewed Jan 23, 2023

View reviewed changes

addressed two simple comments by Daniel on the Draft PR. More complic…

6923593

…ated things I will fix later

lasarojc reviewed Jan 25, 2023

View reviewed changes

cason and others added 3 commits January 26, 2023 08:15

Apply suggestions from code review

f0454df

Co-authored-by: Lasaro <lasaro@informal.systems>

Update spec/p2p/p2p-top-level.md

713c66e

Co-authored-by: Lasaro <lasaro@informal.systems>

minor changes following PR discussion (without changes to add pending…

a61f85f

… def)

josef-widder reviewed Jan 26, 2023

View reviewed changes

spec/p2p/p2p-mempool-reqs.md Show resolved Hide resolved

josef-widder and others added 2 commits January 26, 2023 16:42

Update spec/p2p/p2p-mempool-reqs.md

268ff6d

rename to CometBFT

61e8a9a

josef-widder mentioned this pull request Jan 27, 2023

Mempool transaction handling properties #223

Open

josef-widder added 8 commits January 27, 2023 11:13

pending discussion

e710f4d

requirements with tags

3be36d0

sync protocols

3f3e8f7

fixed TODOs in the beginning

d19f085

moved reactor discussion out of p2p spec

73a4ddb

clean up expectations. put at right spot for now

0e4e76a

final before new PR

b0fc580

pass over reactor survey

f2f73d2

josef-widder reviewed Feb 6, 2023

View reviewed changes

This was referenced Apr 3, 2023

spec/p2p: Document reactors and the Reactor interface #599

Closed

spec/p2p: Define a structure for the P2P specification #598

Closed

08d2 reviewed Apr 5, 2023

View reviewed changes

josef-widder closed this Apr 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP Spec: Draft for top-level p2p spec #80

WIP Spec: Draft for top-level p2p spec #80

jmalicevic commented Jan 5, 2023

lasarojc Jan 25, 2023

cason Jan 26, 2023

lasarojc Jan 26, 2023 •

edited

Loading

josef-widder Feb 6, 2023

josef-widder Feb 6, 2023

08d2 left a comment

08d2 Apr 5, 2023

josef-widder Apr 5, 2023

08d2 Apr 7, 2023

08d2 Apr 5, 2023

josef-widder Apr 5, 2023

08d2 Apr 5, 2023

josef-widder Apr 5, 2023

08d2 Apr 7, 2023

08d2 Apr 5, 2023

08d2 Apr 5, 2023 •

edited

Loading

josef-widder Apr 5, 2023

08d2 Apr 7, 2023 •

edited

Loading

08d2 Apr 5, 2023

josef-widder Apr 5, 2023

08d2 Apr 5, 2023

josef-widder Apr 5, 2023

08d2 Apr 7, 2023

josef-widder commented Apr 26, 2023


		#### [CM-REQ-MEMP-GOSSIP.0]

		1. every transaction submitted to a correct full node must eventually be included in the mempool of a correct validator

		on the mempool, e.g., every transaction that is submitted to a full node should
		eventually be present at the mempool of a validator/proposer.

WIP Spec: Draft for top-level p2p spec #80

WIP Spec: Draft for top-level p2p spec #80

Conversation

jmalicevic commented Jan 5, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lasarojc Jan 26, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

08d2 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

08d2 Apr 5, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

08d2 Apr 7, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

josef-widder commented Apr 26, 2023

lasarojc Jan 26, 2023 •

edited

Loading

08d2 Apr 5, 2023 •

edited

Loading

08d2 Apr 7, 2023 •

edited

Loading