
RPC chunks with streaming SSZ decoding, snappy frames, and stricter DOS limits where possible. #1606

Merged · 3 commits · Mar 9, 2020

Conversation

protolambda (Collaborator):

This is a proposal to focus on length-encoding SSZ contents, enable streaming of chunk contents, and put stricter DOS limits in place.

SSZ without compression stays exactly the same, so testnets can live on.

So far I am aware that Prysm has non-streaming Snappy compression in place as an option, and Lighthouse has a PR open that made a start on snappy support, but it is blocked by questions around Adrian's ongoing network changes. In other clients, such as Lodestar, Artemis and Nimbus, I could find the Snappy dependency, but no implementation of ssz_snappy for RPC.

The motivation here is that it is relatively quick to calculate the encoded length of an SSZ value, since fixed-length values are easily computed from the type, and list values often from multiplying a fixed-length element size with a list length.
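To make that concrete, here is a minimal sketch of why SSZ lengths are cheap to compute (the helper names are illustrative, not from the spec):

```python
# Hypothetical helpers illustrating that serialized SSZ lengths follow from
# type information alone; these names are not part of the spec.

UINT64_LEN = 8  # a uint64 always serializes to 8 bytes

def fixed_list_byte_length(element_size: int, count: int) -> int:
    # A list of fixed-size elements serializes to element_size * count bytes,
    # so the length follows from the element type and the element count alone,
    # without serializing anything.
    return element_size * count

# e.g. a list of 100 uint64 values serializes to 800 bytes
assert fixed_list_byte_length(UINT64_LEN, 100) == 800
```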

Given an SSZ length, an SSZ decoder can read contents directly from the stream, avoiding the need for a temporary buffer. Additionally, snappy frames avoid the need for more than a single frame of buffered bytes to decompress data (worst case 2**16 = 65536 bytes), so the compressed bytes never need to be buffered in full.
The writer just calculates the SSZ length, and can then stream-encode the contents even when using compression, instead of having to fully buffer the compressed data to get a compressed length.

Additionally, this puts better DOS protection in place: we can calculate the maximum size of an SSZ object, and should use it to be stricter about inputs. And given the worst-case snappy-encoded length, we can account for compressed bytes even if we don't know the exact number of compressed bytes in advance. We don't need the exact number, as we can verify the decompressed bytes instead, while checking the bytes read from the stream against the limits we should be checking regardless.

@protolambda (Collaborator, Author):

@arnetheduck @AgeManning @nisdas Please have a look at this proposal. The PR description may not be that clear; the updated networking doc explains it best.

A reader:
- SHOULD NOT read more than `max_encoded_len(n)` bytes (`32 + n + n / 6`) after reading the SSZ length prefix `n` from the header; [this is Snappy's documented worst-case compression result](https://github.com/google/snappy/blob/537f4ad6240e586970fe554614542e9717df7902/snappy.cc#L98).
- SHOULD NOT accept an SSZ length prefix larger than the expected maximum length for the SSZ type (derived from SSZ type information such as vector lengths and list limits).
- MUST consider any bytes remaining after reading the `n` SSZ bytes an invalid input; an EOF is expected.
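The limits above could be enforced along these lines. This is a minimal sketch, not a reference implementation: `check_length_prefix` is a hypothetical helper, and the formula mirrors the linked Snappy source.

```python
def max_encoded_len(n: int) -> int:
    # Snappy's documented worst-case compressed size for n uncompressed bytes.
    return 32 + n + n // 6

def check_length_prefix(n: int, ssz_max_len: int) -> int:
    # Reject a length prefix beyond the type's maximum, and return the hard
    # limit on compressed bytes a reader should accept for this chunk.
    if n > ssz_max_len:
        raise ValueError("SSZ length prefix exceeds the maximum size for this type")
    return max_encoded_len(n)

# A 1000-byte SSZ payload occupies at most 32 + 1000 + 166 = 1198 compressed bytes.
assert check_length_prefix(1000, ssz_max_len=2048) == 1198
```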
Contributor:

This should be clarified: if we read more than `n` bytes, we should return an error on the request/response and ignore the data.

Contributor:

@protolambda can you clean this one up too?

Collaborator (Author):

Yes, looking into this now.

Snappy has two formats: "block" and "frames" (streaming). To support large requests and response chunks, snappy-framing is used.

Since snappy frame contents [have a maximum uncompressed size of `65536` bytes](https://github.com/google/snappy/blob/master/framing_format.txt#L104), and per-frame overhead is just a small header (chunk type + length) plus a 4-byte checksum, the expected buffering of a single frame is acceptable.
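To make the buffering bound concrete, here is a small sketch; the constants come from the framing format, while the helper name is illustrative:

```python
MAX_UNCOMPRESSED_FRAME_DATA = 65536  # framing format: max uncompressed bytes per frame
FRAME_HEADER_LEN = 4                 # chunk type (1 byte) + chunk length (3 bytes)
CHECKSUM_LEN = 4                     # CRC-32C of the uncompressed data

def max_buffered_bytes_per_frame() -> int:
    # Upper bound on what a streaming decoder must hold in memory at once:
    # one frame's header, checksum, and payload. The rest of the stream can
    # be processed frame by frame.
    return FRAME_HEADER_LEN + CHECKSUM_LEN + MAX_UNCOMPRESSED_FRAME_DATA

assert max_buffered_bytes_per_frame() == 65544
```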
Contributor:

So what I understand is that we now have two length prefixes encoded into our data? One for snappy frames in the frame header, and the other for our SSZ objects in their decompressed form?

Collaborator (Author):

Possibly; one Snappy frame doesn't necessarily match one eth2 RPC chunk. With the status quo a chunk does match one snappy block, but then it can't stream within a chunk.

@AgeManning (Contributor):

I think it's reasonable to use the framed format of snappy over the wire. It may not be super important for us now given current objects but could be more useful for larger objects in the future.

In principle, we could have snappy_framed and snappy_block protocols and let clients prefer either, but I'm happy with the current proposal.

@djrtwo (Contributor) commented Feb 15, 2020:

I am pro frames.
@protolambda is there anything we need to do here to get this merged?

specs/phase0/p2p-interface.md — review comments resolved
djrtwo and others added 2 commits on February 27, 2020 (Co-authored-by: Hsiao-Wei Wang <hwwang156@gmail.com>)
@protolambda (Collaborator, Author):

Clarified the invalid-input handling, and cleaned it up to summarize the different cases better. Can I get a final review?

@djrtwo (Contributor) left a comment:

looks good to me
