
Clear decoder buffer #166

Merged · 6 commits merged into master from nate/clear-decoder-buffer on Jun 7, 2024
Conversation

@n8maninger (Member) commented Jun 6, 2024

Ensures the decoder's buffer is cleared when Read encounters an error. This should fix the OOM issues we're seeing when syncing Mainnet. It may also fix SiaFoundation/hostd#406, but I haven't looked deeply into that one yet.

@n8maninger (Member, Author) commented Jun 6, 2024

@lukechampine I think there's another potential attack vector: a huge length prefix for an array can be sent that is technically smaller than `d.lr.N`. Since we use `make([]foo, d.ReadPrefix())` everywhere, the decoder allocates the entire slice immediately, which could expand significantly.

For example, when decoding a block sent by a peer, `d.lr.N` is 50 MB. That would enable a slice like `block.FileContracts` to allocate up to 50M items. I propose changing `ReadPrefix` to `ReadPrefix(max int64)` to prevent that. An alternative is adding `MaxSize()` to `DecoderFrom`.
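A minimal sketch of what the proposed `ReadPrefix(max int64)` might look like (hypothetical, not the actual diff; it assumes the Decoder's existing `ReadUint64` and `SetErr` helpers in types/encoding.go):

```go
// Hypothetical bounded version of ReadPrefix. The caller passes the largest
// element count that makes sense for the field being decoded; anything
// larger is rejected before make() can allocate it.
func (d *Decoder) ReadPrefix(max int64) int {
	n := d.ReadUint64()
	if n > uint64(max) {
		d.SetErr(fmt.Errorf("length prefix %v exceeds maximum %v", n, max))
		return 0
	}
	return int(n)
}
```

A call site could then read `fcs := make([]types.FileContract, d.ReadPrefix(maxFileContracts))`, with `maxFileContracts` being a hypothetical per-field cap.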

@lukechampine (Member) commented:

> For example, when decoding a block sent by a peer, `d.lr.N` is 50 MB. That would enable a slice like `block.FileContracts` to allocate up to 50M items. I propose changing `ReadPrefix` to `ReadPrefix(max int64)` to prevent that. An alternative is adding `MaxSize()` to `DecoderFrom`.

The old encoder took a different approach: https://github.com/NebulousLabs/Sia/blob/master/encoding/marshal.go#L341

```go
// NextPrefix is like NextUint64, but performs sanity checks on the prefix.
// Specifically, if the prefix multiplied by elemSize exceeds MaxSliceSize,
// NextPrefix returns 0 and sets d.Err().
func (d *Decoder) NextPrefix(elemSize uintptr) uint64 {
```

But this kinda sucked, because your call sites end up looking like:

```go
addrs := make([]types.Address, d.NextPrefix(unsafe.Sizeof(types.Address{})))
```

i.e. you had to import `unsafe` all over the place (or hardcode a value). I would be open to a generic version that calls `unsafe.Sizeof` internally, so you could write:

```go
addrs := d.ReadSlice[[]types.Address]()
```

...but Go doesn't support generic methods, so I guess you'd have to do `types.ReadSlice[[]types.Address](d)`?
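A sketch of one possible shape for that helper, parameterized over the element type rather than the slice type (so the call would read `types.ReadSlice[types.Address](d)`). It assumes `ReadPrefix`, `SetErr`, and the unexported `lr` field are reachable because the function lives in the types package, and it uses a pointer-constrained type parameter so `DecodeFrom` can have a pointer receiver:

```go
// ReadSlice is a hypothetical package-level generic helper; Go doesn't
// support generic methods, so it can't hang off *Decoder directly. The
// sanity check mirrors the old NextPrefix: the claimed element count times
// the element size must fit in the bytes remaining on the wire (d.lr.N),
// so a malicious prefix can't force an allocation larger than the message.
func ReadSlice[T any, TP interface {
	*T
	DecoderFrom
}](d *Decoder) []T {
	n := uint64(d.ReadPrefix())
	if elemSize := uint64(unsafe.Sizeof(*new(T))); elemSize > 0 && n > uint64(d.lr.N)/elemSize {
		d.SetErr(fmt.Errorf("encoded slice length (%v) exceeds remaining input", n))
		return nil
	}
	s := make([]T, n)
	for i := range s {
		TP(&s[i]).DecodeFrom(d)
	}
	return s
}
```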

Backing up, though: I did consider this issue when I implemented the decoder. My feeling at the time was that, although it does allow amplification attacks, the amplification is linear, and in practice the scaling factor is not enormous; the biggest is probably `V2FileContract`, at 368 bytes (if my math is correct). So yeah, if you're decoding a 50 MB `SendBlocks` buffer, that can potentially blow up to an 18.4 GB allocation (50M elements × 368 bytes). Not great.

But is this actually a problem? When you call `make([]byte, HUGE_NUMBER)`, the OS doesn't immediately reserve all that memory. To a first approximation, you only pay for what you use. Here's a Go Playground where I allocate an 18 GB slice and modify parts of it. You can see it runs instantly. If you change it to modify the entire slice, it errors out. But in the context of our decoder, this is totally fine: the attack can make us allocate a bunch of memory, but it can't make us write to that memory unless the encoded message is just as big. So in practice, we'll "allocate" 18 GB, page ~50 MB of it into RAM, and then the decoder will hit EOF and the slice will get garbage-collected.
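A self-contained version of that playground experiment (behavior assumed to match a typical overcommitting Linux/macOS setup): the huge `make` reserves virtual address space, but physical pages are only committed as they're written, so touching a handful of pages completes instantly while writing the whole slice would actually consume 18 GB.

```go
package main

import "fmt"

func main() {
	// "Allocate" an 18 GB slice. The runtime reserves virtual address
	// space, but the OS commits physical pages lazily, so this returns
	// immediately on a typical overcommitting system.
	b := make([]byte, 18<<30)

	// Touch one byte per GiB: only these ~18 pages get backed by RAM.
	for i := 0; i < len(b); i += 1 << 30 {
		b[i] = 1
	}
	fmt.Println("allocated", len(b), "bytes; touched only a few pages")

	// Writing every byte would actually commit all 18 GB and likely OOM:
	//   for i := range b { b[i] = 1 }
}
```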

@n8maninger merged commit c89664e into master on Jun 7, 2024 · 8 checks passed
@n8maninger deleted the nate/clear-decoder-buffer branch on Jun 7, 2024 16:11
Successfully merging this pull request may close: "Figure out what is causing hostd to return massive OutputLength in RPCExecuteProgramResponse"