feat(gossipsub): use `Bytes` to cut down on allocations #4751

joshuef · 2023-10-27T11:36:40Z

Description

Sets up gossipsub to use Bytes internally in place of Vec<u8> for message processing. This should help avoid potentially costly clones of the message as it is processed and published.
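
A minimal sketch of the cost difference described above. To stay dependency-free it uses std's `Arc<[u8]>` as a stand-in for `Bytes` (an assumption for illustration only); `OwnedMessage`, `SharedMessage` and `same_buffer` are hypothetical names, not types from this PR.

```rust
use std::sync::Arc;

/// Hypothetical message whose payload is an owned `Vec<u8>`.
#[derive(Clone)]
struct OwnedMessage {
    data: Vec<u8>,
}

/// Hypothetical message whose payload is reference-counted.
/// `Arc<[u8]>` stands in for `Bytes` to keep the sketch std-only.
#[derive(Clone)]
struct SharedMessage {
    data: Arc<[u8]>,
}

/// Returns true if both slices start at the same address, i.e. share one buffer.
fn same_buffer(a: &[u8], b: &[u8]) -> bool {
    std::ptr::eq(a.as_ptr(), b.as_ptr())
}

fn main() {
    let owned = OwnedMessage { data: vec![0u8; 4096] };
    let owned_copy = owned.clone(); // allocates and copies 4096 bytes
    assert!(!same_buffer(&owned.data, &owned_copy.data));

    let shared = SharedMessage { data: Arc::from(vec![0u8; 4096]) };
    let shared_copy = shared.clone(); // only bumps a reference count
    assert!(same_buffer(&shared.data, &shared_copy.data));
}
```

Cloning the shared variant leaves both handles pointing at one buffer, which is the property the change relies on.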

Notes & open questions

Change checklist

I have performed a self-review of my own code
I have made corresponding changes to the documentation (I don't think any are needed, it's all internal?)
I have added tests that prove my fix is effective or that my feature works (well.. current tests pass for me. Not sure more are needed?)
A changelog entry has been made in the appropriate crates

joshuef · 2023-10-27T11:40:53Z

I purposefully avoided making sig/key bytes at this point just to keep this slim, but I don't think there'd be any blockers there?

(Also: I debated an API change to just bytes, though I don't think the current change would cause a clone if you already pass Bytes, so maybe it's fine to leave that to the consumer?)

thomaseizinger

This makes sense to me. Couple of things:

If we just change internals we don't need a changelog entry.
You can't change generated code.
I am open to changing the API if we keep it as impl Into<Bytes>.
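
A rough sketch of what an `impl Into<...>` parameter buys the caller. `Payload` and `publish` are hypothetical names, and `Arc<[u8]>` again stands in for `Bytes` so the example needs nothing beyond std.

```rust
use std::sync::Arc;

/// Hypothetical payload type standing in for `Bytes`.
type Payload = Arc<[u8]>;

/// Hypothetical publish function; returns the payload it would send.
fn publish(data: impl Into<Payload>) -> Payload {
    data.into()
}

fn main() {
    // Callers holding a Vec<u8> keep working; the conversion happens once here.
    // (With this Arc stand-in, that conversion copies the bytes one time.)
    let from_vec = publish(vec![1u8, 2, 3]);
    assert_eq!(&from_vec[..], &[1u8, 2, 3][..]);

    // Callers that already hold the shared type pass it through unchanged,
    // because `Into<T> for T` is the identity conversion.
    let shared: Payload = Arc::from(&b"hello"[..]);
    let sent = publish(shared.clone());
    assert!(Arc::ptr_eq(&shared, &sent));
}
```

The second call shows why an already-shared payload should not be copied again when the API accepts `impl Into<...>`.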

protocols/gossipsub/src/generated/gossipsub/pb.rs

joshuef · 2023-10-28T11:53:45Z

Thanks @thomaseizinger.

I've updated the PR to:

use Into<Bytes>
added a changelog for that as it's changing the API
removed the changes to the protobuf files and updated elsewhere for that 👍

This should help avoid potentially costly clones of the message as it is processed and published

thomaseizinger

Thanks! Two minor comments :)

protocols/gossipsub/src/behaviour.rs

protocols/gossipsub/CHANGELOG.md

joshuef · 2023-10-29T12:32:39Z

Okay, updated as per feedback 🙇 .

mxinden · 2023-10-29T13:18:42Z

Thank you for the work here.

Not sure how desirable this is for main, but we've been having quite a bit of mem coming from gossip. We're not transferring wildly large data in the msg either (a few kbs).

Would you mind sharing some numbers before and after? E.g. a heaptrack profile of a node running with and without this patch.

joshuef · 2023-10-29T13:40:01Z

Abbbbsolutely. Literally doing a bit of heaptrackery the now. 👍

Thank you guys for all the work. I'm happy to put a wee PR in where I can!

thomaseizinger

Great! A few more suggestions after looking at it closely now :)

protocols/gossipsub/CHANGELOG.md

examples/chat/src/main.rs

protocols/gossipsub/src/behaviour.rs

protocols/gossipsub/src/protocol.rs

protocols/gossipsub/src/transform.rs

protocols/gossipsub/src/types.rs

Co-authored-by: Thomas Eizinger <thomas@eizinger.io>

…related improvements

joshuef · 2023-10-30T08:56:38Z

Good catches. Updated 👍 🙇

joshuef · 2023-10-30T11:07:44Z

@mxinden

Okay, so here is main-no-libp2p-bytes-heaptrack.safenode.4817.zst.zip, looking at allocations there (specifically searching for gossip), there are a gooood amount.

This is roughly the same load in both instances (~1k nodes, same folders being uploaded and so the same quantity of gossip msgs being generated). I've searched through the heaptracks (we have 20 nodes across the network here heaptracking) for worst case examples.

And here with the 3 PRs applied together. libp2p-bytes-updates-heaptrack.safenode.6027.zst.zip

The latter case here actually has marginally more data go through it, as w/ main the network degrades to become unusable after not too long.

This is where I'm comparing allocations. Total mem used / leaked etc is about the same in both instances as you'll see. But I think all the extra allocations cause a fair amount of load that's crippling us here (as we're perhaps sending much more than was originally envisaged over gossip). But allocations wise, the Bytes code performs around 1/3 as many, it seems, and is performing as normal for us now 💪

protocols/gossipsub/CHANGELOG.md

thomaseizinger

Great work, thank you!

thomaseizinger · 2023-10-31T00:47:34Z

You could look into patching https://github.com/tafia/quick-protobuf to use Bytes which would likely allow for using even fewer allocations.

jxs · 2023-11-01T03:40:36Z

Hi, some Lighthouse nodes have also been OOM'ing with the following jemalloc memory dump

which is probably due to forward_msg cloning the Rpc message for each connection handler (in case of Lighthouse it's usually ~100 connections).
I created a messy (touches the generated protobufs code) hotfix which Arcs the message just to confirm it helps ease the memory usage. I can rework the fix to submit it as a proper PR which, along with this one, should help cut the allocations.
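
To illustrate the fan-out cost described above, here is a small sketch under stated assumptions: `Rpc`, `forward_cloned` and `forward_shared` are hypothetical stand-ins, not the actual gossipsub types or the hotfix itself.

```rust
use std::sync::Arc;

/// Hypothetical stand-in for the RPC message handed to each connection handler.
#[derive(Clone, Debug, PartialEq)]
struct Rpc {
    data: Vec<u8>,
}

/// Forwards one deep copy of the message per handler (one payload copy each).
fn forward_cloned(msg: &Rpc, handlers: usize) -> Vec<Rpc> {
    (0..handlers).map(|_| msg.clone()).collect()
}

/// Forwards one shared handle per handler (no payload copies).
fn forward_shared(msg: &Arc<Rpc>, handlers: usize) -> Vec<Arc<Rpc>> {
    (0..handlers).map(|_| Arc::clone(msg)).collect()
}

fn main() {
    let msg = Rpc { data: vec![7u8; 2048] };
    let copies = forward_cloned(&msg, 100);
    assert_eq!(copies.len(), 100);

    let shared = Arc::new(msg);
    let handles = forward_shared(&shared, 100);
    // 100 handles plus the original, all pointing at one allocation.
    assert_eq!(Arc::strong_count(&shared), 101);
    assert!(handles.iter().all(|h| Arc::ptr_eq(h, &shared)));
}
```

With roughly 100 connections, the cloned variant holds about 100 copies of the payload at once, while the shared variant holds one.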

thomaseizinger · 2023-11-01T09:46:20Z

Thanks for weighing in here @jxs. @mxinden and I discussed this yesterday and we have some doubts about the optimisations presented in this PR. The issue is, the protobuf files contain owned values of data and thus, sooner or later, we will allocate for each message that is being sent.

This can only be fixed if we start to use borrowed data for the protobuf structs which would actually be great. I think the last time we tried this, it didn't work well with asynchronous-codec but the latest version now uses GATs to allow for encoding of borrowed data meaning this could be an option.

@joshuef With the above reasoning, I cannot explain why you are seeing a performance improvement with this PR. Are you sure the two heaptrack snapshots show the same workload? They also show the ConnectionHandler::poll function as the source of the allocations but after looking at the code, I can't see where we would actually be allocating there?

joshuef · 2023-11-01T12:37:40Z

@thomaseizinger within this PR there are no clone sites, but outwith there are some RawMessage clones that otherwise will duplicate the entire data field there.

It also helps prevent / ease any handling of the messages for the consumer too.

(I agree though, if we got this into the lower layers that'd be even better)

thomaseizinger · 2023-11-02T00:23:37Z

@thomaseizinger within this PR there are no clone sites, but outwith there are some RawMessage clones that otherwise will duplicate the entire data field there.

But these sites don't show up in your heaptrack screenshots? Perhaps I am misreading it but that mostly shows the codec and the ConnectionHandler, not the behaviour.

That is what makes me a bit doubtful that we are optimising the right thing here.

I am definitely on-board with optimising memory usage. I just want to understand and see the improvement :)

thomaseizinger · 2023-11-02T06:14:29Z

@joshuef Looking in more detail at the code and these screenshots, I am pretty confident I now know where these allocations are coming from. In the current implementation of quick-protobuf-codec, we allocate a Vec for each message, serialize into this vec and then pass it to the varint codec, which writes it into the BytesMut of the Framed buffer. From there, they get written into the stream.

Framed is designed to reuse a buffer between writes but we don't make use of this and instead allocate a completely new buffer temporarily for writing the message + varint.

I think I improved on this greatly in #4782. Now, we write to the BytesMut directly which should be reused throughout the lifetime of a Framed. In gossipsub, this Framed is actually long-lived (because the stream is long-lived) and thus there should only be a single allocation for the entire stream during writing (modulo resizings if the current buffer is not big enough for the message).

Currently, these allocations happen in ConnectionHandler::poll because there we call Framed::start_send which internally delegates to the codec which would allocate the temporary Vec.
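
The two write paths described above can be sketched roughly as follows. This is not the quick-protobuf-codec code: a plain `Vec<u8>` plays the role of the `BytesMut` inside `Framed`, and `put_uvarint`, `encode_with_temp` and `encode_into` are hypothetical helpers.

```rust
/// Appends `value` as an unsigned LEB128 varint (7 bits per byte, low bits first).
fn put_uvarint(buf: &mut Vec<u8>, mut value: u64) {
    while value >= 0x80 {
        buf.push((value as u8 & 0x7f) | 0x80);
        value >>= 7;
    }
    buf.push(value as u8);
}

/// Per-message temporary: allocates a fresh Vec for every frame.
fn encode_with_temp(payload: &[u8]) -> Vec<u8> {
    let mut frame = Vec::with_capacity(payload.len() + 10);
    put_uvarint(&mut frame, payload.len() as u64);
    frame.extend_from_slice(payload);
    frame
}

/// Reused buffer: appends the frame to a long-lived buffer owned by the caller.
fn encode_into(buf: &mut Vec<u8>, payload: &[u8]) {
    put_uvarint(buf, payload.len() as u64);
    buf.extend_from_slice(payload);
}

fn main() {
    // Long-lived buffer, standing in for the write buffer of a `Framed`.
    let mut buf = Vec::with_capacity(1024);
    for msg in [&b"hello"[..], &b"gossip"[..]] {
        buf.clear(); // keeps the capacity, so no new allocation is needed
        encode_into(&mut buf, msg);
        // Both paths produce the same bytes; only the allocation pattern differs.
        assert_eq!(buf, encode_with_temp(msg));
    }
    assert!(buf.capacity() >= 1024);
}
```

Both functions emit identical frames; the difference is that the second one only allocates when the shared buffer has to grow.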

It would be great if you could test the above PR and check if the number of allocations goes down :)

joshuef · 2023-11-02T08:33:31Z

Looking in more detail at the code and these screenshots

Ah right, yeh I was just focussing on the gossip allocations there. You have the two heaptracks entirely in the same message there if you want to deep dive into the actual data.

For me it's about being able to safely handle/consume/clone these as a user, and also within libp2p, I think. The RawMessage and so on are cloned about in the libp2p code, and Records (in the Kad PR) are similarly used in libp2p and cloned a fair bit in our own code.

It would be great if you could test the above PR and check if the number of allocations goes down :)

I will absolutely have at that now. I assume in isolation from these PRs?

I suspect it'll get allocations down, but we'll still want these changes or similar so clones of RawMessage or Record eg are not so allocation heavy?

thomaseizinger · 2023-11-02T09:44:08Z

For me it's about being able to safely handle/consume/clone these as a user, and also within libp2p, I think. The RawMessage and so on are cloned about in the libp2p code, and Records (in the Kad PR) are similarly used in libp2p and cloned a fair bit in our own code.
[...]
I suspect it'll get allocations down, but we'll still want these changes or similar so clones of RawMessage or Record eg are not so allocation heavy?

We can definitely land these changes, I'd just like to see them having an impact first :)
I doubt we will see big improvements with the current changes because we are constructing the proto::RPC instances so early in the process.

Looking in more detail at the code and these screenshots

Ah right, yeh I was just focussing on the gossip allocations there. You have the two heaptracks entirely in the same message there if you want to deep dive into the actual data.

I'd love to but ironically, heaptrack segfaults when I try to load these files 🙃

joshuef · 2023-11-02T10:08:47Z

Also, I was just poking about quick-protobuf w/r/t Bytes, but I don't think it makes sense there. The generated code would require that the consuming app was having a Bytes dep, which I thiiiink is probably pretty poor UX and likely why they've gone Cow. I think a containing type to minimise the actual allocations until the last moment (and which I think you noted we can avoid w/ Cow::borrowed) makes sense 👍
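
A small sketch of the `Cow` idea mentioned above, assuming a message struct shaped like generated code with a `Cow<'a, [u8]>` field. `Message`, `borrowed` and `is_borrowed` are hypothetical names, not quick-protobuf output.

```rust
use std::borrow::Cow;

/// Hypothetical message with a `Cow` payload field.
#[derive(Debug, Clone, PartialEq)]
struct Message<'a> {
    data: Cow<'a, [u8]>,
}

/// Builds a message that borrows the caller's buffer; no payload allocation.
fn borrowed(data: &[u8]) -> Message<'_> {
    Message { data: Cow::Borrowed(data) }
}

/// Returns true if the message still borrows rather than owns its payload.
fn is_borrowed(msg: &Message<'_>) -> bool {
    matches!(msg.data, Cow::Borrowed(_))
}

fn main() {
    let payload = vec![1u8, 2, 3];
    let msg = borrowed(&payload);
    assert!(is_borrowed(&msg));

    // Taking ownership (e.g. to outlive the buffer) is where the copy happens.
    let owned: Message<'static> = Message { data: Cow::Owned(msg.data.into_owned()) };
    assert!(!is_borrowed(&owned));
    assert_eq!(&owned.data[..], &payload[..]);
}
```

The borrowed form defers the copy until something actually needs an owned payload, which is the "last moment" allocation described above.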

thomaseizinger · 2023-11-02T21:13:57Z

Also, I was just poking about quick-protobuf w/r/t Bytes, but I don't think it makes sense there. The generated code would require that the consuming app was having a Bytes dep, which I thiiiink is probably pretty poor UX and likely why they've gone Cow. I think a containing type to minimise the actual allocations until the last moment (and which I think you noted we can avoid w/ Cow::borrowed) makes sense 👍

You are right! I came to the same conclusion after experimenting a bit now.

mergify · 2024-04-15T09:06:04Z

This pull request has merge conflicts. Could you please resolve them @joshuef? 🙏

thomaseizinger reviewed Oct 27, 2023

protocols/gossipsub/src/generated/gossipsub/pb.rs (outdated)

joshuef force-pushed the GossipBytes branch from 3935a2f to e1afc88 Compare October 28, 2023 11:52

fix(gossip): convert gossip data to Bytes.

25893b2

This should help avoid potentially costly clones over as it is processed and published

joshuef force-pushed the GossipBytes branch from e1afc88 to 25893b2 Compare October 28, 2023 12:09

joshuef mentioned this pull request Oct 28, 2023

feat(kad): use Bytes for Record::value #4753

Open

4 tasks

thomaseizinger reviewed Oct 28, 2023

protocols/gossipsub/src/behaviour.rs (outdated)

protocols/gossipsub/CHANGELOG.md (outdated)

joshuef added 2 commits October 29, 2023 13:31

chore(gossipsub): update changelog with PR ref

8eeba58

chore(gossipsub): remove double conversion to Bytes on publish

ae93015

thomaseizinger reviewed Oct 29, 2023

joshuef and others added 3 commits October 30, 2023 09:49

chore: update protocols/gossipsub/CHANGELOG.md

77d4092

Co-authored-by: Thomas Eizinger <thomas@eizinger.io>

test(gossip): improve Bytes arbitrary generation

9ce9b8c

Co-authored-by: Thomas Eizinger <thomas@eizinger.io>

chore(gossip): changelog update for complete API changes, other Byte …

cf5d24f

…related improvements

thomaseizinger changed the title ~~fix(gossip): convert gossip data to Bytes.~~ feat(gossipsub): use Bytes to cut down on allocations during message processing Oct 31, 2023

thomaseizinger reviewed Oct 31, 2023

protocols/gossipsub/CHANGELOG.md (outdated)

Update protocols/gossipsub/CHANGELOG.md

b9285a7

thomaseizinger approved these changes Oct 31, 2023

thomaseizinger added the send-it label Oct 31, 2023

thomaseizinger changed the title ~~feat(gossipsub): use Bytes to cut down on allocations during message processing~~ feat(gossipsub): use Bytes to cut down on allocations Oct 31, 2023

thomaseizinger removed the send-it label Oct 31, 2023

thomaseizinger mentioned this pull request Nov 2, 2023

Reducing allocations for transfer-heavy protocols by using borrowed data in protobuf files #4781

Open

thomaseizinger mentioned this pull request Nov 7, 2023

refactor(gossipsub): send more specific messages to ConnectionHandler #4811

Merged

4 tasks
