
kafka/protocol: use frag vec for offset fetch #15918

Merged
merged 1 commit into redpanda-data:dev from group-fetch-large-alloc on Jan 3, 2024

Conversation

@rockwotj (Contributor) commented Jan 2, 2024

We've seen oversized allocations with this field when group offset fetches are used.
This switches partitions to a fragmented vector.

Fixes: #15909
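
For context, the sketch below is an illustrative, hypothetical stand-in (not the actual Redpanda fragmented_vector, whose names and details differ) showing the idea the fix relies on: storing elements in small fixed-size fragments so that no single allocation grows with the number of partitions. The 1KiB default matches the small_fragment_vector fragment size mentioned in the review discussion below.

// Illustrative sketch only, not the actual Redpanda fragmented_vector.
// Elements live in fixed-size fragments, so each allocation stays near
// fragment_bytes no matter how many partitions are appended.
#include <algorithm>
#include <cstddef>
#include <utility>
#include <vector>

template <typename T, std::size_t fragment_bytes = 1024>
class fragmented_vector_sketch {
    static constexpr std::size_t elems_per_frag
      = std::max<std::size_t>(1, fragment_bytes / sizeof(T));
    // Each inner vector is capped at elems_per_frag elements. The outer
    // vector still grows, but only by one small vector header per fragment.
    std::vector<std::vector<T>> _frags;
    std::size_t _size = 0;

public:
    void push_back(T v) {
        if (_frags.empty() || _frags.back().size() == elems_per_frag) {
            _frags.emplace_back();
            _frags.back().reserve(elems_per_frag);
        }
        _frags.back().push_back(std::move(v));
        ++_size;
    }
    T& operator[](std::size_t i) {
        return _frags[i / elems_per_frag][i % elems_per_frag];
    }
    std::size_t size() const { return _size; }
};

A plain std::vector<offset_fetch_response_partition> holding thousands of entries needs one contiguous buffer of the full size, which is the oversized allocation this PR addresses; a fragmented layout trades that for many small allocations and slightly more expensive indexing.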

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v23.3.x
  • v23.2.x
  • v23.1.x

Release Notes

Bug Fixes

  • Prevent oversized allocations when group fetching from many partitions.

We've seen oversized allocations in the many-partitions test with group
fetches; switch to using a fragmented vector to prevent that.

Signed-off-by: Tyler Rockwood <rockwood@redpanda.com>
@@ -62,7 +63,8 @@ struct offset_fetch_response final {
         data.error_code = error_code::none;
         if (topics) {
             for (auto& topic : *topics) {
-                std::vector<offset_fetch_response_partition> partitions;
+                small_fragment_vector<offset_fetch_response_partition>
+                  partitions;
Member commented:
question: I'm probably missing some context here, but do we have a sense of what the "usual" size of this vector might be? As in, whether the small_fragment_vector fragment size (1KiB I believe) is the best/safest choice.

@rockwotj (Contributor, Author) commented:

Yeah, large_fragment_vector is the other option, which is close to 128KiB and probably fine? I'm not really sure, to be honest. I'll switch.

@rockwotj (Contributor, Author) commented:

fetchable_partition_response is also small? So I guess it's expected to stay small because it's only data from this shard I believe?

Member commented:

Ya, I don't really know. I assume you could use fragmented_vector with any arbitrary fragment size if you were so inclined (rather than one of the convenience typedefs), but it's not clear to me whether there's any special reason to do so.

Certainly the 1KiB fragment size should prevent bad_alloc in all cases, so I don't want to hold this up. Just curious mostly.
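
To put rough numbers on the trade-off being discussed, here is a small, self-contained calculation. The 1KiB and ~128KiB figures come from the comments above; the 64-byte element size and 5000-partition count are made-up values for illustration, not the real sizeof(offset_fetch_response_partition) or a measured workload.

// Back-of-the-envelope comparison of fragment sizes (assumptions noted above).
#include <cstddef>
#include <cstdio>

int main() {
    constexpr std::size_t elem_bytes = 64;         // hypothetical element size
    constexpr std::size_t partitions = 5000;       // hypothetical group fetch size
    constexpr std::size_t total = elem_bytes * partitions;

    constexpr std::size_t small_frag = 1024;       // 1KiB, per small_fragment_vector
    constexpr std::size_t large_frag = 128 * 1024; // ~128KiB, per large_fragment_vector

    std::printf("one contiguous std::vector buffer: %zu bytes\n", total);
    std::printf("small fragments: %zu allocations of <= %zu bytes\n",
                (total + small_frag - 1) / small_frag, small_frag);
    std::printf("large fragments: %zu allocations of <= %zu bytes\n",
                (total + large_frag - 1) / large_frag, large_frag);
    return 0;
}

Either choice bounds the size of any individual allocation; the difference is only how many allocations are made, which is why the 1KiB fragment size is a safe default here even if it is not tuned.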

@oleiman (Member) left a comment:

lgtm

@rockwotj merged commit 10fca00 into redpanda-data:dev on Jan 3, 2024
22 checks passed
@rockwotj deleted the group-fetch-large-alloc branch on January 3, 2024 at 14:33
@vbotbuildovich (Collaborator):

/backport v23.3.x

@vbotbuildovich (Collaborator):

/backport v23.2.x

@vbotbuildovich (Collaborator):

/backport v23.1.x

@vbotbuildovich (Collaborator):

Failed to create a backport PR to v23.1.x branch. I tried:

git remote add upstream https://github.com/redpanda-data/redpanda.git
git fetch --all
git checkout -b backport-pr-15918-v23.1.x-847 remotes/upstream/v23.1.x
git cherry-pick -x 7e9111efe99120b6bcad95f8e4eafefa3bba737f

Workflow run logs.

Development

Successfully merging this pull request may close these issues:

  • Oversized allocation: 327680 bytes in kafka::group::handle_offset_fetch
3 participants