Fixed large allocation in `kafka::wait_for_leaders` #16287

mmaslankaprv · 2024-01-25T10:17:38Z

Previously we used a simple std::vector of futures to make
waiting for partition leaders concurrent. Using a vector has a drawback
when dealing with large number of topics and partitions since it may be
required to allocate large contiguous chunk of memory for a future
vector. In this particular case we may not use a fragmented vector or
chunked fifo as the when_all uses a plain vector internally.

To make sure no large chunk of memory is allocated to wait for the
partition leaders changed the logic to use
seastar::max_concurrent_for_each.

Fixes: #15908, #16270
Fixes: #16036

Backports Required

Release Notes

none

mmaslankaprv · 2024-01-25T13:00:16Z

/dt

mmaslankaprv · 2024-01-26T08:10:32Z

/ci-repeat 10
skip-units
dt-repeat=100
tests/rptest/tests/topic_creation_test.py

mmaslankaprv · 2024-01-26T13:54:48Z

/ci-repeat 10
skip-units
dt-repeat=20
tests/rptest/tests/topic_creation_test.py

mmaslankaprv · 2024-01-29T09:36:45Z

/ci-repeat 10
skip-units
dt-repeat=20
tests/rptest/tests/topic_creation_test.py

mmaslankaprv · 2024-01-29T17:03:27Z

/ci-repeat 10
skip-units
dt-repeat=20
tests/rptest/tests/topic_creation_test.py

vbotbuildovich · 2024-01-29T19:40:35Z

ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/44441#018d5671-7d76-49d7-a337-d78544fb0815

ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/44441#018d5671-7d75-45c1-983f-293bda48ad87

ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/44441#018d5671-7d77-4000-99ae-f3f7f04f915f

ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/44441#018d5671-7d7b-496a-bbdb-3eaa08be2e48

ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/44441#018d5671-7d7a-447e-a2d4-d765dc5ac918

ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/44441#018d5671-7d7b-4c3f-b1a8-5126d06818d0

ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/44441#018d5671-7d7c-40cf-8ab0-812dbfb4ac7c

mmaslankaprv · 2024-01-30T06:24:48Z

/dt

src/v/kafka/server/handlers/topics/topic_utils.cc

Previously we used a simple `std::vector` of futures to make waiting for partition leaders concurrent. Using a vector has a drawback when dealing with large number of topics and partitions since it may be required to allocate large contiguous chunk of memory for a future vector. In this particular case we may not use a fragmented vector or chunked fifo as the `when_all` uses a plain vector internally. To make sure no large chunk of memory is allocated to wait for the partition leaders changed the logic to use `seastar::max_concurrent_for_each`. Fixes: redpanda-data#15908 Signed-off-by: Michal Maslanka <michal@redpanda.com>

Sometimes it may happen that producer swarm is stopped after topic is recreated leading to a test failure. Added check restarting the producer if necessary Signed-off-by: Michal Maslanka <michal@redpanda.com>

vbotbuildovich · 2024-02-01T07:18:13Z

/backport v23.3.x

vbotbuildovich · 2024-02-01T07:18:14Z

/backport v23.2.x

vbotbuildovich · 2024-02-01T07:19:07Z

Oops! Something went wrong.

Workflow run logs.

vbotbuildovich · 2024-02-01T07:19:07Z

Oops! Something went wrong.

Workflow run logs.

gousteris · 2024-02-01T16:58:08Z

/backport v23.3.x

gousteris · 2024-02-01T16:58:12Z

/backport v23.2.x

github-actions bot added the area/redpanda label Jan 25, 2024

mmaslankaprv force-pushed the wait-for-leaders-large-alloc branch from 20666d7 to 204e1f5 Compare January 29, 2024 17:03

redpanda-data deleted a comment from vbotbuildovich Jan 29, 2024

mmaslankaprv force-pushed the wait-for-leaders-large-alloc branch from 204e1f5 to ea86488 Compare January 30, 2024 06:54

mmaslankaprv changed the title ~~wip~~ Fixed large allocation in kafka::wait_for_leaders Jan 30, 2024

mmaslankaprv marked this pull request as ready for review January 30, 2024 06:55

mmaslankaprv requested review from bharathv and ztlpn January 30, 2024 16:41

rockwotj reviewed Jan 30, 2024

View reviewed changes

src/v/kafka/server/handlers/topics/topic_utils.cc Outdated Show resolved Hide resolved

mmaslankaprv force-pushed the wait-for-leaders-large-alloc branch from ea86488 to 6a75ebc Compare January 31, 2024 07:04

mmaslankaprv requested a review from rockwotj January 31, 2024 09:08

ztlpn previously approved these changes Jan 31, 2024

View reviewed changes

src/v/kafka/server/handlers/topics/topic_utils.cc Outdated Show resolved Hide resolved

rockwotj previously approved these changes Jan 31, 2024

View reviewed changes

mmaslankaprv added 2 commits January 31, 2024 19:02

tests: restart producer swarm if needed

a18fdcd

Sometimes it may happen that producer swarm is stopped after topic is recreated leading to a test failure. Added check restarting the producer if necessary Signed-off-by: Michal Maslanka <michal@redpanda.com>

mmaslankaprv requested review from ztlpn and rockwotj January 31, 2024 18:02

mmaslankaprv dismissed stale reviews from rockwotj and ztlpn via a18fdcd January 31, 2024 18:02

mmaslankaprv force-pushed the wait-for-leaders-large-alloc branch from 6a75ebc to a18fdcd Compare January 31, 2024 18:02

ztlpn approved these changes Jan 31, 2024

View reviewed changes

rockwotj approved these changes Jan 31, 2024

View reviewed changes

mmaslankaprv merged commit ba04490 into redpanda-data:dev Feb 1, 2024
18 checks passed

mmaslankaprv deleted the wait-for-leaders-large-alloc branch February 1, 2024 07:18

This was referenced Feb 1, 2024

[v23.2.x] Fixed large allocation in kafka::wait_for_leaders #16424

Closed

[v23.3.x] Fixed large allocation in kafka::wait_for_leaders #16425

Closed

mmaslankaprv mentioned this pull request Feb 6, 2024

[v23.2.x] Fixed large allocation in kafka::wait_for_leaders #16493

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed large allocation in `kafka::wait_for_leaders` #16287

Fixed large allocation in `kafka::wait_for_leaders` #16287

mmaslankaprv commented Jan 25, 2024 •

edited

mmaslankaprv commented Jan 25, 2024

mmaslankaprv commented Jan 26, 2024

mmaslankaprv commented Jan 26, 2024

mmaslankaprv commented Jan 29, 2024

mmaslankaprv commented Jan 29, 2024

vbotbuildovich commented Jan 29, 2024 •

edited

mmaslankaprv commented Jan 30, 2024

vbotbuildovich commented Feb 1, 2024

vbotbuildovich commented Feb 1, 2024

vbotbuildovich commented Feb 1, 2024

vbotbuildovich commented Feb 1, 2024

gousteris commented Feb 1, 2024

gousteris commented Feb 1, 2024

Fixed large allocation in kafka::wait_for_leaders #16287

Fixed large allocation in kafka::wait_for_leaders #16287

Conversation

mmaslankaprv commented Jan 25, 2024 • edited

Backports Required

Release Notes

mmaslankaprv commented Jan 25, 2024

mmaslankaprv commented Jan 26, 2024

mmaslankaprv commented Jan 26, 2024

mmaslankaprv commented Jan 29, 2024

mmaslankaprv commented Jan 29, 2024

vbotbuildovich commented Jan 29, 2024 • edited

mmaslankaprv commented Jan 30, 2024

vbotbuildovich commented Feb 1, 2024

vbotbuildovich commented Feb 1, 2024

vbotbuildovich commented Feb 1, 2024

vbotbuildovich commented Feb 1, 2024

gousteris commented Feb 1, 2024

gousteris commented Feb 1, 2024

Fixed large allocation in `kafka::wait_for_leaders` #16287

Fixed large allocation in `kafka::wait_for_leaders` #16287

mmaslankaprv commented Jan 25, 2024 •

edited

vbotbuildovich commented Jan 29, 2024 •

edited