
Adjust maximum value for memory_share_for_fetch in MemoryStressTest.test_fetch_with_many_partitions #11533

Merged: 1 commit merged into redpanda-data:dev on Jun 22, 2023

Conversation

@dlex (Contributor) commented on Jun 19, 2023

The failing case is an OOM when the memory reserved for fetch is 80% of Kafka memory; apparently some of the knobs in memory control do not accurately reflect what is actually happening with allocations. For this specific test, the setting should be lowered gradually until this crash is gone, and that value should become the highest recommended setting for now.

Getting it down to 0.7.

Fixes #11458
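
To illustrate the shape of the change, here is a minimal, hypothetical sketch in ducktape's parametrization style; the matrix decorator is real ducktape API, but the class body, the comments, and the exact value list are assumptions for illustration, not the actual test source:

from ducktape.mark import matrix

class MemoryStressTest:
    # Hypothetical parametrization: 0.8 was the previous upper bound for
    # memory_share_for_fetch; after this change the highest exercised value
    # is 0.7, since 0.8 still drives the broker out of memory.
    @matrix(memory_share_for_fetch=[0.4, 0.7])
    def test_fetch_with_many_partitions(self, memory_share_for_fetch):
        # Configure the cluster so that up to `memory_share_for_fetch` of the
        # Kafka memory pool may be reserved for fetch, then run the
        # many-partitions fetch workload and assert no broker crashes.
        ...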

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v23.1.x
  • v22.3.x
  • v22.2.x

Release Notes

  • none

The test still fails at 0.8; getting it down to 0.7
@dlex self-assigned this on Jun 20, 2023
@dlex marked this pull request as ready for review on Jun 20, 2023, 14:41
@dlex (Contributor, Author) commented on Jun 20, 2023

@michael-redpanda (Contributor) left a comment

This seems fine. My only question is whether there was a reason 0.8 was selected in the past. Are we just putting a band-aid over an actual problem by reducing this?

@dlex (Contributor, Author) commented on Jun 21, 2023

@michael-redpanda 0.8 was the maximum that avoided OOM, but apparently that is not actually the case (see this comment). Regarding the band-aid: the memory semaphore solution is itself a band-aid, see this.
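
To make the "memory semaphore" reference concrete: conceptually, a fixed share of the Kafka memory pool is set aside for fetch, and fetch requests must reserve units from that pool before allocating buffers. Below is a purely illustrative Python sketch of that accounting idea, an assumption added for explanation only, since the actual mechanism lives in the broker's C++ code and behaves differently in detail:

import threading

class FetchMemorySemaphore:
    """Conceptual model: cap in-flight fetch buffer usage at a share of memory."""

    def __init__(self, total_memory_bytes, memory_share_for_fetch):
        # With memory_share_for_fetch = 0.7, at most 70% of the memory pool
        # can be held by in-flight fetches at any one time.
        self._available = int(total_memory_bytes * memory_share_for_fetch)
        self._cond = threading.Condition()

    def acquire(self, nbytes):
        # Block until `nbytes` can be reserved; the real broker instead trims
        # or delays fetches, but the accounting idea is the same.
        with self._cond:
            while self._available < nbytes:
                self._cond.wait()
            self._available -= nbytes

    def release(self, nbytes):
        # Return the reservation once the fetch response has been sent.
        with self._cond:
            self._available += nbytes
            self._cond.notify_all()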

@dlex (Contributor, Author) commented on Jun 21, 2023

All CI failures are unrelated to this change.

@piyushredpanda merged commit 49c85d4 into redpanda-data:dev on Jun 22, 2023
18 checks passed
@vbotbuildovich (Collaborator): /backport v23.1.x

@vbotbuildovich (Collaborator): /backport v22.3.x

@vbotbuildovich (Collaborator): Failed to run cherry-pick command. I executed the commands below:

git checkout -b backport-pr-11533-v23.1.x-557 remotes/upstream/v23.1.x
git cherry-pick -x 51b3333a0bedb061307e422a0790bed0ef67f3d0

Workflow run logs.

@vbotbuildovich (Collaborator): Failed to run cherry-pick command. I executed the commands below:

git checkout -b backport-pr-11533-v22.3.x-438 remotes/upstream/v22.3.x
git cherry-pick -x 51b3333a0bedb061307e422a0790bed0ef67f3d0

Workflow run logs.

Successfully merging this pull request may close these issues.

CI Failure (Redpanda process unexpectedly stopped) in MemoryStressTest.test_fetch_with_many_partitions