Skip to content

Conversation

@swiatekm
Copy link
Contributor

@swiatekm swiatekm commented Nov 21, 2025

Our integration tests create a lot of indices due to each test running in its own namespace. Recently, we've been hitting the per-node cluster wide limit of 1000 shards, causing some tests to randomly be flaky - see #11325 for example.

I've looked into this in #11329, and it does look like we create almost 500 indicies, each of which gets an inactive replica shard that counts towards the limit. I tried to disable replication at first, but doing so for managed indices turns out to be a huge pain. Instead, I decided to just increase the limit, as the replica shards aren't really doing much anyway.

Our integration tests create a lot of indices due to each test running
in its own namespace. Increase the limit, as the performance
penalty doesn't matter for the purpose of our tests.
@swiatekm swiatekm added Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team skip-changelog chore Tasks that just need to be done, they are neither bug, nor enhancements backport-active-all Automated backport with mergify to all the active branches labels Nov 21, 2025
@swiatekm swiatekm marked this pull request as ready for review November 21, 2025 18:25
@swiatekm swiatekm requested a review from a team as a code owner November 21, 2025 18:25
@elasticmachine
Copy link
Contributor

Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane)

@swiatekm swiatekm changed the title Set max_shards_per_node to 2000 Set max_shards_per_node to 2000 for the integration test cluster Nov 21, 2025
@swiatekm swiatekm added the flaky-test Unstable or unreliable test cases. label Nov 21, 2025
@swiatekm swiatekm enabled auto-merge (squash) November 21, 2025 19:01
@swiatekm swiatekm merged commit 81150c8 into main Nov 21, 2025
32 checks passed
@swiatekm swiatekm deleted the ci/increase-ess-max-shard-per-node branch November 21, 2025 19:45
@github-actions
Copy link
Contributor

@Mergifyio backport 8.19 9.1 9.2

@mergify
Copy link
Contributor

mergify bot commented Nov 21, 2025

@elasticmachine
Copy link
Contributor

💛 Build succeeded, but was flaky

Failed CI Steps

cc @swiatekm

mergify bot pushed a commit that referenced this pull request Nov 21, 2025
Our integration tests create a lot of indices due to each test running
in its own namespace. Increase the limit, as the performance
penalty doesn't matter for the purpose of our tests.

(cherry picked from commit 81150c8)
mergify bot pushed a commit that referenced this pull request Nov 21, 2025
Our integration tests create a lot of indices due to each test running
in its own namespace. Increase the limit, as the performance
penalty doesn't matter for the purpose of our tests.

(cherry picked from commit 81150c8)
mergify bot pushed a commit that referenced this pull request Nov 21, 2025
Our integration tests create a lot of indices due to each test running
in its own namespace. Increase the limit, as the performance
penalty doesn't matter for the purpose of our tests.

(cherry picked from commit 81150c8)
swiatekm added a commit that referenced this pull request Nov 21, 2025
Our integration tests create a lot of indices due to each test running
in its own namespace. Increase the limit, as the performance
penalty doesn't matter for the purpose of our tests.

(cherry picked from commit 81150c8)

Co-authored-by: Mikołaj Świątek <mail@mikolajswiatek.com>
swiatekm added a commit that referenced this pull request Nov 21, 2025
Our integration tests create a lot of indices due to each test running
in its own namespace. Increase the limit, as the performance
penalty doesn't matter for the purpose of our tests.

(cherry picked from commit 81150c8)

Co-authored-by: Mikołaj Świątek <mail@mikolajswiatek.com>
swiatekm added a commit that referenced this pull request Nov 21, 2025
Our integration tests create a lot of indices due to each test running
in its own namespace. Increase the limit, as the performance
penalty doesn't matter for the purpose of our tests.

(cherry picked from commit 81150c8)

Co-authored-by: Mikołaj Świątek <mail@mikolajswiatek.com>
hayotbisonai pushed a commit to hayotbisonai/elastic-agent that referenced this pull request Nov 23, 2025
Our integration tests create a lot of indices due to each test running
in its own namespace. Increase the limit, as the performance
penalty doesn't matter for the purpose of our tests.
swiatekm added a commit that referenced this pull request Nov 24, 2025
Our integration tests create a lot of indices due to each test running
in its own namespace. Increase the limit, as the performance
penalty doesn't matter for the purpose of our tests.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

backport-active-all Automated backport with mergify to all the active branches chore Tasks that just need to be done, they are neither bug, nor enhancements flaky-test Unstable or unreliable test cases. skip-changelog Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants