New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Benchmark job with backoff limit per index #121393
Benchmark job with backoff limit per index #121393
Conversation
This issue is currently awaiting triage. If a SIG or subproject determines this is a relevant issue, they will accept it by applying the The Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
3ec9651
to
b78af4c
Compare
b78af4c
to
2962755
Compare
/test pull-kubernetes-unit |
@alculquicondor PTAL at the benchmark results.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why is this benchmark relevant?
If there are no Pod failures, why would having backoffLimitPerIndex
have an impact?
This is a sanity benchmark to make sure there is no significant penalty for stepping into some new branches of code like here or here. However, to verify the new branches don't add significant penalty at scale we would need to have many failures, probably >1000. At this point, delay due to expotential backoff would take over. Since the delay patterns are different between global and perIndex backoffLimit I'm not sure how to design the proper full benchmark. |
One idea that occurs to me now is to use an integration test. Mark 1000 pods as failed, and measure just the duration of EDIT: I'm trying yet another approach with setting the backoffDelay to 10ms, or less only to make it less relevant.
I will publish the results when have them. |
2297edd
to
4819547
Compare
5c04f66
to
ff87bf3
Compare
SGTM. I have prepared the KEP update: kubernetes/enhancements#4321 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/approve
/retitle Benchmark job with backoff limit per index |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: alculquicondor, mimowo The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
ff87bf3
to
168e016
Compare
/test pull-kubernetes-unit |
Ticketed: #121652 |
/retest |
/remove-sig api-machinery |
/retest |
oops |
LGTM label has been added. Git tree hash: d25535bb9583f54359ab98c41aa974c115131fde
|
This is a test change. I suppose that we can add milestone v1.29 for this. |
Yes, I think it makes sense to add the milestone. |
/assign @soltysh |
/milestone v1.29 |
What type of PR is this?
/kind cleanup
What this PR does / why we need it:
Which issue(s) this PR fixes:
Part of: kubernetes/enhancements#3850
Special notes for your reviewer:
BenchmarkLargeFailureHandling
test. Here we haveto minimize the timing differences due to different backoff strategies.
The results of the benchmark are put the KEP update PR: kubernetes/enhancements#4321.
Also, lowered the nPods parameter to 10 and 100 to avoid issues with long running time on CI. The parameters can be easily increased for local testing.
Does this PR introduce a user-facing change?
Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.: