dra test: enhance performance of test driver controller #119819
Conversation
Please note that we're already in Test Freeze. Fast forwards are scheduled to happen every 6 hours, whereas the most recent run was: Mon Aug 7 22:33:07 UTC 2023.
[APPROVALNOTIFIER] This PR is APPROVED. This pull request has been approved by: pohly. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing /approve in a comment.
/triage accepted
/retest
/lgtm
LGTM label has been added. Git tree hash: 1d3b19e3bfe01f295f5c1ebe289a6f1d1b8a15fc
What type of PR is this?
/kind cleanup
What this PR does / why we need it:
Analyzing the CPU profile of

    go test -timeout=0 -count=5 -cpuprofile profile.out -bench=BenchmarkPerfScheduling/.*Claim.* -benchtime=1ns -run=xxx ./test/integration/scheduler_perf

showed that a significant amount of time was spent iterating over allocated claims to determine how many were allocated per node. That "naive" approach had been taken to avoid maintaining a redundant data structure, but now that performance measurements show that this comes at a cost, introducing such a second field is no longer "premature optimization".
The average scheduling throughput in SchedulingWithResourceClaimTemplate/2000pods_100nodes increases from 16.4 pods/s to 19.2 pods/s.
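The trade-off described above can be sketched in Go. This is a minimal, self-contained illustration, not the actual test driver controller code: the `claim` and `controller` types and their method names are hypothetical. It contrasts the "naive" O(number of claims) per-node count with an O(1) lookup backed by a redundant counter that is kept in sync whenever a claim is added or removed.

```go
package main

import "fmt"

// claim models an allocated resource claim (hypothetical type
// for illustration only).
type claim struct {
	name string
	node string
}

// controller keeps claims keyed by name. perNode is the redundant
// per-node allocation count that makes lookups cheap; it must be
// updated on every add and remove.
type controller struct {
	claims  map[string]claim
	perNode map[string]int
}

func newController() *controller {
	return &controller{
		claims:  map[string]claim{},
		perNode: map[string]int{},
	}
}

// addClaim records an allocation and bumps the per-node counter.
func (c *controller) addClaim(cl claim) {
	c.claims[cl.name] = cl
	c.perNode[cl.node]++
}

// removeClaim drops an allocation and decrements the per-node counter.
func (c *controller) removeClaim(name string) {
	if cl, ok := c.claims[name]; ok {
		delete(c.claims, name)
		if c.perNode[cl.node]--; c.perNode[cl.node] == 0 {
			delete(c.perNode, cl.node)
		}
	}
}

// countNaive iterates over all claims on every lookup; this is the
// kind of scan that showed up prominently in the CPU profile.
func (c *controller) countNaive(node string) int {
	n := 0
	for _, cl := range c.claims {
		if cl.node == node {
			n++
		}
	}
	return n
}

// count is the O(1) lookup backed by the redundant field.
func (c *controller) count(node string) int {
	return c.perNode[node]
}

func main() {
	c := newController()
	c.addClaim(claim{name: "claim-1", node: "node-a"})
	c.addClaim(claim{name: "claim-2", node: "node-a"})
	c.addClaim(claim{name: "claim-3", node: "node-b"})
	c.removeClaim("claim-3")
	fmt.Println(c.countNaive("node-a"), c.count("node-a")) // both report 2
}
```

Both lookups must always agree; the cost of the optimization is the bookkeeping in `addClaim`/`removeClaim`, which is why the simpler scan was preferred until profiling showed it dominating.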
Does this PR introduce a user-facing change?