scheduler: fix perf downgrade of cases without presence of (anti-)affinity pods #76973
Conversation
4bc5476 to 492b970
Thanks, @Huang-Wei! It looks good. I hope this one behaves as expected.
I also see a couple of other potential optimizations in finding min/max, but let's first see if this PR improves performance. We can add those in follow-up PRs.
```go
// calculate final priority score for each node
result := make(schedulerapi.HostPriorityList, 0, len(nodes))
maxMinDiff := maxCount - minCount
```
It looks to me that if `maxMinDiff` is zero, we can skip the for loop below altogether.
Thanks for the review!
Yes, that could be possible. I can investigate later to see whether the current Priority reduce interface is fine with an empty `schedulerapi.HostPriorityList` vs. mandatory entries of `schedulerapi.HostPriority{Host: node.Name, Score: 0}` in the result set.
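For illustration, a minimal sketch of the skip being discussed, reusing the identifiers from the diff snippet above (`nodes`, `maxCount`, `minCount`, `result`). It assumes the reduce interface wants a zero-score entry per node, which is exactly the open question; this is not the PR's actual code.

```go
maxMinDiff := maxCount - minCount
if maxMinDiff == 0 {
	// All nodes would end up with the same score, so skip the
	// normalization loop entirely. If an empty result set turns out
	// to be acceptable to the reduce interface, the per-node zero
	// entries below could be dropped as well.
	for _, node := range nodes {
		result = append(result, schedulerapi.HostPriority{Host: node.Name, Score: 0})
	}
	return result, nil
}
```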
/lgtm
/approve
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: bsalamat, Huang-Wei. The full list of commands accepted by this bot can be found here. The pull request process is described here.
/retest
What type of PR is this?
/kind bug
/sig scheduling
/assign @bsalamat @ravisantoshgudimetla
What this PR does / why we need it:
Some background: scheduler: performance improvement on PodAffinity #76243 was introduced to improve pod-affinity performance, but it somewhat regressed other use cases, so we reverted that PR.
Root cause of the old PR's regression: A typical kubemark test just runs regular workloads; no affinity pods are involved. That said, the mutex in the inter-pod affinity priority is effectively a no-op there. So what exact overhead did the old PR bring in? I ran a test, and it turns out the allocation of a `*int64` pointer for each node is the culprit.
How this PR fixes the issue: This PR introduces a lazy (on-demand) initialization mechanism for those `*int64` pointers. Then, in the final score calculation phase, a pointer that was never initialized simply gets a score of 0.
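A minimal sketch of the lazy-init idea, assuming hypothetical type and method names for illustration (the real PR's structure differs):

```go
package priorities

import "sync"

// podAffinityPriorityMap is a sketch, not the PR's exact code.
type podAffinityPriorityMap struct {
	mu     sync.Mutex
	counts map[string]*int64 // per-node counters, allocated lazily
}

// addScore allocates a node's counter only when an (anti-)affinity
// term actually matches, so workloads without affinity pods never pay
// for a per-node *int64 allocation.
func (pm *podAffinityPriorityMap) addScore(nodeName string, weight int64) {
	pm.mu.Lock()
	defer pm.mu.Unlock()
	if pm.counts[nodeName] == nil {
		pm.counts[nodeName] = new(int64)
	}
	*pm.counts[nodeName] += weight
}

// finalScore treats a never-initialized counter as a score of 0.
func (pm *podAffinityPriorityMap) finalScore(nodeName string) int64 {
	if c := pm.counts[nodeName]; c != nil {
		return *c
	}
	return 0
}
```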
Benchmark tests for the Inter-PodAffinity Priority: They show the old PR has 60%+ overhead for cases without affinity pods, while this PR has zero overhead.
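For anyone wanting to reproduce the comparison, something along these lines should work; the benchmark name and package path here are assumptions, so check the test files in the PR for the real ones.

```sh
go test -run=NONE -bench=BenchmarkInterPodAffinityPriority -benchmem \
    ./pkg/scheduler/algorithm/priorities/
```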
Can this PR still help PodAffinity cases: Yes.
Which issue(s) this PR fixes:
A robust version of #76243.
Special notes for your reviewer:
If you have already reviewed the old PR, just take a look at commit 4bc5476.
Does this PR introduce a user-facing change?:
(I guess we shouldn't mention it in the release note again, yes or no?)