Activate unschedulable pods only if the node became more schedulable #70366

mlmhl · 2018-10-29T11:49:53Z

What type of PR is this?
/kind feature

What this PR does / why we need it:

This is a performance optimization for scheduler:

Move unschedulable pods to active queue only if a node's scheduling related properties updated. This PR considers node allocatable, node conditions, node taints, node labels and Node.Spec.Unschedulable as scheduling related properties.

Which issue(s) this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged):
Fixes #70316

Does this PR introduce a user-facing change?:

Scheduler only activates unschedulable pods if node's scheduling related properties change.

wgliang · 2018-10-29T13:39:24Z

@mlmhl please make the CI happy(seem like you need to gofmt the code).

wgliang · 2018-10-29T13:41:29Z

And I think this PR need a effective release note.

bsalamat

Thanks so much for working on this so quickly. Given that nodes send updates every 10 seconds, in large clusters we receive hundreds of node updates per second. So, it is important that the checks be very quick. That's the main reason that I suggested a couple of changes to make the logic simpler. Of course, simpler logic is easier to maintain as well.

bsalamat · 2018-10-29T20:25:05Z

pkg/scheduler/factory/factory.go

+	if reflect.DeepEqual(oldAllocatable, newAllocatable) {
+		return false
+	}
+	for resource, newValue := range newAllocatable {


What you have done makes sense, but in order to be quicker in checking node changes and also to be conservative, I would return "true" as long allocatables are changed, no matter whether they are reduced or increased. In other words, remove this for loop and just return true.

bsalamat · 2018-10-29T20:28:05Z

pkg/scheduler/factory/factory.go

+		return false
+	}
+
+	healthyConditions := []v1.NodeConditionType{v1.NodeReady}


Similar to my previous comment, I would return true as long as old and new conditions are not equal. This will help reduce chances of introducing bugs in the future when new node conditions are added.

mlmhl · 2018-10-30T03:29:06Z

@bsalamat @wgliang All comments are updated, PTAL :)

bsalamat

/lgtm
/approve

Thanks, @mlmhl!

k8s-ci-robot · 2018-10-30T05:54:18Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: bsalamat, mlmhl

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~pkg/scheduler/OWNERS~~ [bsalamat]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

mlmhl · 2018-10-30T09:19:03Z

/test pull-kubernetes-e2e-gce-100-performance

k8s-ci-robot requested review from aveshagarwal and bsalamat October 29, 2018 11:50

bsalamat reviewed Oct 29, 2018

View reviewed changes

activate unschedulable pods only if the node became more schedulable

c50f89d

k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. and removed release-note-none Denotes a PR that doesn't merit a release note. labels Oct 30, 2018

mlmhl force-pushed the scheduler_optimization branch from f7033a4 to c50f89d Compare October 30, 2018 02:51

bsalamat approved these changes Oct 30, 2018

View reviewed changes

k8s-ci-robot assigned bsalamat Oct 30, 2018

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 30, 2018

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 30, 2018

k8s-ci-robot merged commit fda41d1 into kubernetes:master Oct 30, 2018

wojtek-t mentioned this pull request Nov 7, 2018

Scheduler CPU usage hurting scalability? #70708

Closed

bsalamat mentioned this pull request Nov 8, 2018

Revert "Activate unschedulable pods only if the node became more schedulable" #70776

Merged

bsalamat mentioned this pull request Nov 28, 2018

Process unschedulable pods on node updates more efficiently #70316

Closed

mlmhl deleted the scheduler_optimization branch November 29, 2018 07:23

k82cn mentioned this pull request Nov 30, 2018

activate unschedulable pods only if the node became more schedulable #71551

Merged

xiaoxubeii mentioned this pull request Dec 5, 2018

Unschedulable pods may block head of the scheduling queue #71486

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Activate unschedulable pods only if the node became more schedulable #70366

Activate unschedulable pods only if the node became more schedulable #70366

mlmhl commented Oct 29, 2018 •

edited by bsalamat

wgliang commented Oct 29, 2018 •

edited

wgliang commented Oct 29, 2018

bsalamat left a comment

bsalamat Oct 29, 2018

mlmhl Oct 30, 2018

bsalamat Oct 29, 2018

mlmhl Oct 30, 2018

mlmhl commented Oct 30, 2018

bsalamat left a comment

k8s-ci-robot commented Oct 30, 2018

mlmhl commented Oct 30, 2018

Activate unschedulable pods only if the node became more schedulable #70366

Activate unschedulable pods only if the node became more schedulable #70366

Conversation

mlmhl commented Oct 29, 2018 • edited by bsalamat

wgliang commented Oct 29, 2018 • edited

wgliang commented Oct 29, 2018

bsalamat left a comment

Choose a reason for hiding this comment

bsalamat Oct 29, 2018

Choose a reason for hiding this comment

mlmhl Oct 30, 2018

Choose a reason for hiding this comment

bsalamat Oct 29, 2018

Choose a reason for hiding this comment

mlmhl Oct 30, 2018

Choose a reason for hiding this comment

mlmhl commented Oct 30, 2018

bsalamat left a comment

Choose a reason for hiding this comment

k8s-ci-robot commented Oct 30, 2018

mlmhl commented Oct 30, 2018

mlmhl commented Oct 29, 2018 •

edited by bsalamat

wgliang commented Oct 29, 2018 •

edited