Preferred scheduling #211
/kind feature
Apart from https://kubernetes.io/docs/concepts/configuration/taint-and-toleration/
I tried to work on it but found it difficult to implement this feature. I believe it is beneficial for anyone concerned with this issue to understand the fundamental difficulty. It is indeed possible to detect that a pod has a more preferable node to be scheduled on, in terms of the sum of affinity weights. However, even if we evict the pod to let kube-scheduler place it on another node with a higher affinity score, it sometimes ends up with the new pod on the same node as before. This is because scheduling is decided not only by node affinity or inter-pod affinity (https://kubernetes.io/docs/concepts/scheduling/kube-scheduler/#scoring). So unless descheduler can make the same decision as kube-scheduler, it can cause this kind of ineffective pod eviction. I have no good idea how to overcome this difficulty. Copying all scheduling policies from kube-scheduler into descheduler is not realistic. As a user of Kubernetes, I decided to always use
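For reference, the kind of weighted preferred affinity discussed above looks like this (a minimal sketch; the label keys, values, and weights are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: example
spec:
  affinity:
    nodeAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
      # kube-scheduler adds each matching term's weight to the node's score;
      # a node matching both terms below gains 80 + 20 = 100 from this plugin,
      # but other scoring plugins can still outweigh that sum, which is why
      # an evicted pod may land back on the same node.
      - weight: 80
        preference:
          matchExpressions:
          - key: topology.kubernetes.io/zone
            operator: In
            values: ["zone-a"]
      - weight: 20
        preference:
          matchExpressions:
          - key: node-type
            operator: In
            values: ["high-memory"]
  containers:
  - name: app
    image: nginx
```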
I am looking for the same behavior, although for a different use case, unless there is a Kubernetes feature I'm unaware of that supports exactly that...
@barucoh take a look at the topologySpreadConstraints feature. It was promoted to beta and is enabled by default starting with k8s v1.18.
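A sketch of the suggested topologySpreadConstraints (label selector and skew values are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: example
  labels:
    app: web
spec:
  topologySpreadConstraints:
  - maxSkew: 1
    topologyKey: topology.kubernetes.io/zone
    # ScheduleAnyway makes this a soft constraint: the scheduler prefers
    # an even spread across zones but never fails scheduling outright
    # when the spread cannot be satisfied.
    whenUnsatisfiable: ScheduleAnyway
    labelSelector:
      matchLabels:
        app: web
  containers:
  - name: app
    image: nginx
```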
This is taken from my other comment on the above closed issue, describing my use case as an example.
The issue referenced above does have a PR against it with a potential implementation, however.
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with /close. Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/remove-lifecycle stale
There is some discussion about giving the scheduler a dry-run capability, which could mean that @asnkh's solution wouldn't require importing all of the scheduling logic into this project. Alas, the issue says that at the moment the way to "check capacity" is using the cluster-capacity tool. kubernetes/kubernetes#58242 <-- kube-scheduler dry run request
Stale issues rot after 30d of inactivity. If this issue is safe to close now please do so with /close. Send feedback to sig-contributor-experience at kubernetes/community.
If someone still wants this functionality in a simple, quick-and-dirty way, please check https://github.com/decayofmind/kube-better-node
/remove-lifecycle rotten
@decayofmind Looks really great! I also have a similar use case. We have spot autoscaling groups and I need to be sure that my pods can always be scheduled, so I cannot use required node affinity, because our spot ASG can be scaled down to 0. And I want a pod rescheduled whenever its preferred node affinity can be satisfied.
We also have a similar use case, where we want descheduler to work with preferredDuringSchedulingIgnoredDuringExecution, because if we use "required" then the number of pods we can spread is limited to the number of nodes in a single-zone cluster, or to the number of zones in a multi-zonal cluster.
@rajivml have you looked at topology spread constraints? Your situation sounds similar to one that was mentioned above #211 (comment)
I have the exact same use case as @sergeyshevch, and I don't know of any other way to address this issue. I'd like to add that a 100% solution isn't needed here; I'd be happy with 80% as well, and I don't mind the occasional ineffective eviction @asnkh mentioned earlier. As long as the evictions respect PDBs, I don't mind them at all.
The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to its triage rules. Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale
/remove-lifecycle stale
As I understand it, the problem here is that this could result in "flapping" of pods. For some of us, that could be acceptable. We don't currently have a good way to make the same decision as kube-scheduler (e.g. the cluster-capacity tool, a dry run, etc.), so what about accepting the limitation and reducing the impact of the flapping? For example, if I could say "don't deschedule a pod with an age of less than 30 minutes", the pod could flap at most once every 30 minutes. The hope would be that, in a dynamic cluster, the pod would eventually be moved as desired. Worst case, you have a periodic flap, and you would understand and accept this if you choose to use the feature.
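A sketch of what such a guard could look like in a descheduler policy. Note that minPodAgeSeconds is a hypothetical field invented here to illustrate the idea, not an existing descheduler option:

```yaml
apiVersion: "descheduler/v1alpha1"
kind: "DeschedulerPolicy"
strategies:
  "RemovePodsViolatingNodeAffinity":
    enabled: true
    params:
      # Hypothetical knob (not a real descheduler option): only consider
      # pods older than 30 minutes, bounding flapping to at most one
      # eviction per pod per 30-minute window.
      minPodAgeSeconds: 1800
```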
/remove-lifecycle stale @seanmalloy I guess such a use case can be implemented later, and we should freeze this issue to continue the discussion and future implementations. Can you freeze it?
It seems this has finally been implemented: #1210
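If #1210 follows the shape of the existing RemovePodsViolatingNodeAffinity strategy, enabling it should look roughly like this (a sketch; check the descheduler docs for the exact field names and supported values):

```yaml
apiVersion: "descheduler/v1alpha1"
kind: "DeschedulerPolicy"
strategies:
  "RemovePodsViolatingNodeAffinity":
    enabled: true
    params:
      nodeAffinityType:
      # Previously only the "required..." affinity type was handled;
      # #1210 adds the preferred variant discussed in this issue.
      - "preferredDuringSchedulingIgnoredDuringExecution"
```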
/close
It would be nice if we could have a soft check like "preferredDuringSchedulingIgnoredDuringExecution"
that calculates whether there is a node with a better weight than the node where the pod is running, evicts the pod, and thereby gets it scheduled on the better node.
For example: because of network routes I prefer my deployment's pods to run in ZoneA, but if that is not possible because the nodes there are offline, I accept being deployed in ZoneB. Once ZoneA is reachable again, I want the pods rescheduled back to ZoneA.
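The ZoneA/ZoneB preference described above can be expressed as a soft node affinity like this (zone label values are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: example
spec:
  affinity:
    nodeAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
      # Soft preference: schedule in ZoneA when possible, otherwise fall
      # back to any other zone (e.g. ZoneB). Moving the pod back once
      # ZoneA recovers is exactly what this issue asks of descheduler.
      - weight: 100
        preference:
          matchExpressions:
          - key: topology.kubernetes.io/zone
            operator: In
            values: ["zone-a"]
  containers:
  - name: app
    image: nginx
```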