
schedule placement based on resource usage/capacity #16

Conversation

elgnay
Contributor

@elgnay elgnay commented Aug 9, 2021

The related feature request: open-cluster-management-io/community#52
Signed-off-by: Yang Le <yangle@redhat.com>

@elgnay
Contributor Author

elgnay commented Aug 9, 2021

/assign @qiujian16
/assign @deads2k

@elgnay elgnay force-pushed the resource_based_scheduling branch 4 times, most recently from 1770be4 to 0cceaae on August 10, 2021 07:31

@nobody4t nobody4t left a comment


Hi @elgnay,

I have some comments. Most of them are rewording suggestions. Please consider them; I hope they help make this doc more readable.

#### Story 3: User is able to create a placement to select clusters based on resource usage, and then keep placement decisions updated according to changes in cluster resource usage.
- A CPU-intensive application would like to be deployed on the cluster with the least CPU utilization.

#### Story 4: User is able to create a placement to select clusters based on resource usage, and then ignore any resource usage change afterwards to keep the placement decisions pinned.


User is able to select clusters based on some resource(s) usage once, without considering the usage changes afterwards, keeping the decisions pinned.

Contributor Author


> Hi @elgnay,
>
> I have some comments. Most of them are rewording suggestions. Please consider them; I hope they help make this doc more readable.

@dongwangdw Thanks for the comments. I'll update the proposal accordingly.

In order to support other resources, like GPU, the `allocatable` and `capacity` should be included in the status of managed cluster as well.
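To make that concrete, here is a minimal sketch (an assumption of mine, not the actual ManagedCluster API definition) of how `capacity` and `allocatable` could be surfaced in the cluster status so that extended resources such as GPU are reported too:

```go
package v1

// ResourceList maps a resource name (e.g. "cpu", "memory",
// "nvidia.com/gpu") to its quantity; a plain string is used here for brevity.
type ResourceList map[string]string

// ManagedClusterStatus carries the observed resource totals of a cluster,
// covering extended resources such as GPU alongside cpu and memory.
type ManagedClusterStatus struct {
	// Capacity is the total amount of each resource on the cluster.
	Capacity ResourceList `json:"capacity,omitempty"`
	// Allocatable is the amount of each resource available for scheduling.
	Allocatable ResourceList `json:"allocatable,omitempty"`
}
```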

### Plugin `mostallocatabletocapacityratio`


I may not have the full background on this scenario. But from my point of view, the user will be aware of the resource capacity levels; for example, some clusters have 20 CPUs while others have 200. In that case this ratio may not make sense.

Contributor Author


Yes, you are right. That's the reason why we need another type MostAllocatable besides MostAllocatableToCapacityRatio.

Actually, none of them works perfectly in all scenarios. For example, suppose there are two clusters:

  • cluster1 with 15/20 cpus allocatable;
  • cluster2 with 20/200 cpus allocatable;

With type MostAllocatableToCapacityRatio, cluster1 will have a higher score than cluster2, while with type MostAllocatable, cluster2 will have a higher score. Which one makes more sense? It depends on the user's requirements.
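To make the trade-off concrete, here is a small self-contained sketch (the helper below is purely illustrative, not part of the proposal) that computes both rankings for the two example clusters:

```go
package main

import "fmt"

type cluster struct {
	name                  string
	allocatable, capacity float64 // CPUs
}

func main() {
	clusters := []cluster{
		{"cluster1", 15, 20},  // 15/20 cpus allocatable
		{"cluster2", 20, 200}, // 20/200 cpus allocatable
	}
	for _, c := range clusters {
		// MostAllocatableToCapacityRatio ranks by this ratio:
		// cluster1 wins (0.75 vs 0.10).
		ratio := c.allocatable / c.capacity
		// MostAllocatable ranks by the raw allocatable amount:
		// cluster2 wins (20 vs 15).
		fmt.Printf("%s: ratio=%.2f allocatable=%.0f\n", c.name, ratio, c.allocatable)
	}
}
```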

// the placement.
// This field is ignored when NumberOfClusters in spec is not specified.
// +optional
ClusterResourcePreference *ClusterResourcePreference `json:"clusterResourcePreference,omitempty"`
Member


What is the difference between nil and empty?

Contributor Author


Here is the difference:

  • If nil, it means the placement has no ClusterResourcePreference at all;
  • If empty, it means the placement has an empty ClusterResourcePreference, whose type is MostAllocatableToCapacityRatio. Considering that ClusterResourcePreference.ClusterResources must have at least one item, an empty ClusterResourcePreference is invalid.

	// +kubebuilder:validation:MinItems:=1
	// +required
	ClusterResources []ClusterResource `json:"clusterResources"`
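Putting the two snippets together, here is a hedged, self-contained sketch of the nil-vs-empty distinction; the type names follow the snippets above but the details are illustrative rather than the final API:

```go
package main

import "fmt"

type ClusterResource struct {
	Name string `json:"name"`
}

type ClusterResourcePreference struct {
	// Defaults to MostAllocatableToCapacityRatio when unset.
	Type string `json:"type,omitempty"`
	// +kubebuilder:validation:MinItems:=1
	// +required
	ClusterResources []ClusterResource `json:"clusterResources"`
}

type PlacementSpec struct {
	// +optional
	ClusterResourcePreference *ClusterResourcePreference `json:"clusterResourcePreference,omitempty"`
}

func main() {
	var noPreference PlacementSpec // nil: no resource preference at all
	empty := PlacementSpec{ClusterResourcePreference: &ClusterResourcePreference{}}

	fmt.Println(noPreference.ClusterResourcePreference == nil) // true: plugin is skipped
	// empty is non-nil but has no ClusterResources, so the MinItems
	// validation rejects it at admission time.
	fmt.Println(len(empty.ClusterResourcePreference.ClusterResources) == 0) // true: invalid
}
```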


Given steady and balance weighting, it seems like we may want a more generic prioritizing configuration.

Contributor Author


A generic prioritizing configuration is provided instead of ClusterResourcePreference.

- Otherwise, the score for each managed cluster is 0.

Before returning the scores to the scheduler, the data should be normalized to ensure the value falls in the range between 0 and 100.
`normalized = (score - min(score)) * 100 / (max(score) - min(score))`
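A minimal sketch of that min-max normalization (the function name, map-based signature, and the equal-scores guard are my assumptions):

```go
package main

import (
	"fmt"
	"math"
)

// normalizeScores maps raw per-cluster scores into [0, 100] using
// normalized = (score - min(score)) * 100 / (max(score) - min(score)).
func normalizeScores(scores map[string]float64) map[string]float64 {
	min, max := math.Inf(1), math.Inf(-1)
	for _, s := range scores {
		min = math.Min(min, s)
		max = math.Max(max, s)
	}
	normalized := make(map[string]float64, len(scores))
	for cluster, s := range scores {
		if max == min {
			// All clusters scored equally; assumption: give them all 100.
			normalized[cluster] = 100
			continue
		}
		normalized[cluster] = (s - min) * 100 / (max - min)
	}
	return normalized
}

func main() {
	fmt.Println(normalizeScores(map[string]float64{"cluster1": 0.75, "cluster2": 0.10}))
	// map[cluster1:100 cluster2:0]
}
```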
Member


I think the test cases/version upgrade sections etc. are required. You can set them to N/A if not required.

In addition, I would like to see some examples, and a discussion on how this works with the steady/balance plugins today.

Contributor Author


Added.

### Plugin `mostallocatabletocapacityratio`
It is a prioritizer and scores feasible managed clusters with the process below.
- If the placement has `ClusterResourcePreference` specified in the spec and its `Type` is `MostAllocatableToCapacityRatio`, the score of a managed cluster is the sum of the score for each resource.
`score = sum(resource_x_allocatable / resource_x_capacity)`
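A sketch of that per-cluster sum, with every resource dimension weighted equally (types and names are illustrative, not from the proposal):

```go
package main

import "fmt"

// ratioScore sums allocatable/capacity over the requested resources, each
// dimension contributing with equal weight.
func ratioScore(allocatable, capacity map[string]float64, resources []string) float64 {
	score := 0.0
	for _, r := range resources {
		if capacity[r] > 0 { // skip resources the cluster does not report
			score += allocatable[r] / capacity[r]
		}
	}
	return score
}

func main() {
	allocatable := map[string]float64{"cpu": 15, "memory": 48}
	capacity := map[string]float64{"cpu": 20, "memory": 64}
	// 15/20 + 48/64 = 0.75 + 0.75 = 1.5
	fmt.Println(ratioScore(allocatable, capacity, []string{"cpu", "memory"}))
}
```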
Member


It is worth mentioning that we treat each resource dimension with equal weight, and the reasoning behind that.

Contributor Author


Updated.

@elgnay elgnay force-pushed the resource_based_scheduling branch 2 times, most recently from cf7cd97 to 011050f on August 17, 2021 09:54

### Goals

- Sotry 1 and 2, allow user to select managed clusters based on resource usage/capability with the Placement APIs;

typo story

Contributor Author


Corrected.


### Non-Goals
- Balance workload across the fleet based on cluster resource usage;
- Story 3 and 4 are related to the churning policy of placement, and will be covered in a separate enhancement proposal;

is this different from the steady weighting?

Contributor Author


Yes, they are related. The weight of the steady plugin should be adjusted automatically according to the churning policy of a placement. And we may also support some advanced features, like churningSeconds, to describe to what extent we can tolerate cluster churning.


Link to the feature request: https://github.com/open-cluster-management-io/community/issues/52

### User Stories

is there a way to express: "I must have at least this much space"? If not, why was that excluded?


how does a user handle it when their clusters are autoscaling? What do they expect to happen? This could be presented as there not being capacity available, I think. Or perhaps that capacity is tight, but will expand.

Contributor Author


> is there a way to express: "I must have at least this much space"? If not, why was that excluded?

No, because we cannot support it. Suppose we were able to create a placement which matches all clusters with at least 10G allocatable memory. In the first scheduling cycle, cluster1 is selected by this placement because it has 12G allocatable memory. After several minutes, the allocatable memory of cluster1 is reduced to 8G. We don't know if the reduced 4G of memory was consumed by workload associated with this placement. So in the next scheduling cycle, should cluster1 be selected by this placement or not?

Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

> how does a user handle it when their clusters are autoscaling? What do they expect to happen? This could be presented as there not being capacity available, I think. Or perhaps that capacity is tight, but will expand.

Here is what will happen if managed clusters are autoscaling.

Scale up

  1. User creates a placement with ClusterResourcePreference;
  2. The cluster with the highest allocatable-to-capacity ratio or the most allocatable resources will be selected;
  3. Workload will then be deployed on the selected cluster;
  4. If there is not enough resource available or the capacity is tight, a new node will be added;
  5. Capacity/allocatable of this cluster changes. That will trigger a new scheduling cycle for placements with ClusterResourcePreference;

Scale down

  1. An underutilized node in a managed cluster is removed;
  2. Capacity/allocatable of this cluster changes. That will trigger a new scheduling cycle for placements with ClusterResourcePreference;
  3. Some Placements may no longer select this cluster because of resource change;


Thanks, that explanation makes sense. I might suggest an appendix with your example of scale up and scale down and your example of why you cannot express "at least this much space".

@elgnay elgnay force-pushed the resource_based_scheduling branch 3 times, most recently from 5241923 to cb0cf57 on August 23, 2021 15:49
Signed-off-by: Yang Le <yangle@redhat.com>

## Proposal

### 1. Changes on Placement APIs

I think the combination of this API with the prioritizers in item 2 is going to age pretty well.

4. `ResourceAllocatableMemory`, it scores managed clusters according to allocatable memory.

According to the name it is registered with, the `resource` plugin uses different formulas to calculate the score of a managed cluster; the value falls in the range between -100 and 100.
| Prioritizer | Formula |
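The formula table is truncated here. Purely as a hypothetical illustration of a score landing in [-100, 100] (this exact affine mapping is an assumption, not necessarily the proposal's formulas):

```go
package main

import "fmt"

// toSignedScore is hypothetical only: an affine mapping that takes an
// allocatable/capacity ratio in [0, 1] to a score in [-100, 100]. The
// proposal's actual per-prioritizer formulas are in the (truncated) table above.
func toSignedScore(allocatable, capacity float64) float64 {
	if capacity <= 0 {
		return 0 // guard against division by zero
	}
	ratio := allocatable / capacity
	return 2*ratio*100 - 100 // 0 -> -100, 0.5 -> 0, 1 -> 100
}

func main() {
	fmt.Println(toSignedScore(15, 20)) // 50
}
```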

Thanks for getting very specific here. I think these make sense to me.

@deads2k

deads2k commented Aug 25, 2021

This design looks good and I think it integrates well into the prioritizer work already completed. I see how weighting against churn is a different feature.

Once this feature is created, is another possible consideration one that tries to assess whether a given placement of a resource appears to be permafailing? That is, "the work was assigned to cluster/A and cluster/B, but on cluster/A it is consistently failing".

/approve
/assign @qiujian16

@openshift-ci

openshift-ci bot commented Aug 25, 2021

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: deads2k, elgnay

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@qiujian16
Member

> Once this feature is created, is another possible consideration one that tries to assess whether a given placement of a resource appears to be permafailing? That is, "the work was assigned to cluster/A and cluster/B, but on cluster/A it is consistently failing".

Yes, I think this is one of the concerns raised by @mdelder as well.

@qiujian16
Member

/lgtm

@openshift-ci openshift-ci bot added the lgtm label Aug 26, 2021
@openshift-merge-robot openshift-merge-robot merged commit 70704aa into open-cluster-management-io:main Aug 26, 2021