
Discussion about the behavior of replica scheduling weight preference #730

Closed
Garrybest opened this issue Sep 15, 2021 · 11 comments · Fixed by #841
Labels
kind/feature Categorizes issue or PR as related to a new feature.

Comments

@Garrybest
Member

Garrybest commented Sep 15, 2021

What would you like to be added:
Hi fellows.

Karmada now provides two kinds of replica division preference when the replica scheduling type is Divided. If the preference is Aggregated, the scheduler divides replicas as compactly as possible according to each member cluster's idle resources. However, when the preference is set to Weighted, the scheduler does not consider the clusters' current idle resources at all; the weight preference here is just a static weight.
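For reference, the relevant placement fields look roughly like the sketch below. This is a simplified rendering of the policy/v1alpha1 types referenced in this thread, trimmed for illustration; the field comments are mine.

```go
// ReplicaSchedulingStrategy is a trimmed-down sketch of the replica scheduling
// settings discussed in this issue; the real API carries more fields.
type ReplicaSchedulingStrategy struct {
	// ReplicaSchedulingType is Duplicated or Divided.
	ReplicaSchedulingType string
	// ReplicaDivisionPreference applies when the type is Divided:
	// Aggregated divides replicas by the clusters' idle resources as compactly
	// as possible, Weighted divides them by the static weights below.
	ReplicaDivisionPreference string
	// WeightPreference holds the static weight list. This issue discusses what
	// the scheduler should do when it is nil.
	WeightPreference *ClusterPreferences
}

// ClusterPreferences currently only carries static weights.
type ClusterPreferences struct {
	StaticWeightList []StaticClusterWeight
}

// StaticClusterWeight binds a weight to a set of target clusters (trimmed).
type StaticClusterWeight struct {
	TargetCluster ClusterAffinity
	Weight        int64
}

// ClusterAffinity is trimmed to the cluster-name selector only.
type ClusterAffinity struct {
	ClusterNames []string
}
```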

I'm thinking about taking dynamic weight behavior into consideration when the preference is set to Weighted. If the field WeightPreference is nil, the scheduler could divide the replicas by each member cluster's idle resources, or by each member cluster's maximum available replicas, as described in #580.

More suggestions about the behavior of the replica scheduling weight preference are welcome. What do you think?

Why is this needed:
Dynamic weight behavior based on each member cluster's maximum available replicas helps balance the cluster load. Imagine a Deployment with 12 replicas that needs to be propagated to 3 clusters:

Cluster A max available replicas: 6
Cluster B max available replicas: 12
Cluster C max available replicas: 18

So we could divide the replicas in the ratio 6:12:18, i.e. 2 to cluster A, 4 to cluster B and 6 to cluster C. It is obvious that this division strategy benefits cluster load balancing.
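A minimal Go sketch of this proportional split, assuming a largest-remainder style of rounding (the scheduler's actual rounding may differ); `clusterAvailability` and `divideByAvailability` are hypothetical names used only for illustration:

```go
package main

import (
	"fmt"
	"sort"
)

// clusterAvailability is a hypothetical helper type for this sketch, not part
// of the Karmada API.
type clusterAvailability struct {
	Name      string
	Available int64 // maximum available replicas reported for the cluster
}

// divideByAvailability splits total replicas in proportion to each cluster's
// available replicas, handing leftover replicas to the clusters with the
// largest fractional remainders.
func divideByAvailability(total int64, clusters []clusterAvailability) map[string]int64 {
	var sum int64
	for _, c := range clusters {
		sum += c.Available
	}
	assigned := make(map[string]int64, len(clusters))
	if sum == 0 {
		return assigned
	}

	type remainder struct {
		name string
		frac int64 // numerator of the fractional part; larger gets extra replicas first
	}
	var used int64
	rems := make([]remainder, 0, len(clusters))
	for _, c := range clusters {
		share := total * c.Available / sum // floor of the proportional share
		assigned[c.Name] = share
		used += share
		rems = append(rems, remainder{name: c.Name, frac: total*c.Available - share*sum})
	}

	// Distribute the remaining replicas, largest fractional part first.
	sort.Slice(rems, func(i, j int) bool { return rems[i].frac > rems[j].frac })
	for i := int64(0); i < total-used; i++ {
		assigned[rems[i].name]++
	}
	return assigned
}

func main() {
	fmt.Println(divideByAvailability(12, []clusterAvailability{
		{Name: "A", Available: 6},
		{Name: "B", Available: 12},
		{Name: "C", Available: 18},
	})) // map[A:2 B:4 C:6]
}
```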

@Garrybest added the kind/feature label Sep 15, 2021
@gf457832386
Contributor

Maybe more dimensions need to be considered. Is it necessary to anticipate future tasks and make a more rational distribution?
If anyone has run into a real usage problem, please share it so we can discuss it together!

@Garrybest
Member Author

Hi Fei, glad to see you again. @gf457832386

Anticipate future tasks? Sounds interesting, but I don't know how that would be done. Could you please elaborate?

@gf457832386
Contributor

gf457832386 commented Sep 23, 2021

Our current allocation is a greedy strategy that achieves the current optimum. However, perhaps some situations could be predicted from historical resource usage data to make overall resource utilization more reasonable. For example, right now we divide a task according to the currently available resources, but taking upcoming tasks into account in advance might improve resource utilization and save costs.

@gf457832386
Contributor

Another question: do Karmada works ever need to wait for resources because there are too many works? If resources are always sufficient, the impact of this problem is much smaller.

@Garrybest
Member Author

Our current allocation is a greedy strategy that achieves the current optimum. However, perhaps some situations could be predicted from historical resource usage data to make overall resource utilization more reasonable. For example, right now we divide a task according to the currently available resources, but taking upcoming tasks into account in advance might improve resource utilization and save costs.

Good thinking. But I'm afraid the prediction would not be reliable. Kubernetes also does not rely on any replica prediction when scheduling. A descheduler would be a better fit, since the scheduler only cares about the currently optimal scheduling result while the descheduler makes adjustments at an interval.

@Garrybest
Member Author

Another question: do Karmada works ever need to wait for resources because there are too many works? If resources are always sufficient, the impact of this problem is much smaller.

Sorry, I don't follow. Do you mean workloads waiting for replica assignment when a cluster has insufficient resources? If so, the current behavior is like kube-scheduler: it does nothing but record a failure condition when resources are lacking.

@Garrybest
Member Author

This discussion seems to be blocked; any more ideas? @RainbowMango @gf457832386

@RainbowMango
Member

I'll be back to this discussion soon. :)

@gf457832386
Contributor

I think we have to consider the proportion of surplus resources. Then we can calculate a resource balance score according to a resource-balance formula and select the allocation that best satisfies the balance condition.
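One possible formulation, assumed here purely for illustration and borrowed from the balanced-allocation idea in kube-scheduler, is to score an allocation higher when the CPU and memory usage fractions it would leave on a cluster are close to each other; this is not something decided in this thread.

```go
// balanceScore is a hypothetical resource-balance score in [0, 1]: the closer
// the projected CPU and memory usage fractions are to each other, the higher
// the score. Borrowed from kube-scheduler's balanced-allocation idea; not part
// of Karmada.
func balanceScore(cpuFraction, memFraction float64) float64 {
	diff := cpuFraction - memFraction
	if diff < 0 {
		diff = -diff
	}
	return 1 - diff
}
```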

@RainbowMango
Member

Dynamic weight behavior based on each member cluster's maximum available replicas helps balance the cluster load.

I guess this is the use case and value we want to provide. That makes sense to me.

Not an objection, but I'd like to know why you didn't extend the API in ClusterPreferences. (Maybe I missed the info from the meeting.)

@Garrybest
Member Author

Well, I found this behavior is the opposite of Aggregated, although it is a kind of dynamic weight in some ways. I don't want to make the weight semantics confusing and complicated, so a Dispersive division preference is added instead.
