Kubernetes - Run the scheduler with correct KUBE_MAX_PD_VOLS Env Variable #186

khenidak · 2017-01-16T18:24:00Z

The scheduler defaults/falls back to 16 as maximum allowed PD per agent.
Check: https://github.com/kubernetes/kubernetes/blob/master/plugin/pkg/scheduler/algorithmprovider/defaults/defaults.go#L39
and
https://github.com/kubernetes/kubernetes/blob/master/plugin/pkg/scheduler/algorithmprovider/defaults/defaults.go#L208

This means clusters running with bigger VMs (e.g. Standard_DS14_v2 accepts 32 data disks) will fail to schedule more than 16 pods that have PDs.

Note:
The scheduler currently makes this filtering decision across all agents, and is not designed for non-uniform/mixed node/agent types. We can either go - during cluster provisioning - with MIN(allowed data disk per Agent) resulting in capacity loss or MAX(allowed data disk per agent) resulting in random errors. We will have to accept one solution until k8s takes into consideration node type (from Cloud Provider) during scheduling decisions.

anhowe · 2017-01-24T01:27:48Z

Thanks @khenidak for this bug report. I have set this as P1, and we will watch to see if other customers hit this.

anhowe · 2017-05-01T20:52:41Z

@khenidak is there a flag to set this. Each VM size can support a different number of disks?

khenidak · 2017-05-03T17:18:52Z

No - I have it on my list to modify the scheduler to accommodate placement according to VM size and current-count-of-attached disks.

Also my understanding is open-shift reattaches 48 disks upon startup (i could be wrong). @jimzim can add more info.

stale · 2019-03-09T20:23:03Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contribution. Note that acs-engine is deprecated--see https://github.com/Azure/aks-engine instead.

anhowe added kind/bug orchestrator/k8s priority/P1 labels Jan 24, 2017

anhowe added kind/feature and removed kind/bug labels May 1, 2017

karataliu mentioned this issue Oct 12, 2017

Enable dynamic configue max number of PDs allowed on a node based on machine type kubernetes/kubernetes#53461

Closed

stale bot added the stale label Mar 9, 2019

stale bot closed this as completed Mar 16, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kubernetes - Run the scheduler with correct KUBE_MAX_PD_VOLS Env Variable #186

Kubernetes - Run the scheduler with correct KUBE_MAX_PD_VOLS Env Variable #186

khenidak commented Jan 16, 2017

anhowe commented Jan 24, 2017

anhowe commented May 1, 2017

khenidak commented May 3, 2017 •

edited

Loading

stale bot commented Mar 9, 2019

Kubernetes - Run the scheduler with correct KUBE_MAX_PD_VOLS Env Variable #186

Kubernetes - Run the scheduler with correct KUBE_MAX_PD_VOLS Env Variable #186

Comments

khenidak commented Jan 16, 2017

anhowe commented Jan 24, 2017

anhowe commented May 1, 2017

khenidak commented May 3, 2017 • edited Loading

stale bot commented Mar 9, 2019

khenidak commented May 3, 2017 •

edited

Loading