Scheduling Improvements #747

bcwaldon · 2014-08-05T14:42:47Z

UPDATE 9/24:

remove note about unfair scheduling as offering/bidding mechanism is gone
remove note about supporting memory-based scheduling

There are two major aspects of scheduling for fleet to focus on: resource scheduling and dependency scheduling.

As far as resource scheduling goes, fleet is not going to have a full-featured scheduler. We have no plans to support any resource-related parameters past the leveling of the number of units scheduled to a particular machine.

Dependency scheduling, however, is incredibly important to get right. The following are the currently-supported parameters:

MachineID bypasses the scheduler altogether and places a unit directly on a machine
-MachineMetadata filters the list of possible machines to which a unit can be scheduled using key-value metadata
MachineOf provides affinity, scheduling a unit to the same machine as another unit
Conflicts provides anti-affinity, scheduling a unit to a different machine than any units that match a glob pattern

There are several ideas for new dependency-scheduling behaviors, which are enumerated below:

MachineOf should support multiple arguments (Fleet scheduling broken for inter-unit dependencies #727)
MachineOf should support wildcards (MachineOf should support wildcards #494)
schedule a machine to a specific host, but never reschedule it (feature request: fleet schedules a unit to a host, but never moves it #667)
Requires/Wants Before/After similar to official systemd Requires/Wants/Before/After, but behave at the cluster-level (Unit does not automatically start Required units #464)

stuart-warren · 2014-08-06T10:00:21Z

As far as resource scheduling goes, fleet is not going to have a full-featured scheduler.

Does CoreOS intend to support other schedulers?

Past the leveling of the number of units across the cluster, fleet will only take into account memory limits.

So it will put the same number units onto each server? What if I have a few different specs of servers, some massively more powerful than others? Can I set some bias in the fleet config perhaps?

bcwaldon · 2014-08-06T18:58:11Z

@stuart-warren We definitely intend to provide a full solution here, we're just not going to make fleet solve everyone's scheduling problems.

The fleet scheduler supports metadata-based filtering, and the memory scheduling will be relative to the available memory of each machine independently.

dbason · 2014-09-22T21:37:04Z

@bcwaldon so what would be considered a full solution? Will we have the ability to weight units (if they require relatively more cpu than other units), or will this be something we need to implement outside of Fleet?

bcwaldon · 2014-09-25T00:44:32Z

At this time, we have no plans to support any resource-related parameters. If this is something you care about, you should explore something like kubernetes or mesos.

gust1n · 2014-10-03T09:58:10Z

I totally get your point about keeping the scheduler simple and instead let others bud more high level tools to solve that. But what about some simple spreading of resources? We're using templates to support simple heroku-like scaling of processes. And we would rather not use the conflict fleet param to spread the jobs since we then have to set limits. But very often if we scale a job to, say 3, they all end up on the same host. And what is worse is that often all jobs of all services end up on the same host. This gives us a scenario where 1 host is under heavy load and the 2 others are not used.

Are there any plans for simple spreading of jobs across a cluster?

bcwaldon · 2014-10-03T16:19:41Z

@gust1n The current scheduler distributes units based on the current number of units scheduled to each host. Are you not not experiencing this?

We've also started a discussion around how fleet can support external schedulers over here: #922. If you have any input, I'd appreciate it greatly.

gust1n · 2014-10-09T23:19:23Z

@bcwaldon Unfortunately (on stable) most of the time almost all jobs (except those with X-Fleet logic) ends up on the same machine. On a machine restart they all migrate to the next one. Don't know if what you're describing is not in the stable channel yet? What you described was what my request was all about, something simple that spreads the jobs. I solved it for now by using some X-Fleet conditions anyways.

bcwaldon · 2014-10-09T23:23:16Z

@gust1n yes, by "current scheduler" I mean fleet v0.7.0+. The stable channel will be updated soon.

jonboulle · 2014-12-05T23:04:22Z

Cross-post: see #922 (comment)

bcwaldon added the discussion label Aug 5, 2014

bacongobbler mentioned this issue Aug 6, 2014

Memory/CPU limits for application containers deis/deis#1513

Merged

sstarcher mentioned this issue Sep 29, 2014

Fleet scheduler to not get memory/cpu scheduling deis/deis#1960

Closed

bcwaldon mentioned this issue Sep 30, 2014

Resource Management Phase I #601

Closed

7 tasks

PierreKircher mentioned this issue Oct 2, 2014

Efficient resource utilisation, re-balancing mandate for Engine #922

Closed

epipho mentioned this issue Oct 4, 2014

Simple share based scheduling #945

Open

bcwaldon added the component/engine label Oct 9, 2014

jonboulle mentioned this issue Dec 5, 2014

RFC: Requirements for an extensible scheduling system #1055

Open

aledbf mentioned this issue Dec 17, 2014

Deis does not always spread out applications across machines deis/deis#2763

Closed

aledbf mentioned this issue Jun 30, 2015

Deis scale isnt respecting the cluster. deis/deis#3940

Closed

jonboulle added kind/design and removed kind/question labels Jan 25, 2016

jonboulle added this to the vfuture milestone Jan 25, 2016

hectorj2f mentioned this issue Apr 3, 2016

Limit number of units per node #1530

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scheduling Improvements #747

Scheduling Improvements #747

bcwaldon commented Aug 5, 2014

stuart-warren commented Aug 6, 2014

bcwaldon commented Aug 6, 2014

dbason commented Sep 22, 2014

bcwaldon commented Sep 25, 2014

gust1n commented Oct 3, 2014

bcwaldon commented Oct 3, 2014

gust1n commented Oct 9, 2014

bcwaldon commented Oct 9, 2014

jonboulle commented Dec 5, 2014

Scheduling Improvements #747

Scheduling Improvements #747

Comments

bcwaldon commented Aug 5, 2014

stuart-warren commented Aug 6, 2014

bcwaldon commented Aug 6, 2014

dbason commented Sep 22, 2014

bcwaldon commented Sep 25, 2014

gust1n commented Oct 3, 2014

bcwaldon commented Oct 3, 2014

gust1n commented Oct 9, 2014

bcwaldon commented Oct 9, 2014

jonboulle commented Dec 5, 2014