
Queue Limits By Priority #1697

Merged
merged 14 commits into master from f/chrisma/limit-by-priority on Oct 28, 2022
Conversation

@d80tb7 (Collaborator) commented Oct 25, 2022

Implements queue limits by priority. This enables lower-priority (preemptible) classes to be given higher queue limits in order to incentivize preemption.

Preemption configuration is now of the form:

PreemptionConfig:
  PriorityClasses:
    armada-default:
      priority: 30000
      maximalResourceFractionPerQueue:
        cpu: 20
        memory: 20
    armada-preemptible:
      priority: 20000
      maximalResourceFractionPerQueue:
        cpu: 90
        memory: 90
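The limit check this configuration implies could be sketched roughly as follows. This is illustrative Go, not Armada's actual scheduler code: the `PriorityClass` struct and `exceedsQueueLimit` function are assumptions, and the limit fractions are written as 0–1 values for clarity.

```go
package main

import "fmt"

// PriorityClass mirrors the configuration shown above (illustrative types,
// not Armada's actual structs).
type PriorityClass struct {
	Priority                        int32
	MaximalResourceFractionPerQueue map[string]float64
}

// exceedsQueueLimit reports whether allocating `requested` on top of the
// queue's current `used` resources would push any resource above the
// fraction allowed for the job's priority class. `totals` is the total
// amount of each resource available.
func exceedsQueueLimit(pc PriorityClass, used, requested, totals map[string]float64) bool {
	for resource, limitFraction := range pc.MaximalResourceFractionPerQueue {
		total := totals[resource]
		if total == 0 {
			continue
		}
		if (used[resource]+requested[resource])/total > limitFraction {
			return true
		}
	}
	return false
}

func main() {
	preemptible := PriorityClass{
		Priority:                        20000,
		MaximalResourceFractionPerQueue: map[string]float64{"cpu": 0.9, "memory": 0.9},
	}
	used := map[string]float64{"cpu": 80, "memory": 70}
	requested := map[string]float64{"cpu": 15, "memory": 10}
	totals := map[string]float64{"cpu": 100, "memory": 100}
	// cpu would reach 95/100 = 0.95 > 0.9, so the limit is exceeded.
	fmt.Println(exceedsQueueLimit(preemptible, used, requested, totals)) // true
}
```

Because the preemptible class gets the higher fraction, a queue that is over its `armada-default` limit can still schedule work at `armada-preemptible` priority, which is the incentive described above.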

As part of this PR I removed the preemption logic from the old scheduler.

@d80tb7 d80tb7 changed the title [WIP] Queue Limits By Priority Queue Limits By Priority Oct 28, 2022
@@ -248,7 +252,7 @@ type JetstreamConfig struct {

type QueueManagementConfig struct {
AutoCreateQueues bool
DefaultPriorityFactor queue.PriorityFactor
Collaborator Author

I changed this because it was causing a circular reference. Having the configuration package import other armada modules is problematic because configuration is imported everywhere. In this case all we were using the import for was a type alias, so it shouldn't really matter.
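The fix described here can be sketched as follows: declare the type locally rather than importing the queue package for its alias. This is a minimal illustrative sketch, not the actual Armada change; package layout and field names beyond those in the diff are assumptions.

```go
package main

import "fmt"

// Before (hypothetically), the configuration package imported the queue
// package just for this type, and queue transitively imported configuration,
// creating a cycle. Declaring the type locally breaks the cycle.
type PriorityFactor float64

// QueueManagementConfig mirrors the struct in the diff above, with the
// field now using the local type instead of queue.PriorityFactor.
type QueueManagementConfig struct {
	AutoCreateQueues      bool
	DefaultPriorityFactor PriorityFactor
}

func main() {
	cfg := QueueManagementConfig{AutoCreateQueues: true, DefaultPriorityFactor: 1.5}
	fmt.Printf("%+v\n", cfg)
}
```

Since Go compares and converts simple defined types freely at the boundary where they are used, the callers that previously handled `queue.PriorityFactor` can convert to the local type explicitly.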

@@ -6,8 +6,6 @@ import (
"math/rand"
"time"

v1 "k8s.io/api/core/v1"
Collaborator Author

This is the old scheduler, from which I have removed the preemption logic.

@@ -108,13 +108,12 @@ func matchAnyNodeTypeAllocation(
job *api.Job,
nodeAllocations []*nodeTypeAllocation,
alreadyConsumed nodeTypeUsedResources,
supportedPriorityClasses map[string]int32,
) (nodeTypeUsedResources, bool, error) {
newlyConsumed := nodeTypeUsedResources{}

for _, podSpec := range job.GetAllPodSpecs() {
Collaborator Author

This is the old scheduler, from which I have removed preemption.

-// TODO Is this all validation that needs to be done?
-func validateArmadaConfig(config *configuration.ArmadaConfig) error {
+// TODO: Is this all validation that needs to be done?
+func validateCancelJobsBatchSizeConfig(config *configuration.ArmadaConfig) error {
Collaborator Author

I've renamed this function because it claimed to validate all the armada config, but all it actually did was check that the cancelJobsBatchSize was greater than 0! Clearly we need a better strategy for validating our configuration, but that's a job for another day.
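The renamed function's behaviour, as described here, amounts to a single check. A minimal sketch, using a stand-in config struct rather than Armada's real `configuration.ArmadaConfig`:

```go
package main

import (
	"errors"
	"fmt"
)

// ArmadaConfig is a minimal stand-in for the real configuration struct;
// only the field this validator touches is included.
type ArmadaConfig struct {
	CancelJobsBatchSize int
}

// validateCancelJobsBatchSizeConfig checks only what the old
// validateArmadaConfig actually checked: that CancelJobsBatchSize
// is greater than 0.
func validateCancelJobsBatchSizeConfig(config *ArmadaConfig) error {
	if config.CancelJobsBatchSize <= 0 {
		return errors.New("cancelJobsBatchSize must be greater than 0")
	}
	return nil
}

func main() {
	fmt.Println(validateCancelJobsBatchSizeConfig(&ArmadaConfig{CancelJobsBatchSize: 0}))   // error
	fmt.Println(validateCancelJobsBatchSizeConfig(&ArmadaConfig{CancelJobsBatchSize: 200})) // <nil>
}
```

The narrower name makes it obvious that adding validation for any other field means writing (and wiring in) a new check, rather than assuming this function already covers it.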

@d80tb7 d80tb7 enabled auto-merge (squash) October 28, 2022 12:04
@d80tb7 d80tb7 merged commit 0b2ca8e into master Oct 28, 2022
@owenthomas17 owenthomas17 deleted the f/chrisma/limit-by-priority branch August 25, 2023 14:28

2 participants