
set max dispatch workers to same as max forks #11800

Merged · kdelee merged 1 commit into devel from max_workers_same_as_max_mem_forks on Feb 24, 2022

Conversation

@kdelee (Member) commented on Feb 23, 2022

Right now, without this, we end up with a different number for max_workers than max_forks. For example, on a control node with 16 Gi of RAM,
max_mem_capacity w/ 100 MB/fork = (16*1024)/100 --> 164
max_workers = 5 * 16 --> 80

This means we would allow that control node to control up to 164 jobs, but every job after the 80th would be stuck in `waiting`, waiting for a dispatch worker to free up to run it.
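For illustration, the mismatch spelled out as a minimal Python sketch (the variable names and the 5-workers-per-GiB formula are reconstructions of the behavior described above, not AWX's exact code):

```python
# Hypothetical reconstruction of the pre-patch mismatch on a 16 GiB control node.
mem_gib = 16
mb_per_fork = 100  # assumed memory cost per fork

# Capacity: how many jobs the node is believed to be able to control.
max_mem_capacity = (mem_gib * 1024) // mb_per_fork  # 16384 // 100 -> 163 (~164 above)

# Dispatcher pool ceiling under the old formula (5 workers per GiB of RAM).
max_workers = 5 * mem_gib  # -> 80

# Jobs 81 through ~163 are admitted for control but have no worker to run them.
print(max_mem_capacity - max_workers, "jobs would sit in waiting")
```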

SUMMARY

Make max_workers == max_forks based on memory capacity, to prevent situations where we start jobs because we believe there is enough capacity to control them, but there are not enough dispatch workers available to actually do so (see the sketch below). In cases where a user decides to use the "capacity adjustment" or otherwise limit how many jobs a control node can control, we
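Conceptually, the fix ties the dispatcher pool ceiling to the same memory-derived fork capacity. A hedged sketch, with illustrative function names rather than AWX's actual API:

```python
def max_forks_from_memory(mem_mib: int, mb_per_fork: int = 100) -> int:
    """Memory-based fork capacity: one fork per `mb_per_fork` MiB of RAM."""
    return mem_mib // mb_per_fork

def max_dispatch_workers(mem_mib: int) -> int:
    # Before this change the pool used an unrelated formula (e.g. 5 per GiB),
    # which could undershoot the advertised capacity and strand jobs in `waiting`.
    # After: the pool ceiling equals the memory-based fork capacity.
    return max_forks_from_memory(mem_mib)

# On a 16 GiB node both numbers now agree (163 with integer division).
assert max_dispatch_workers(16 * 1024) == max_forks_from_memory(16 * 1024)
```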

ISSUE TYPE
  • Bugfix Pull Request
COMPONENT NAME
  • API
AWX VERSION

ADDITIONAL INFORMATION

@jainnikhil30 noticed jobs sitting in `waiting` for a long time when he was running many concurrent jobs, and @AlanCoding helped identify how the max number of dispatch workers factors into that.

@kdelee force-pushed the max_workers_same_as_max_mem_forks branch from 3fc5327 to e1be483 on February 23, 2022
@jainnikhil30 (Contributor) commented

+1, tested the patch; it works. Before the patch, a 16 GB node could only run 80 jobs and the rest all went into waiting. With this patch I am able to run 110 jobs, which is close to the max capacity.

@AlanCoding (Member) left a comment


I'm a little bit worried about increasing the dispatcher workers in general, because this puts us closer to... well... failure. But that should be handled by our general coefficients.

@kdelee merged commit 4bd6c2a into devel on Feb 24, 2022
@kdelee deleted the max_workers_same_as_max_mem_forks branch on February 24, 2022 at 15:53