Improve Scaling Jobs docs and jobs scaling logic #187

hmoravec · 2020-06-05T15:13:18Z

Current documentation about jobs scaling is incomplete and the jobs scaler does not behave as one would intuitively expect.

The algorithm for spawning of new jobs is not described in detail in the docs. I would expect that:

When a job pulls a new task from the queue, it should dequeue it from the queue on the start.
The value of metric exposed by scaler should be the length of the queue (number of tasks that have not been pulled by any job yet).
After each pollingInterval KEDA spawns new jobs and their number is equal to value of the metric (length of the queue) - number of jobs that don't have running or completed status.
At the same time the number of new jobs is capped by inequality number of new jobs to spawn + number of jobs without running or completed status <= maxReplicaCount.

The 3. point was proposed also in kedacore/keda#525 (comment) to work like this but it still has not been resolved. If there are already jobs in pending status, KEDA does not take them into account and spawns all jobs in the queue again during next polling which can result in huge number of jobs if e.g. the pending jobs cannot be scheduled or are waiting for new nodes to be provisioned.

Further it is not clear, in the context of jobs scaling, what is the meaning of cooldownPeriod, minReplicaCount and threshold in Prometheus trigger (and similar parameters in other triggers).

The text was updated successfully, but these errors were encountered:

tomkerkhove · 2020-06-05T15:38:56Z

We are working on setting better expectations around Jobs in v2.0 and think this is good input for our docs.
As far as I know this works how we are aiming for it, right @zroubalik?

@hmoravec Are you up for contributing improved docs on this front?

stale · 2021-10-13T18:35:58Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.

stale · 2021-10-20T18:36:51Z

This issue has been automatically closed due to inactivity.

triage-new-issues bot added the triage label Jun 5, 2020

TsuyoshiUshio mentioned this issue Jul 25, 2020

[v2] Implement the scaledjob controler for v2 kedacore/keda#945

Merged

2 tasks

stale bot added the stale All issues that are marked as stale due to inactivity label Oct 13, 2021

stale bot closed this as completed Oct 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Scaling Jobs docs and jobs scaling logic #187

Improve Scaling Jobs docs and jobs scaling logic #187

hmoravec commented Jun 5, 2020 •

edited

Loading

tomkerkhove commented Jun 5, 2020

stale bot commented Oct 13, 2021

stale bot commented Oct 20, 2021

Improve Scaling Jobs docs and jobs scaling logic #187

Improve Scaling Jobs docs and jobs scaling logic #187

Comments

hmoravec commented Jun 5, 2020 • edited Loading

tomkerkhove commented Jun 5, 2020

stale bot commented Oct 13, 2021

stale bot commented Oct 20, 2021

hmoravec commented Jun 5, 2020 •

edited

Loading