docs: fix typos and add PR link #379

Merged
merged 5 commits into actions:master on Mar 11, 2021

Conversation

callum-tait-pbx (Contributor)

minor changes to docs I noticed

callum-tait-pbx (Contributor Author) commented Mar 8, 2021

@mumoshu I think the following benefit listed in the README.md for the PercentageRunnersBusy schema is wrong:

Allows for multiple controllers to be deployed as each controller deployed is responsible for scaling their own runner pods on a per namespace basis. [#223](https://github.com/summerwind/actions-runner-controller/pull/223)

It came from the snippet below, taken from #223:

@ahmad-hamade this autoscaling scheme will scale the runners that your runner-deployment creates on your behalf, or the runners you've created Runner resources for. If you are running 3 runners all in separate EKS clusters, I am assuming you are also running 3 separate controllers. If that is the case, each horizontal runner autoscaler that you have defined in each cluster will be responsible for scaling that cluster's runners, which is a benefit over the previous implementation, where a single horizontal runner autoscaler is scaling across all runners, since it cannot distinguish pods running in separate clusters.

I don't quite get what the author is getting at. Are you able to figure it out so the benefit in the README can be updated to be correct? I'm struggling to grasp how this is a benefit unique to the PercentageRunnersBusy schema.

mumoshu (Collaborator) commented Mar 9, 2021

@callum-tait-pbx Before #355, TotalNumberOfQueuedAndInProgressWorkflowRuns was unable to distinguish runners from different namespaces. So it was probably theoretically correct to say that PercentageRunnersBusy was more appropriate for multiple actions-runner-controller deployments across namespaces.

However,

Allows for multiple controllers to be deployed as each controller deployed is responsible for scaling their own runner pods on a per namespace basis.

It wasn't working like that until #380.

mumoshu (Collaborator) commented Mar 9, 2021

I've reread the doc and I would say that this part of the doc:

  1. Allows for multiple controllers to be deployed as each controller deployed is responsible for scaling their own runner pods on a per namespace basis. #223

looks obsolete and can be removed now.

After #355 and #380, both scaling metrics (strategies) should work on either a per-namespace or a per-cluster basis without any problem.

callum-tait-pbx (Contributor Author)

@mumoshu that is probably worded better

3. Like all scaling metrics, you can manage workflow allocation to the RunnerDeployment through the use of [Github labels](#runner-labels).

**Drawbacks of this metric**
1. Repositories must be named within the scaling metric; maintaining a list of repositories may not be viable in larger environments or self-serve environments.
2. May not scale quickly enough for some users' needs. This metric is pull-based, so the queue depth is polled as configured by the sync period; as a result, scaling performance is bound by this sync period, meaning there is a lag to scaling activity.
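For reference, an HRA using this metric should resemble the following minimal sketch; the field names follow the project's README of this era, while the resource names, repository, and replica counts are illustrative placeholders:

```yaml
# Illustrative sketch only; names and values are placeholders.
apiVersion: actions.summerwind.dev/v1alpha1
kind: HorizontalRunnerAutoscaler
metadata:
  name: example-runner-deployment-autoscaler
spec:
  scaleTargetRef:
    name: example-runner-deployment   # the RunnerDeployment to scale
  minReplicas: 1
  maxReplicas: 5
  metrics:
  - type: TotalNumberOfQueuedAndInProgressWorkflowRuns
    # Every repository to watch must be listed explicitly,
    # which is the maintenance burden described in drawback 1.
    repositoryNames:
    - example-org/example-repo
```

The queue depth is only re-read on each controller sync, which is why scaling lags behind the actual workflow queue (drawback 2).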
Collaborator


Great explanation 💡


**Drawbacks of this metric**
1. May not scale quickly enough for some users' needs, as we are scaling up and down based on indicative information rather than a direct count of the workflow queue depth. This metric is pull-based, so the number of busy runners is polled as configured by the sync period; as a result, scaling performance is bound by this sync period, meaning there is a lag to scaling activity.
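As a companion to the sketch above, a PercentageRunnersBusy configuration should look roughly like this; the thresholds and factors are illustrative example values, not recommendations:

```yaml
# Illustrative sketch only; thresholds and factors are example values.
apiVersion: actions.summerwind.dev/v1alpha1
kind: HorizontalRunnerAutoscaler
metadata:
  name: example-runner-deployment-autoscaler
spec:
  scaleTargetRef:
    name: example-runner-deployment
  minReplicas: 1
  maxReplicas: 5
  metrics:
  - type: PercentageRunnersBusy
    scaleUpThreshold: '0.75'    # scale up once more than 75% of runners are busy
    scaleDownThreshold: '0.3'   # scale down once fewer than 30% are busy
    scaleUpFactor: '1.4'        # multiply the replica count by 1.4 when scaling up
    scaleDownFactor: '0.7'      # multiply the replica count by 0.7 when scaling down
```

No repository list is needed here, but the busy-runner ratio is still only re-evaluated once per sync period, hence the same lag.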
Collaborator


Probably you could try combining the two. Webhook-based scaling will quickly add (possibly more than necessary) runners on demand, and PercentageRunnersBusy will periodically "correct" the number of runners depending on runner statuses.
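A hedged sketch of this combination, assuming the scaleUpTriggers fields documented for webhook-driven scaling around this time (the event filter, amount, and duration are illustrative):

```yaml
apiVersion: actions.summerwind.dev/v1alpha1
kind: HorizontalRunnerAutoscaler
metadata:
  name: example-runner-deployment-autoscaler
spec:
  scaleTargetRef:
    name: example-runner-deployment
  minReplicas: 1
  maxReplicas: 10
  # Push path: a webhook for a queued check_run adds a runner almost immediately.
  scaleUpTriggers:
  - githubEvent:
      checkRun:
        types: ["created"]
        status: "queued"
    amount: 1
    duration: "5m"    # the extra capacity expires after 5 minutes
  # Pull path: PercentageRunnersBusy periodically "corrects" the replica count
  # based on how many runners are actually busy.
  metrics:
  - type: PercentageRunnersBusy
    scaleUpThreshold: '0.75'
    scaleDownThreshold: '0.3'
    scaleUpFactor: '1.4'
    scaleDownFactor: '0.7'
```

The webhook path reacts within seconds, while the metric path smooths out any over-provisioning on the next sync.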

Contributor Author


Does the webhook scaling work in conjunction with the regular metrics? I saw you removed the need to define a metric when using webhooks; can you still define a metric, and will it complement the webhook scaling?

mumoshu (Collaborator) commented Mar 12, 2021

Does the webhook scaling work in conjunction with the regular metrics?

Yes

I saw you removed the need to define a metric when using webhooks

Yes. Actually, it only works in conjunction with the regular metrics, both before and after I removed the need to explicitly define metrics. It wasn't working for the repository runners + TotalNumberOfQueuedAndInProgressWorkflowRuns combination, so I fixed it in #381. Other combinations should have worked and will keep working.

can you still define a metric, and will it complement the webhook scaling?

Exactly! In other words, the webhook-based scaling works in combination with either TotalNumberOfQueuedAndInProgressWorkflowRuns or PercentageRunnersBusy. When you omit HRA.Spec.Metrics it defaults to TotalNumberOfQueuedAndInProgressWorkflowRuns. So you can say that it is still working in conjunction with the regular metrics.

Collaborator


I've changed my mind and will soon change the default Metrics[] when ScaleUpTriggers isn't empty, so that ScaleUpTriggers can actually be used standalone. But the general idea described above should still stand.
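Once that change lands, a webhook-only HRA should look roughly like the sketch below; with no metrics[] defined, scaling would be driven purely by webhook deliveries and would not poll the GitHub API (the names and event filter are illustrative):

```yaml
apiVersion: actions.summerwind.dev/v1alpha1
kind: HorizontalRunnerAutoscaler
metadata:
  name: example-webhook-only-autoscaler
spec:
  scaleTargetRef:
    name: example-runner-deployment
  minReplicas: 1
  maxReplicas: 10
  # No metrics[] here: after the change described above, the controller
  # no longer falls back to TotalNumberOfQueuedAndInProgressWorkflowRuns.
  scaleUpTriggers:
  - githubEvent:
      checkRun:
        types: ["created"]
        status: "queued"
    amount: 1
    duration: "5m"
```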

mumoshu (Collaborator) left a comment

LGTM. Thank you so much for your continuous effort @callum-tait-pbx ☺️

@mumoshu mumoshu merged commit a6270b4 into actions:master Mar 11, 2021
mumoshu added a commit that referenced this pull request Mar 14, 2021
… enabled (#391)

Relates to #379 (comment)
Relates to #377 (comment)

When you define HRA.Spec.ScaleUpTriggers[] but not HRA.Spec.Metrics[], the HRA controller will now enable ScaleUpTriggers alone instead of automatically enabling TotalNumberOfQueuedAndInProgressWorkflowRuns. This allows you to use ScaleUpTriggers on their own, so that the autoscaling is done without calling the GitHub API at all, which should greatly decrease the chance of GitHub API calls getting rate-limited.