Increase the default $BMO_CONCURRENCY for scale #906

zaneb · 2021-06-01T16:48:22Z

Instead of defaulting the $BMO_CONCURRENCY value (the maximum number of
Hosts to reconcile concurrently) to a hard-coded value of 3, set it
instead to the number of CPU threads available, but constrained to a
range between 2 and 8.

We never default to 1 so that we don't inadvertently rely on
single-threadedness in tests. 8 seems to be a reasonable value for a
large scale deployment, while still being substantially below the
default $PROVISIONING_LIMIT of 20.

There remains no restriction on the value that can be passed in the
environment variable.
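
As a rough illustration of the behaviour described above, here is a minimal sketch. The helper names `clampConcurrency` and `resolveConcurrency` are hypothetical and are not taken from the patch. How the controller parses the variable, and treating an empty value as unset, are assumptions made for this example.

```go
package main

import (
	"fmt"
	"os"
	"runtime"
	"strconv"
)

// clampConcurrency constrains a CPU count to the default range [2, 8].
// Hypothetical helper, named for illustration only.
func clampConcurrency(numCPU int) int {
	const minDefault, maxDefault = 2, 8
	if numCPU < minDefault {
		return minDefault
	}
	if numCPU > maxDefault {
		return maxDefault
	}
	return numCPU
}

// resolveConcurrency returns the value from the environment when one is
// given, without applying any range restriction to it. Otherwise it falls
// back to the clamped CPU count. An empty string is treated as "unset",
// which is an assumption of this sketch.
func resolveConcurrency(envValue string, numCPU int) (int, error) {
	if envValue == "" {
		return clampConcurrency(numCPU), nil
	}
	n, err := strconv.Atoi(envValue)
	if err != nil {
		return 0, fmt.Errorf("invalid BMO_CONCURRENCY value %q: %w", envValue, err)
	}
	return n, nil
}

func main() {
	n, err := resolveConcurrency(os.Getenv("BMO_CONCURRENCY"), runtime.NumCPU())
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	fmt.Println("max concurrent reconciles:", n)
}
```

With this shape, only the default is bounded. An explicit value such as `BMO_CONCURRENCY=32` is passed through unchanged.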

Fixes #905

zaneb · 2021-06-01T16:56:08Z

/test-integration

flaper87

This looks good! Thanks for the PR

flaper87

/lgtm

metal3-io-bot · 2021-06-02T06:19:11Z

@flaper87: adding LGTM is restricted to approvers and reviewers in OWNERS files.

In response to this:

/lgtm

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

dtantsur · 2021-06-02T09:48:49Z

/approve

metal3-io-bot · 2021-06-02T09:51:30Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dtantsur

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [dtantsur]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

andfasano · 2021-06-03T13:04:42Z

controllers/metal3.io/baremetalhost_controller.go

@@ -1312,7 +1313,13 @@ func (r *BareMetalHostReconciler) updateEventHandler(e event.UpdateEvent) bool {
 // SetupWithManager reigsters the reconciler to be run by the manager
 func (r *BareMetalHostReconciler) SetupWithManager(mgr ctrl.Manager) error {

-	maxConcurrentReconciles := 3
+	maxConcurrentReconciles := runtime.NumCPU()
+	if maxConcurrentReconciles > 8 {


Instead of using a hard-coded value, I was thinking that it could be useful to set the upper limit to something like min(PROVISIONING_LIMIT/2.5, 1), just to enforce the ratio between the two values in case the user sets a particularly low value for PROVISIONING_LIMIT.

Is there a plan to keep this configurable as an option (via env)?

Is there a plan to keep this configurable as an option (via env)?

It is still configurable.

Instead of using a hard-coded value, I was thinking that it could be useful to set the upper limit to something like min(PROVISIONING_LIMIT/2.5, 1), just to enforce the ratio between the two values in case the user sets a particularly low value for PROVISIONING_LIMIT.

I can't imagine anyone setting a smaller limit except on very small hardware, in which case the CPU count should scale the default appropriately for us. I can imagine people setting a bigger limit, but I don't think we want to increase the default concurrency in that case. I don't think we should tie these two settings together because ultimately they limit different things.

It's not about tying them together, it's just about the currently proposed value for the default upper limit of maxConcurrentReconciles - and I think there was an error in my previous comment, as I meant something like:

maxConcurrentReconciles = min(PROVISIONING_LIMIT/2.5, runtime.NumCPU())

which for a default PROVISIONING_LIMIT value of 20 will limit maxConcurrentReconciles to 8 on a system with more than 8 cores.

Given that the current proposal is to define the default value based on the number of available CPUs, the basic idea is to use the available CPUs as much as possible, even though the benefits could be marginal (in general I'd expect BMO to run within a container, which could already have resource requests and/or limits set), while at the same time honoring the fact that maxConcurrentReconciles << PROVISIONING_LIMIT.
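
To make the arithmetic of this suggestion concrete, here is a small sketch of min(PROVISIONING_LIMIT/2.5, runtime.NumCPU()). The function name `proposedDefault` is hypothetical. Rounding the ratio down to an integer is an assumption, since the comment does not say how fractional results would be handled, and no lower bound is applied here.

```go
package main

import "fmt"

// proposedDefault sketches the suggested upper limit:
// min(provisioningLimit/2.5, numCPU).
// Hypothetical helper; the rounding behaviour is an assumption.
func proposedDefault(provisioningLimit, numCPU int) int {
	// Integer form of provisioningLimit/2.5, rounded down.
	ratioCap := provisioningLimit * 2 / 5
	if numCPU < ratioCap {
		return numCPU
	}
	return ratioCap
}

func main() {
	// With the default PROVISIONING_LIMIT of 20 the ratio cap is 8.
	for _, cpus := range []int{4, 8, 16} {
		fmt.Printf("limit=20 cpus=%d -> %d\n", cpus, proposedDefault(20, cpus))
	}
}
```

Note that with a very low PROVISIONING_LIMIT this expression can drop below 2, so a real implementation would still need to decide on a minimum.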

furkatgofurov7

This looks good. The only thing is that we have to update the documentation accordingly, whether we hard-code the upper limit for $BMO_CONCURRENCY or compute it.


zaneb · 2021-06-07T13:42:57Z

/test-integration

andfasano · 2021-06-08T10:06:32Z

The current changes could be a good improvement over the previous situation, and it looks like there could be room for further improvements in the future.

/lgtm

zaneb requested a review from andfasano June 1, 2021 16:48

metal3-io-bot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Jun 1, 2021

flaper87 reviewed Jun 2, 2021


metal3-io-bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 2, 2021

andfasano reviewed Jun 3, 2021


furkatgofurov7 reviewed Jun 5, 2021


zaneb added 2 commits June 7, 2021 09:40

Don't check registration status in MatchProfile

8daa459

We never call the provisioner in this state, so speed it up by never checking the current registration status.

zaneb force-pushed the scale-concurrency branch from d4f0b82 to 8daa459 Compare June 7, 2021 13:41

metal3-io-bot added the lgtm Indicates that a PR is ready to be merged. label Jun 8, 2021

metal3-io-bot merged commit f5228fa into metal3-io:master Jun 8, 2021

zaneb mentioned this pull request Mar 16, 2023

✨ Enable concurrency in BMO controllers through cmdline flags #1235

Closed
