Bug 1861642: Add maxNodeProvisionTime for baremetal #158

hardys · 2020-07-31T13:14:39Z

In baremetal environments it can easily take longer than the CA
default of 15mins for the node to become active after a scale-out
action.

The CA supports --max-node-provision-time[1] so adding support to
enable configuration of that value should allow tuning of the
time such that it's more suited to baremetal.

[1] https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/FAQ.md#what-are-the-parameters-to-ca

openshift-ci-robot · 2020-07-31T13:14:46Z

@hardys: This pull request references Bugzilla bug 1861642, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.

3 validation(s) were run on this bug

bug is open, matching expected state (open)
bug target release (4.6.0) matches configured target release for branch (4.6.0)
bug is in the state ASSIGNED, which is one of the valid states (NEW, ASSIGNED, ON_DEV, POST, POST)

In response to this:

Bug 1861642: Add maxNodeProvisionTime for baremetal

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

hardys · 2020-07-31T13:16:56Z

I've tested this locally to confirm no regressions, but I can't see a way to get the new CRD definition - I guess the CVO has an old copy, any tips on how to work around that for local testing?

elmiko

this looks mostly reasonable to me, i have a couple questions.

sadly, i do not have advice on how to solve crd chicken/egg issue =(

install/01_clusterautoscaler.crd.yaml

pkg/controller/clusterautoscaler/clusterautoscaler_test.go

elmiko · 2020-07-31T14:43:11Z

@enxebre @JoelSpeed ptal

elmiko · 2020-07-31T15:17:39Z

/lgtm

pkg/controller/clusterautoscaler/clusterautoscaler.go

install/01_clusterautoscaler.crd.yaml

install/02_machineautoscaler.crd.yaml

hardys · 2020-08-03T15:36:39Z

/test e2e-aws-operator

install/01_clusterautoscaler.crd.yaml

JoelSpeed · 2020-08-04T15:46:28Z

I'm happy to approve this once the format is updated, apologies, literally found out there's a small bug in it earlier today 😅

In baremetal environments it can easily take longer than the CA default of 15mins for the node to become active after a scale-out action. The CA supports --max-node-provision-time[1] so adding support to enable configuration of that value should allow tuning of the time such that it's more suited to baremetal. Accepted review suggestion to fix bug in regex Co-authored-by: Joel Speed <Joel.speed@hotmail.co.uk> [1] https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/FAQ.md#what-are-the-parameters-to-ca

hardys · 2020-08-05T12:35:04Z

/test e2e-aws-operator

pkg/apis/autoscaling/v1/clusterautoscaler_types.go

Add review suggestion Co-authored-by: Joel Speed <Joel.speed@hotmail.co.uk>

JoelSpeed · 2020-08-05T14:13:35Z

/approve

openshift-ci-robot · 2020-08-05T14:13:53Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: JoelSpeed

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [JoelSpeed]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

hardys · 2020-08-05T15:07:05Z

/retest

hardys · 2020-08-06T07:23:35Z

/retest

hardys · 2020-08-06T15:30:49Z

/test e2e-aws

hardys · 2020-08-07T12:04:24Z

/test e2e-aws

hardys · 2020-08-12T12:53:16Z

@elmiko - please could you revisit this when you get a moment - thanks! :)

elmiko

thanks @hardys !
/lgtm

openshift-bot · 2020-08-12T16:48:29Z

/retest

Please review the full test history for this PR and help us cut down flakes.

openshift-bot · 2020-08-12T17:53:26Z

/retest

Please review the full test history for this PR and help us cut down flakes.

openshift-bot · 2020-08-12T18:06:24Z

/retest

Please review the full test history for this PR and help us cut down flakes.

openshift-bot · 2020-08-12T18:19:25Z

/retest

Please review the full test history for this PR and help us cut down flakes.

openshift-ci-robot · 2020-08-12T19:53:41Z

@hardys: All pull requests linked via external trackers have merged: openshift/cluster-autoscaler-operator#158. Bugzilla bug 1861642 has been moved to the MODIFIED state.

In response to this:

Bug 1861642: Add maxNodeProvisionTime for baremetal

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

openshift-ci-robot added bugzilla/severity-medium Referenced Bugzilla bug's severity is medium for the branch this PR is targeting. bugzilla/valid-bug Indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting. labels Jul 31, 2020

openshift-ci-robot requested review from elmiko and enxebre July 31, 2020 13:14

elmiko reviewed Jul 31, 2020

View reviewed changes

install/01_clusterautoscaler.crd.yaml Outdated Show resolved Hide resolved

pkg/controller/clusterautoscaler/clusterautoscaler_test.go Show resolved Hide resolved

hardys force-pushed the bz/1861642 branch from 6ad336f to f8a7872 Compare July 31, 2020 14:39

openshift-ci-robot assigned elmiko Jul 31, 2020

openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Jul 31, 2020

JoelSpeed reviewed Jul 31, 2020

View reviewed changes

hardys force-pushed the bz/1861642 branch from f8a7872 to 050fe40 Compare August 3, 2020 08:18

openshift-ci-robot removed the lgtm Indicates that a PR is ready to be merged. label Aug 3, 2020

hardys force-pushed the bz/1861642 branch from 050fe40 to 089ada1 Compare August 4, 2020 15:00

JoelSpeed reviewed Aug 4, 2020

View reviewed changes

install/01_clusterautoscaler.crd.yaml Outdated Show resolved Hide resolved

hardys force-pushed the bz/1861642 branch from 18d899c to 6cde7f9 Compare August 5, 2020 11:10

JoelSpeed reviewed Aug 5, 2020

View reviewed changes

pkg/apis/autoscaling/v1/clusterautoscaler_types.go Outdated Show resolved Hide resolved

Update pkg/apis/autoscaling/v1/clusterautoscaler_types.go

283a1c7

Add review suggestion Co-authored-by: Joel Speed <Joel.speed@hotmail.co.uk>

openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 5, 2020

elmiko approved these changes Aug 12, 2020

View reviewed changes

openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Aug 12, 2020

openshift-merge-robot merged commit 472ecfc into openshift:master Aug 12, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug 1861642: Add maxNodeProvisionTime for baremetal #158

Bug 1861642: Add maxNodeProvisionTime for baremetal #158

hardys commented Jul 31, 2020

openshift-ci-robot commented Jul 31, 2020

hardys commented Jul 31, 2020

elmiko left a comment

elmiko commented Jul 31, 2020

elmiko commented Jul 31, 2020

hardys commented Aug 3, 2020

JoelSpeed commented Aug 4, 2020

hardys commented Aug 5, 2020

JoelSpeed commented Aug 5, 2020

openshift-ci-robot commented Aug 5, 2020

hardys commented Aug 5, 2020

hardys commented Aug 6, 2020

hardys commented Aug 6, 2020

hardys commented Aug 7, 2020

hardys commented Aug 12, 2020

elmiko left a comment

openshift-bot commented Aug 12, 2020

openshift-bot commented Aug 12, 2020

openshift-bot commented Aug 12, 2020

openshift-bot commented Aug 12, 2020

openshift-ci-robot commented Aug 12, 2020

Bug 1861642: Add maxNodeProvisionTime for baremetal #158

Bug 1861642: Add maxNodeProvisionTime for baremetal #158

Conversation

hardys commented Jul 31, 2020

openshift-ci-robot commented Jul 31, 2020

hardys commented Jul 31, 2020

elmiko left a comment

Choose a reason for hiding this comment

elmiko commented Jul 31, 2020

elmiko commented Jul 31, 2020

hardys commented Aug 3, 2020

JoelSpeed commented Aug 4, 2020

hardys commented Aug 5, 2020

JoelSpeed commented Aug 5, 2020

openshift-ci-robot commented Aug 5, 2020

hardys commented Aug 5, 2020

hardys commented Aug 6, 2020

hardys commented Aug 6, 2020

hardys commented Aug 7, 2020

hardys commented Aug 12, 2020

elmiko left a comment

Choose a reason for hiding this comment

openshift-bot commented Aug 12, 2020

openshift-bot commented Aug 12, 2020

openshift-bot commented Aug 12, 2020

openshift-bot commented Aug 12, 2020

openshift-ci-robot commented Aug 12, 2020