-
Notifications
You must be signed in to change notification settings - Fork 522
fix: mark walinux for hold in cloud-init #778
fix: mark walinux for hold in cloud-init #778
Conversation
@@ -57,10 +57,6 @@ else | |||
FULL_INSTALL_REQUIRED=true | |||
fi | |||
|
|||
if [[ $OS == $UBUNTU_OS_NAME ]]; then |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we need to get rid of the "apt-mark hold" invocation in CSE, as the hold command will wait for the release of apt locks, which may take a long time if cloud-init has already reserved them.
/azp run pr-e2e |
Azure Pipelines successfully started running 1 pipeline(s). |
Codecov Report
@@ Coverage Diff @@
## master #778 +/- ##
=======================================
Coverage 67.58% 67.58%
=======================================
Files 115 115
Lines 16856 16856
=======================================
Hits 11392 11392
Misses 4677 4677
Partials 787 787 |
parts/k8s/kubernetescustomscript.sh
Outdated
@@ -179,7 +175,7 @@ if $REBOOTREQUIRED; then | |||
echo 'reboot required, rebooting node in 1 minute' | |||
/bin/bash -c "shutdown -r 1 &" | |||
if [[ $OS == $UBUNTU_OS_NAME ]]; then | |||
holdWALinuxAgent "unhold" | |||
aptmarkWALinuxAgent unhold |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
should this be a background process to return control to the user without waiting for it to finish?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm pending lots of passing e2e tests :)
owner: root | ||
content: | | ||
Package: walinuxagent | ||
Pin: version 2.2.32.2 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is that the version that is currently in VHD? or the version we are currently being upgraded to ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The version in the VHD
/azp run pr-e2e |
Azure Pipelines successfully started running 1 pipeline(s). |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: jackfrancis, mboersma The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
(cherry picked from commit 99f9405)
(cherry picked from commit 99f9405)
(cherry picked from commit 99f9405)
Reason for Change:
Follow-up of #771
Testing against the changes in #771 revealed that cloud-init itself is invoking
apt
to install some packages:This means in practice that when we attempt to
apt-mark hold walinuxagent
in CSE it may fail due to cloud-init already having reserved the lock.Additionally, we are explicitly pinning the version of walinuxagent to further protect against runtime changes to the walinuxagent. This last change is a defensive manouver. Based on production data following the introduction of
2.2.37
we want to introduce new version changes purposefully.We can consider reverting to the prior behavior once we recover create stability, if the static version pinning produces undesired effects for long-lived clusters who wish to update their
walinuxagent
versions "out of band".Issue Fixed:
Requirements:
Notes: