You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Oct 22, 2020. It is now read-only.
Overlord continues to wait for a node to finish running its service but it will indefinitely sit there waiting and not continue.
The reason is that a unit or multiple units failed on a node as shown when doing a fleetctl list-units
When you log into the failed node and issue a fleetctl status <unit_name> it gives an error due to not being able to pull the binaries from the Internet. This is due to the fact that systemd-networkd was restarted too many times and it exited from continuing to attempt restarting the service as shown in systemdctl status systemd-networkd. The issue has to do with the issuance of multiple network devices requiring a restart of systemd-networkd and its restarting too many times for it to be happy.
This issue happens every so often but not always.
The simple work around unfortunately is to destroy the stack all-together via Heat and create a new stack or restart the systemd-networkd unit as well as any other units on the failed nodes, and then observing of the logs of the overlord to make sure it completed
The text was updated successfully, but these errors were encountered:
Closing issue as manual/hard-coded networking of all network devices & configs for overlay has been replaced by CoreOS Flannel; therefore, this issue no longer exists thanks to commit 2ab5885
Overlord continues to wait for a node to finish running its service but it will indefinitely sit there waiting and not continue.
The reason is that a unit or multiple units failed on a node as shown when doing a
fleetctl list-units
When you log into the failed node and issue a
fleetctl status <unit_name>
it gives an error due to not being able to pull the binaries from the Internet. This is due to the fact that systemd-networkd was restarted too many times and it exited from continuing to attempt restarting the service as shown insystemdctl status systemd-networkd
. The issue has to do with the issuance of multiple network devices requiring a restart of systemd-networkd and its restarting too many times for it to be happy.This issue happens every so often but not always.
The simple work around unfortunately is to destroy the stack all-together via Heat and create a new stack or restart the systemd-networkd unit as well as any other units on the failed nodes, and then observing of the logs of the overlord to make sure it completed
The text was updated successfully, but these errors were encountered: