Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.Sign up
nixos/alertmanager: start after the network-online target #82808
Motivation for this change
Since an upgrade to 20.03, the Prometheus alertmanager is no longer starting after a reboot on my machine:
If the host network stack is slow to start, the alertmanager fails to
This bug can be reproduced by shutting down the network stack and
Note I don't know why I didn't hit this issue with previous
I deployed the patch on this machine and the issue is now fixed.
We don't have any tests for the alertmanager. And i'm not able to reproduce the issue in a private Prometheus NixOS test because the VM network stack is up too fast.
If the host network stack is slow to start, the alertmanager fails to start with this error message: caller=main.go:256 msg="unable to initialize gossip mesh" err="create memberlist: Failed to get final advertise address: No private IP address found, and explicit IP not provided" This bug can be reproduced by shutting down the network stack and restarting the alertmanager. Note I don't know why I didn't hit this issue with previous alertmanager releases.