Unit with After=network-online.target starts up before an IP is on the interface #1966

MohammadKarimi23 · 2017-05-14T20:17:14Z

Bug

Container Linux Version

NAME="Container Linux by CoreOS"
ID=coreos
VERSION=1298.7.0
VERSION_ID=1298.7.0
BUILD_ID=2017-03-31-0215
PRETTY_NAME="Container Linux by CoreOS 1298.7.0 (Ladybug)"
ANSI_COLOR="38;5;75"
HOME_URL="https://coreos.com/"
BUG_REPORT_URL="https://github.com/coreos/bugs/issues

Environment

HP gen9 server

Bug Description

I'm using CoreOS matchbox for network boot and provisioning my private cloud.
matchbox uses systemd units for booting the system and I'm following matchbod examples at https://github.com/coreos/matchbox/blob/v0.6.0/examples/ignition/install-reboot.yaml

installer.service contains Requires=network-online.target and After=network-online.target but the service fails to curl the ignition file needed for installer and returns 7/failed_to_connect error, but when I curl the file manually after boot time, the curl is successful

The text was updated successfully, but these errors were encountered:

crawford · 2017-05-15T16:29:53Z

The underlying issue is that network-online.target isn't a silver bullet for detecting if the network is up. It's intended to be used with legacy applications that don't handle network changes properly. This example should be smart enough to retry the network operation and therefore drop the network online dependency.

MohammadKarimi23 · 2017-05-16T15:47:53Z

I actually solved the problem using until ping command as 'ExecStartPre' like below

[Service]
        ExecStartPre=/bin/sh -c 'until ping -c1 google.com; do sleep 1; done;'
        ExecStartPre=/usr/bin/mkdir -p /etc/kubernetes/ssl
        ExecStart=/usr/bin/bash -c "[ -f /etc/kubernetes/ssl/%i ] ||  curl {{.k8s_cert_endpoint}}/tls/%i -o /etc/kubernetes/ssl/%i"

but I appreciate if you show me a way to do that using systemd units

crawford · 2017-05-16T17:05:41Z

That seems like a fine solution to me.

euank · 2017-06-30T18:03:49Z

I opened poseidon/matchbox#596 to hopefully fix this. I think discussion can be moved there.

When telegraf starts with localhost.localdomain as the hostname, it sticks with that hostname even after the hostname changes later on. This happens mostly during server startup. Posting points with localhost.localdomain messes up lot of monitoring aspects. On systemd-networkd based systems, the following link explains how to make a systemd service start after the interface is fully setup. (refer https://www.freedesktop.org/wiki/Software/systemd/NetworkTarget/) But using repeated experiments it was confirmed that those steps do not work reliably. There are reports from other users about network-online.target not behaving as expected. See https://community.getchannels.com/t/wait-for-networking-on-reboot-ubuntu-network-online-target/936 So we are using a simple and yet gauranteed workaround suggested in coreos/bugs#1966 Repeated testing on ce114 confirmed that this method is completely reliable. This issue is avoided in dhclient based systems since we rely on dhclient helper to delay start telegraf. However, we could use the same change on dhclient based systems too if needed.

crawford added area/stability component/matchbox kind/friction low hanging fruit priority/P1 team/os labels May 15, 2017

MohammadKarimi23 mentioned this issue May 15, 2017

Unit with After=network-online.target starts up before an IP is on the interface #170

Closed

euank closed this as completed Jun 30, 2017

kthommandra mentioned this issue Aug 30, 2018

wait for hostname to be available before starting aristanetworks/telegraf#30

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unit with After=network-online.target starts up before an IP is on the interface #1966

Unit with After=network-online.target starts up before an IP is on the interface #1966

MohammadKarimi23 commented May 14, 2017

crawford commented May 15, 2017

MohammadKarimi23 commented May 16, 2017

crawford commented May 16, 2017

euank commented Jun 30, 2017

Unit with After=network-online.target starts up before an IP is on the interface #1966

Unit with After=network-online.target starts up before an IP is on the interface #1966

Comments

MohammadKarimi23 commented May 14, 2017

Bug

Container Linux Version

Environment

Bug Description

crawford commented May 15, 2017

MohammadKarimi23 commented May 16, 2017

crawford commented May 16, 2017

euank commented Jun 30, 2017