
Enable shared gateway mode for OVN #727

Merged: 1 commit, Aug 2, 2020

Conversation

@trozet (Contributor) commented Jul 23, 2020

This patch migrates from using Local gateway mode to Shared gateway mode
with ovn-kubernetes. With shared gateway mode, the external network NIC
is now directly configured as part of an OVS bridge, which has a Layer 2
connection to OVN, effectively allowing OVN to share the NIC with the
host as a Layer 2 network.

Unlike Local gateway mode, this eliminates the need to route through the
kernel for certain OVN traffic to egress the host.

Signed-off-by: Tim Rozet <trozet@redhat.com>
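
For readers new to the two gateway modes, here is a minimal sketch of what "the external NIC becomes part of an OVS bridge" means in practice. It is illustrative only: the bridge and NIC names (br-ex, eth0) are assumptions, and the real setup is performed by the MCO change referenced below, not by these exact commands.

    # Illustrative sketch of shared gateway mode at the OVS level; names assumed.
    ovs-vsctl --may-exist add-br br-ex          # external bridge shared with OVN
    ovs-vsctl --may-exist add-port br-ex eth0   # attach the physical NIC to it
    # The host's IP configuration then moves from eth0 to br-ex, so the host and
    # OVN (patched into br-ex from br-int) share the NIC on one Layer 2 segment.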

@trozet (Contributor, Author) commented Jul 23, 2020

/hold

Wait for corresponding MCO change:
openshift/machine-config-operator#1860

@openshift-ci-robot added the do-not-merge/hold label (indicates that a PR should not merge because someone has issued a /hold command) on Jul 23, 2020
@trozet force-pushed the enabled_shared_gw branch 2 times, most recently from dbcb3c1 to ae664c2 on July 24, 2020 22:32
@openshift-ci-robot added the needs-rebase label (indicates a PR cannot be merged because it has merge conflicts with HEAD) on Jul 25, 2020
@danwinship (Contributor) commented:

lgtm and I think it can be un-hold-ed now, but it needs to be rebased anyway

@trozet (Contributor, Author) commented Jul 29, 2020

> lgtm and I think it can be un-hold-ed now, but it needs to be rebased anyway

waiting for openshift/ovn-kubernetes#216 which has a shared gateway fix in it

Enable shared gateway mode for OVN (commit message unchanged from the PR description above; Signed-off-by: Tim Rozet <trozet@redhat.com>)
@openshift-ci-robot removed the needs-rebase label (indicates a PR cannot be merged because it has merge conflicts with HEAD) on Jul 29, 2020
@trozet (Contributor, Author) commented Jul 29, 2020

/hold cancel

@openshift-ci-robot removed the do-not-merge/hold label (indicates that a PR should not merge because someone has issued a /hold command) on Jul 29, 2020
@trozet (Contributor, Author) commented Jul 29, 2020

/retest

@trozet (Contributor, Author) commented Jul 29, 2020

/assign @danwinship

Note GCP will fail until we have a working fix for shared gw mode and routes. AWS should pass.

@trozet (Contributor, Author) commented Jul 30, 2020

/retest

2 similar comments

@trozet (Contributor, Author) commented Jul 30, 2020

@stbenjam introspection is failing here:

level=error
level=error msg="Error: could not inspect: could not inspect node, node is currently 'inspect failed', last error was 'Failed to inspect hardware. Reason: unable to start inspection: 'System' object has no attribute 'set_system_boot_options''"
level=error
level=error msg="  on ../../tmp/openshift-install-198912340/masters/main.tf line 1, in resource \"ironic_node_v1\" \"openshift-master-host\":"
level=error msg="   1: resource \"ironic_node_v1\" \"openshift-master-host\" {"
level=error
level=error

I'm guessing this networking change may have some effect on metal IPI. We now move the physical interface into OVS during CoreOS startup. With Ironic introspection in OpenStack, we used to load the introspection image first, run introspection, then reboot with the real image. How is it being done in OCP?

@knobunc (Contributor) commented Jul 30, 2020

/approve

@openshift-ci-robot added the approved label (indicates a PR has been approved by an approver from all required OWNERS files) on Jul 30, 2020
@knobunc (Contributor) commented Jul 30, 2020

Closing the loop on an earlier comment by @trozet. @stbenjam says that the metal failure is not due to this PR. They have a fix at openshift-metal3/dev-scripts#1076

@stbenjam (Member) commented:

Things should be better now. We've fixed the introspection error and moved back to IPv6.

/test e2e-metal-ipi

@stbenjam (Member) commented:

The current e2e-metal-ipi failure looks real:

level=error msg="Cluster operator network Degraded is True with RolloutHung: DaemonSet \"openshift-ovn-kubernetes/ovnkube-node\" rollout is not making progress - pod ovnkube-node-m46pw is in CrashLoopBackOff State\nDaemonSet \"openshift-ovn-kubernetes/ovnkube-node\" rollout is not making progress - pod ovnkube-node-b8jdn is in CrashLoopBackOff State\nDaemonSet \"openshift-ovn-kubernetes/ovnkube-node\" rollout is not making progress - pod ovnkube-node-hctv9 is in CrashLoopBackOff State\nDaemonSet \"openshift-ovn-kubernetes/ovnkube-node\" rollout is not making progress - last change 2020-07-30T20:41:19Z"

Bootstrap log bundle is @ https://gcsweb-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/origin-ci-test/pr-logs/pull/openshift_cluster-network-operator/727/pull-ci-openshift-cluster-network-operator-master-e2e-metal-ipi/1288926158213091328/artifacts/e2e-metal-ipi/baremetalds-devscripts-gather/
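
For anyone digging into the CrashLoopBackOff, a hedged example of pulling the previous logs from one of the crashing pods named in that error (standard oc/kubectl flags; nothing here is specific to this PR):

    oc -n openshift-ovn-kubernetes logs ovnkube-node-m46pw --previous --all-containers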

@trozet (Contributor, Author) commented Jul 31, 2020

/retest

@trozet (Contributor, Author) commented Jul 31, 2020

> The current e2e-metal-ipi failure looks real:
>
> level=error msg="Cluster operator network Degraded is True with RolloutHung: DaemonSet \"openshift-ovn-kubernetes/ovnkube-node\" rollout is not making progress - pod ovnkube-node-m46pw is in CrashLoopBackOff State\nDaemonSet \"openshift-ovn-kubernetes/ovnkube-node\" rollout is not making progress - pod ovnkube-node-b8jdn is in CrashLoopBackOff State\nDaemonSet \"openshift-ovn-kubernetes/ovnkube-node\" rollout is not making progress - pod ovnkube-node-hctv9 is in CrashLoopBackOff State\nDaemonSet \"openshift-ovn-kubernetes/ovnkube-node\" rollout is not making progress - last change 2020-07-30T20:41:19Z"
>
> Bootstrap log bundle is @ https://gcsweb-ci.apps.ci.l2s4.p1.openshiftapps.com/gcs/origin-ci-test/pr-logs/pull/openshift_cluster-network-operator/727/pull-ci-openshift-cluster-network-operator-master-e2e-metal-ipi/1288926158213091328/artifacts/e2e-metal-ipi/baremetalds-devscripts-gather/

@stbenjam For some reason I don't see an ovnkube log in must-gather-ipi...tar -> bootstrap/containers

@stbenjam (Member) commented Jul 31, 2020

> @stbenjam For some reason I don't see an ovnkube log in must-gather-ipi...tar -> bootstrap/containers

Yeah, sorry: the log bundle is what we were able to capture (log-bundle-ipi-ci-op-92dzpci2-0c056-20200730T200748.tar). It doesn't look like things came up enough to collect a whole must-gather, but we do have some logs from the masters.

E.g., from ./control-plane/fd2e:6f44:5dd8:c956::14/containers/ovn-controller-9723b213aad102045724dc6b668432db9343f16cbd4afd5d4ecdf098992dafa9.log:

2020-07-30T21:08:00Z|00123|patch|ERR|Dropped 1 log messages in last 11 seconds (most recently, 11 seconds ago) due to excessive rate
2020-07-30T21:08:00Z|00124|patch|ERR|bridge not found for localnet port 'lnet-node_local_switch' with network name 'locnet'
2020-07-30T21:08:18Z|00125|patch|ERR|Dropped 3 log messages in last 18 seconds (most recently, 18 seconds ago) due to excessive rate
2020-07-30T21:08:18Z|00126|patch|ERR|bridge not found for localnet port 'lnet-node_local_switch' with network name 'locnet'

Not sure if that's relevant, but there's a bunch more logs in there from the other containers as well.
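
For context on that error: ovn-controller resolves a localnet port's network name through the ovn-bridge-mappings external-id in the local Open vSwitch database, so "bridge not found for localnet port ... with network name 'locnet'" means no bridge was mapped for locnet on that node. A hedged sketch of what such a mapping looks like (br-local is an assumed bridge name, not confirmed by these logs):

    # Map the OVN network name "locnet" to a local OVS bridge (bridge name assumed).
    ovs-vsctl set Open_vSwitch . external-ids:ovn-bridge-mappings=locnet:br-local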

@trozet (Contributor, Author) commented Jul 31, 2020

@stbenjam thanks. Now I see the error is:
F0730 21:08:03.130489 58258 ovnkube.go:130] failed to get default gateway interface

and the cause is visible in ovs-configuration.service.log:

Jul 30 20:38:13 master-2.ostest.test.metalkube.org systemd[1]: Starting Configures OVS with proper host networking configuration...
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: + iface=
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: + counter=0
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: + '[' 0 -lt 12 ']'
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: ++ jq -r '.[0].dev'
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: ++ ip -j route show default
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: + iface=null
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: + '[' -n null ']'
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: + echo 'Default gateway interface found: null'
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: Default gateway interface found: null
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: + break
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: + '[' null = br-ex ']'
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: + '[' -z null ']'
Jul 30 20:38:13 master-2.ostest.test.metalkube.org systemd[1]: ovs-configuration.service: Main process exited, code=exited, status=1/FAILURE
Jul 30 20:38:13 master-2.ostest.test.metalkube.org configure-ovs.sh[1898]: /usr/local/bin/configure-ovs.sh: line 35: /sys/class/net/null/address: No such file or directory

Looks like we get back "null" from jq, so we only try once there. That's a bug, but regardless, why was there no default gateway on this host? The node should come up and DHCP automatically before NetworkManager-wait-online. Are there system journals in this tarball?
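
A minimal sketch of the fix implied here, assuming the loop structure from the trace above: treat jq's literal "null" (which it prints when the route list is empty) the same as an empty result, so the loop actually retries instead of breaking on the first pass.

    # Sketch only; variable names follow the configure-ovs.sh trace above.
    iface=""
    counter=0
    while [ $counter -lt 12 ]; do
      # `ip -j` emits JSON; jq prints the string "null" for a missing field,
      # which is non-empty and previously passed the `[ -n ... ]` check.
      iface=$(ip -j route show default | jq -r '.[0].dev')
      if [ -n "$iface" ] && [ "$iface" != "null" ]; then
        echo "Default gateway interface found: $iface"
        break
      fi
      counter=$((counter+1))
      sleep 5
    done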

@openshift-ci-robot (Contributor) commented:

@knobunc: Overrode contexts on behalf of knobunc: ci/prow/e2e-gcp-ovn

In response to this:

> /override ci/prow/e2e-gcp-ovn


@trozet (Contributor, Author) commented Aug 2, 2020

@stbenjam FYI we do not have a fix yet for IPv6 with shared gw mode. @Billy99 is working on it. I would recommend reverting to IPv4 for a few days until he has that fixed (once this merges).

@trozet (Contributor, Author) commented Aug 2, 2020

/test e2e-aws-ovn

@openshift-bot (Contributor) commented:

/retest

Please review the full test history for this PR and help us cut down flakes.

17 similar comments

@openshift-ci-robot (Contributor) commented Aug 2, 2020

@trozet: The following tests failed, say /retest to rerun all failed tests:

Test name              Commit   Details  Rerun command
ci/prow/e2e-metal-ipi  dbe8527  link     /test e2e-metal-ipi
ci/prow/e2e-azure      dbe8527  link     /test e2e-azure
ci/prow/e2e-aws-ovn    dbe8527  link     /test e2e-aws-ovn

Full PR test history. Your PR dashboard.


@trozet (Contributor, Author) commented Aug 2, 2020

AWS had 1 failure; GCP only failed the prometheus test on the last run @knobunc.

It would still be nice to see the OVN step registry or AWS jobs fully pass.

@knobunc (Contributor) commented Aug 2, 2020

/override ci/prow/e2e-gcp-ovn
It's failing a prometheus test. There seems to be a bug with how prometheus measures availability. Tracking with bug https://bugzilla.redhat.com/show_bug.cgi?id=1862806.

@openshift-ci-robot (Contributor) commented:

@knobunc: Overrode contexts on behalf of knobunc: ci/prow/e2e-gcp-ovn

In response to this:

> /override ci/prow/e2e-gcp-ovn
> It's failing a prometheus test. There seems to be a bug with how prometheus measures availability. Tracking with bug https://bugzilla.redhat.com/show_bug.cgi?id=1862806.


@openshift-merge-robot merged commit ed9ccd7 into openshift:master on Aug 2, 2020
@@ -537,7 +539,8 @@ spec:
        hostPath:
          path: /var/lib/ovn/data
      - name: run-openvswitch
-       emptyDir: {}
+       hostPath:
A Contributor commented on the diff above:

@trozet can you add a type: Directory here so we never accidentally create this?
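
A sketch of what the requested change might look like. The hostPath path below is an assumption (the hunk above truncates before it); the key point is the type: Directory field, which makes the kubelet fail the mount when the directory does not already exist instead of ever creating it:

    - name: run-openvswitch
      hostPath:
        path: /var/run/openvswitch  # assumed path; not shown in the hunk above
        type: Directory             # mount fails if missing; never auto-created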

@stbenjam (Member) commented Aug 3, 2020

> @stbenjam FYI we do not have a fix yet for IPv6 with shared gw mode. @Billy99 is working on it. I would recommend reverting to IPv4 for a few days until he has that fixed (once this merges).

Thanks -- is there a BZ tracking the IPv6 bug?

@danwinship (Contributor) commented:

shared gateway IPv6 support: ovn-org/ovn-kubernetes#1462

@trozet (Contributor, Author) commented Aug 10, 2020

> > @stbenjam FYI we do not have a fix yet for IPv6 with shared gw mode. @Billy99 is working on it. I would recommend reverting to IPv4 for a few days until he has that fixed (once this merges).
>
> Thanks -- is there a BZ tracking the IPv6 bug?

https://bugzilla.redhat.com/show_bug.cgi?id=1866464

Labels

approved: Indicates a PR has been approved by an approver from all required OWNERS files.
lgtm: Indicates that a PR is ready to be merged.
9 participants