Add hybrid cluster extensions framework (v2) #36

dcbw · 2019-10-29T17:09:00Z

Due to the problems met in issue ovn-org/ovn-kubernetes#855, we should use OVS 2.11 instead of 2.10 to fix the problem of ovn-nbctl running in daemon mode. The OVS version used in the Dockerfile is therefore bumped to 2.11 to build an image suitable for the daemonset install. The DPDK library is also updated to 18.11 to satisfy the requirements of OVS 2.11. The related Dockerfile changes for the aarch64 (arm64) rpmArch are included as well. Signed-off-by: Trevor Tao <trevor.tao@arm.com>

Upgrade ovs version in Dockerfile to 2.11

Signed-off-by: Michael Cambria <mcambria@redhat.com>

Signed-off-by: Dan Williams <dcbw@redhat.com> (cherry picked from commit 76504df)

Signed-off-by: Shahar Klein <sklein@nvidia.com>

…centos Use a newer kubectl version for the centos image

Move management port creation to the master

Signed-off-by: Girish Moodalbail <gmoodalbail@nvidia.com>

remove the unnecessary annotation from the ovn-kubernetes namespace

Kubelet may keep multiple sandboxes running for a given pod if some are waiting for garbage collection. We need to make sure that only the latest sandbox has iface-id set to the container's namespace/name or ovn-controller may get confused about which OVS port to associate with the Pod's logical switch port. Signed-off-by: Dan Williams <dcbw@redhat.com>

Signed-off-by: Dan Williams <dcbw@redhat.com>

util: validate pod annotation

The serviceaccount user `ovn` has a cluster-admin role today and can do pretty much anything in the cluster. We need fine-grained access control and pod privileges for this user. To do this:

1. Define a PodSecurityPolicy object that captures the minimum security policies required to run our deployments and daemonset.
2. Define a ClusterRole object that captures all the resources we are interested in and all the actions we need on them. This role should also use the PodSecurityPolicy defined in step 1.
3. Bind the above role to the `ovn` serviceaccount.

Note: this commit adds (1), which allows almost any securityContext to be set (this can be restricted in future commits). Signed-off-by: Girish Moodalbail <gmoodalbail@nvidia.com>

cni/linux: ensure only the latest sandbox has external-ids:iface-id set

Commit 0c0aa45 moved the creation of the management lsp to the master, but the comment and error message in createManagementPortGeneric() don't reflect this. Also, util.GetNodeWellKnownAddresses() never returns an error, so let's remove the error return value and the error handling from callers.

management-port: clean up from move to master

If dynamic addressing is set up, but there are no addresses assigned, then since commit 11f284c, you'll see:

/usr/bin/ovn-nbctl --db=tcp:10.0.141.40:9641 --timeout=15 get logical_switch_port jtor-GR_ip-10-0-170-219.us-west-2.compute.internal addresses" stdout: \"[dynamic]\\n\"" stderr: \"\"" failed to localnet gateway: error while waiting for addresses for gateway switch port

whereas the error before was:

failed to localnet gateway: empty addresses for gateway switch port

Restore the previous behavior by recognizing that [dynamic] means that no addresses are assigned. Signed-off-by: Mark McLoughlin <markmc@redhat.com>
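The idea of treating `[dynamic]` as "nothing assigned yet" can be sketched as below. This is only an illustration: `parsePortAddresses` is a hypothetical helper, not the repository's GetPortAddresses(), and it assumes the static form of the column is a single quoted "MAC IP" pair.

```go
package main

import (
	"fmt"
	"net"
	"strings"
)

// parsePortAddresses is a hypothetical helper (not the real GetPortAddresses()).
// It takes the raw stdout of
//   ovn-nbctl get logical_switch_port <port> addresses
// and returns (nil, nil, nil) when no address has been assigned yet.
func parsePortAddresses(out string) (net.HardwareAddr, net.IP, error) {
	s := strings.TrimSpace(out)
	s = strings.Trim(s, "[]")
	s = strings.Trim(s, `"`)
	// An empty set or the bare keyword "dynamic" both mean "no addresses".
	if s == "" || s == "dynamic" {
		return nil, nil, nil
	}
	// Assumed static form: "<mac> <ip>".
	fields := strings.Fields(s)
	if len(fields) != 2 {
		return nil, nil, fmt.Errorf("unexpected addresses value %q", out)
	}
	mac, err := net.ParseMAC(fields[0])
	if err != nil {
		return nil, nil, fmt.Errorf("invalid MAC in %q: %v", out, err)
	}
	ip := net.ParseIP(fields[1])
	if ip == nil {
		return nil, nil, fmt.Errorf("invalid IP in %q", out)
	}
	return mac, ip, nil
}

func main() {
	for _, out := range []string{"[dynamic]\n", "[\"0a:00:00:00:00:01 10.128.0.2\"]\n"} {
		mac, ip, err := parsePortAddresses(out)
		fmt.Println(mac, ip, err)
	}
}
```

With a helper shaped like this, a caller can report "empty addresses" when both return values are nil instead of timing out while waiting.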

fixes the following lint warning post commit 0c0aa45 warning: cannot initialize 1 variables with 2 values (staticcheck) Signed-off-by: Girish Moodalbail <gmoodalbail@nvidia.com>

Handle static vs dynamic better in GetPortAddresses()

The connection tracking offload is not yet merged into the kernel and OVS, so update the doc to reflect that. Once all the patches are merged we will update the doc with the relevant kernel and OVS versions. This patch also fixes the sriov device plugin JSON example. Signed-off-by: moshe010 <moshele@mellanox.com>

For egress network bandwidth, we configure ingress_policing_rate, but, by itself, it might end up allowing less than the configured rate (e.g. iperf TCP with a rate of 4Gbps gives < 1Gbps). The OVSDB schema has this to say about ingress_policing_burst: "Specifying a larger burst size lets the algorithm be more forgiving, which is important for protocols like TCP that react severely to dropped packets. The burst size should be at least the size of the interface’s MTU. Specifying a value that is numerically at least as large as 10% of ingress_policing_rate helps TCP come closer to achieving the full rate." Set ingress_policing_burst to 10% of ingress_policing_rate. Additionally, for an SR-IOV VF we also need to configure its max_tx_rate using the egress value in Mbps. Tested by configuring egress-bandwidth for a pod and using iperf. Tested the VF setting by giving a VF as OVN interface to the pod, with egress-bandwidth configured. Signed-off-by: venu iyer <venugopali@nvidia.com>
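As a rough illustration of the arithmetic, the sketch below derives the burst and the VF rate from a policing rate. The function names are hypothetical, the input is assumed to already be in kbps (the unit OVS uses for ingress_policing_rate), and rounding down to whole Mbps is an assumption rather than something the commit states.

```go
package main

import "fmt"

// policingBurstKb returns 10% of the policing rate, following the OVSDB
// schema advice quoted above. Hypothetical helper; rate is in kbps, burst in kb.
func policingBurstKb(rateKbps int64) int64 {
	return rateKbps / 10
}

// vfMaxTxRateMbps converts the same rate to Mbps for an SR-IOV VF.
// Rounding down is an assumption made for this sketch.
func vfMaxTxRateMbps(rateKbps int64) int64 {
	return rateKbps / 1000
}

func main() {
	rate := int64(4000000) // 4 Gbps expressed in kbps
	fmt.Printf("ovs-vsctl set interface <iface> ingress_policing_rate=%d ingress_policing_burst=%d\n",
		rate, policingBurstKb(rate))
	fmt.Printf("VF max_tx_rate: %d Mbps\n", vfMaxTxRateMbps(rate))
}
```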

Bandwidth: Actual egress bandwidth might be much lower than configured.

Right now it returns the router IP as IP/PLEN, but the management port IP as a bare IP. Make the return values just *net.IPNet so that callers can use the String() function to construct either the IP alone or the IP with mask. This change also makes the function usable elsewhere in the code. Signed-off-by: Girish Moodalbail <gmoodalbail@nvidia.com>
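A small sketch of why a single *net.IPNet return type covers both cases. `mustParseCIDR` and the address used are illustrative only and do not come from the commit.

```go
package main

import (
	"fmt"
	"net"
)

// mustParseCIDR is a hypothetical helper that keeps the host address
// (net.ParseCIDR alone would hand back the masked network in its IPNet).
func mustParseCIDR(s string) *net.IPNet {
	ip, ipnet, err := net.ParseCIDR(s)
	if err != nil {
		panic(err)
	}
	return &net.IPNet{IP: ip, Mask: ipnet.Mask}
}

func main() {
	mgmt := mustParseCIDR("10.128.0.2/24") // example address, not from the source

	fmt.Println(mgmt.String())    // IP with mask: 10.128.0.2/24
	fmt.Println(mgmt.IP.String()) // IP only: 10.128.0.2
}
```

The caller decides which form it needs at the point of use, so the function no longer has to pick a string format on its behalf.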

Signed-off-by: Girish Moodalbail <gmoodalbail@nvidia.com>

To reduce the risk of errors from using CIDRs where we want IPs, which is more likely with string types, convert from strings to real Go types immediately when parsing the pod annotation and use the real Go types everywhere in the code. Signed-off-by: Dan Williams <dcbw@redhat.com>

Signed-off-by: Dan Williams <dcbw@redhat.com>

No code changes, just moving things around. Signed-off-by: Dan Williams <dcbw@redhat.com>

Signed-off-by: Dan Williams <dcbw@redhat.com>

make use of GetNodeWellKnownAddresses() in createManagementPortGeneric()

Used the below command: govendor fetch k8s.io/client-go/tools/leaderelection@kubernetes-1.14.3 Upcoming patch will make use of leaderelection to support HA ovnkube-master. Signed-off-by: Numan Siddique <nusiddiq@redhat.com>

dcbw · 2019-10-29T17:09:29Z

/test e2e-aws-ovn-kubernetes

openshift-ci-robot · 2019-10-29T17:09:35Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dcbw

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [dcbw]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

namespace: fix panic

dcbw · 2019-10-29T17:13:13Z

/test e2e-aws-ovn-kubernetes

dcbw · 2019-10-29T18:34:06Z

/test e2e-aws-ovn-kubernetes

The configuration file is a mounted ConfigMap (ovnkube-config); when it changes, exit. SDN-456 - OVN: use config file via ConfigMap rather than environment variables https://jira.coreos.com/browse/SDN-456 Signed-off-by: Phil Cameron <pcameron@redhat.com>

Notice change in configmap and exit.

dcbw · 2019-10-29T20:56:08Z

/test e2e-aws-ovn-kubernetes

The annotator provides a better-encapsulated interface for setting or removing multiple annotations on a node from disconnected functions since it bundles them all up into a single set and/or delete call. Signed-off-by: Dan Williams <dcbw@redhat.com>
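The batching idea can be sketched as follows. `nodeAnnotator` here is a hypothetical stand-in, not the type added by this commit, and a plain map stands in for the node object; a real implementation would send the collected changes to the API server in one update.

```go
package main

import "fmt"

// nodeAnnotator is a hypothetical illustration of the batching pattern.
// A nil value in changes means "delete this key".
type nodeAnnotator struct {
	changes map[string]*string
}

func newNodeAnnotator() *nodeAnnotator {
	return &nodeAnnotator{changes: map[string]*string{}}
}

// Set queues an annotation to be added or overwritten.
func (a *nodeAnnotator) Set(key, value string) { a.changes[key] = &value }

// Delete queues an annotation for removal.
func (a *nodeAnnotator) Delete(key string) { a.changes[key] = nil }

// Apply applies every queued change to annotations in one pass and
// then clears the queue.
func (a *nodeAnnotator) Apply(annotations map[string]string) {
	for k, v := range a.changes {
		if v == nil {
			delete(annotations, k)
		} else {
			annotations[k] = *v
		}
	}
	a.changes = map[string]*string{}
}

func main() {
	// Example keys are placeholders, not real ovn-kubernetes annotations.
	node := map[string]string{"example.org/old": "x"}

	ann := newNodeAnnotator()
	ann.Set("example.org/subnet", "10.128.1.0/24") // queued by one function
	ann.Delete("example.org/old")                  // queued by another
	ann.Apply(node)                                // single combined update

	fmt.Println(node)
}
```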

Signed-off-by: Dan Williams <dcbw@redhat.com>

Doesn't check for OVS/OVN utilities; to be used for components that don't need those. Signed-off-by: Dan Williams <dcbw@redhat.com>

…nels Heavily modified from work by Rajat Chopra in: ovn-org/ovn-kubernetes#593 Signed-off-by: Dan Williams <dcbw@redhat.com>

Signed-off-by: Jocelyn Berrendonner <jocelynb@microsoft.com>

Signed-off-by: Dan Williams <dcbw@redhat.com>

The hybrid overlay uses the 3rd IP of the logical switch, but when upgrading from a non-hybrid-overlay-enabled ovn-kubernetes that doesn't exclude that IP to one that does, existing pods may have that IP when the hybrid-overlay-enabled ovn-kubernetes starts. When ovn-kubernetes adds the 3rd IP to the exclude_ips of the switch, OVN will helpfully re-address any logical port with that IP, but since Kubernetes doesn't support changing pod IPs, OVN and Kube will mismatch and the pod won't work. Just kill it.

dcbw · 2019-10-29T22:01:46Z

/test e2e-aws-ovn-kubernetes

dcbw · 2019-10-29T22:42:42Z

/test e2e-aws-ovn-kubernetes

dcbw · 2019-10-30T01:17:42Z

/test e2e-aws-ovn-kubernetes

dcbw · 2019-10-30T19:12:11Z

/test e2e-aws-ovn-kubernetes

openshift-ci-robot · 2019-10-30T20:30:32Z

@dcbw: The following test failed, say /retest to rerun them all:

| Test name | Commit | Details | Rerun command |
| --- | --- | --- | --- |
| ci/prow/e2e-aws-ovn-kubernetes | `e49aef4` | link | `/test e2e-aws-ovn-kubernetes` |

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

dcbw · 2019-11-05T19:05:17Z

obsoleted by #43

TrevorTaoARM and others added 30 commits October 17, 2019 15:21

Merge pull request openshift#864 from TrevorTaoARM/centos

72959eb

Upgrade ovs version in Dockerfile to 2.11

sdn-388 Move management port creation to the master

0c0aa45

Signed-off-by: Michael Cambria <mcambria@redhat.com>

util: read static addresses correctly too

11f284c

Signed-off-by: Dan Williams <dcbw@redhat.com> (cherry picked from commit 76504df)

Use a newer kubectl version for the centos image

b1417da

Signed-off-by: Shahar Klein <sklein@nvidia.com>

Merge pull request openshift#871 from shahar-klein/Use-newer-kubectl-…

a8cb03d

…centos Use a newer kubectl version for the centos image

Merge pull request openshift#767 from mccv1r0/sdn-388

aa36fc1

Move management port creation to the master

remove the not needed annotation from the ovn-kubernetes namespace

c3bf30b

Signed-off-by: Girish Moodalbail <gmoodalbail@nvidia.com>

Merge pull request openshift#872 from girishmg/fix_ovn_setup

f9522e2

remove the unnecessary annotation from the ovn-kubernetes namespace

util: validate pod annotation

3225b4f

Signed-off-by: Dan Williams <dcbw@redhat.com>

Merge pull request openshift#873 from dcbw/validate-pod-annotation

cbd8418

util: validate pod annotation

Merge pull request openshift#869 from dcbw/pod-iface-id

b771d38

cni/linux: ensure only the latest sandbox has external-ids:iface-id set

Merge pull request openshift#877 from markmc/mgmt-port-cleanup

032f97a

management-port: clean up from move to master

fix lint warning in management-port_windows_test.go

490f390

fixes the following lint warning post commit 0c0aa45 warning: cannot initialize 1 variables with 2 values (staticcheck) Signed-off-by: Girish Moodalbail <gmoodalbail@nvidia.com>

Merge pull request openshift#878 from markmc/empty-lsp-addresses

070177f

Handle static vs dynamic better in GetPortAddresses()

Merge pull request openshift#880 from venuiyer/vi-branch

7859def

Bandwidth: Actual egress bandwidth might be much lower than configured.

make use of GetNodeWellKnownAddresses() in createManagementPortGeneric()

1498805

Signed-off-by: Girish Moodalbail <gmoodalbail@nvidia.com>

policy: remove unused arguments

067831b

Signed-off-by: Dan Williams <dcbw@redhat.com>

policy: move common code into policy_common.go

729ad2f

No code changes, just moving things around. Signed-off-by: Dan Williams <dcbw@redhat.com>

policy: use UnmarshalPodAnnotation

8e51ecc

Signed-off-by: Dan Williams <dcbw@redhat.com>

Merge pull request openshift#879 from girishmg/refactor_mgmt_port

1ea0c17

make use of GetNodeWellKnownAddresses() in createManagementPortGeneric()

openshift-ci-robot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Oct 29, 2019

openshift-ci-robot requested review from danwinship and rcarrillocruz October 29, 2019 17:09

openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 29, 2019

Merge pull request openshift#867 from squeed/namespace-panic

244c5a7

namespace: fix panic

dcbw force-pushed the extensions2 branch from e323640 to a5600c6 Compare October 29, 2019 17:12

pecameron and others added 2 commits October 29, 2019 16:00

Merge pull request openshift#860 from pecameron/sdn456a

66ae4e5

Notice change in configmap and exit.

dcbw and others added 8 commits October 29, 2019 16:49

factory: add filtered node watch handlers

82c95a0

Signed-off-by: Dan Williams <dcbw@redhat.com>

util: add SetExecWithoutOVS() variant

8b3f9f0

Doesn't check for OVS/OVN utilities; to be used for components that don't need those. Signed-off-by: Dan Williams <dcbw@redhat.com>

hybrid-overlay: framework for extending the OVN network via VXLAN tun…

373a959

…nels Heavily modified from work by Rajat Chopra in: ovn-org/ovn-kubernetes#593 Signed-off-by: Dan Williams <dcbw@redhat.com>

hybrid-overlay: initial HNS-based Windows node implementation

7feeb3a

Signed-off-by: Jocelyn Berrendonner <jocelynb@microsoft.com>

ovnkube.sh: add support for extensions

9f3db64

Signed-off-by: Dan Williams <dcbw@redhat.com>

Merge remote-tracking branch 'dcbwupstream/extensions' into extensions2

e49aef4

dcbw force-pushed the extensions2 branch from a5600c6 to e49aef4 Compare October 29, 2019 22:01

dcbw closed this Nov 5, 2019

Billy99 added a commit to Billy99/ovn-kubernetes that referenced this pull request Nov 2, 2022

Merge pull request openshift#36 from wizhaoredhat/add_hardware_offloa…

17369ed

…d_validation Add Hardware Offload Validation as a test option for flows.
