node: Handle arpinging when remote node is in different L2 #14201

brb · 2020-11-27T14:20:34Z

See the commit messages.

brb · 2020-11-27T14:36:30Z

test-me-please

kkourt

Looks good to me.

One thing that is not obvious to me is whether the refcount optimization for not pinging gateways can lead to a situation where the gateway's entry has expired in the ARP cache (I'm assuming this is why we are arpinging) and there is no way to reinsert it because its refcount is >0.

pkg/datapath/linux/node.go

brb · 2020-11-30T10:43:35Z

Thanks for the reviews!

> One thing that is not obvious to me is whether the refcount optimization for not pinging gateways can lead to a situation where the gateway's entry has expired in the ARP cache (I'm assuming this is why we are arpinging) and there is no way to reinsert it because its refcount is >0.

@kkourt The ARP entries cannot expire in the local cache, because we set NUD_PERMANENT which makes them permanent.

We are arpinging because when BPF NodePort running in XDP or TC gets a request that needs to be forwarded to another node (because of backend selection), it needs the L2 address of the next hop to that node (the gateway or the node itself). If the L2 address is not present in the neighbor subsystem, we have to drop the request, as it's not possible to drive ARP resolution from the BPF program. To avoid that, cilium-agent does an arping once it learns about a new k8s node.

Previously, insertNeighbor() assumed that a remote node is in the same L2 subnet, i.e. directly reachable without a gateway. However, this is not the case for all deployments.

This commit adds a check that detects whether the remote node is in the same L2. If it is not, the gateway IP address (next hop) is arpinged instead of the remote node IP address.

The missing bit in this commit is refcounting to avoid redundant arpings and neigh removals when the gateway is used to access more than one remote node.

Signed-off-by: Martynas Pumputis <m@lambda.lt>

Refcount neighbour entries to avoid redundant pings and neigh entry removals when the entry is still used by another node (this happens when two or more nodes can be accessed through the same gateway).

Signed-off-by: Martynas Pumputis <m@lambda.lt>
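A minimal sketch of the refcounting idea, assuming a plain in-memory map keyed by next-hop IP. The type `neighRefs` and its methods `acquire` and `release` are hypothetical names for illustration, not the identifiers used in the PR, and the sketch omits locking and the actual arping and neighbor-table calls.

```go
package main

import "fmt"

// neighRefs counts how many remote nodes use a given next hop.
// Hypothetical type for illustration only; not safe for concurrent use.
type neighRefs struct {
	refs map[string]int
}

func newNeighRefs() *neighRefs {
	return &neighRefs{refs: make(map[string]int)}
}

// acquire records one more user of nextHop and reports whether this is
// the first user, i.e. whether an arping is actually needed.
func (n *neighRefs) acquire(nextHop string) bool {
	n.refs[nextHop]++
	return n.refs[nextHop] == 1
}

// release drops one user of nextHop and reports whether it was the last
// user, i.e. whether the neighbor entry can be removed.
func (n *neighRefs) release(nextHop string) bool {
	count, ok := n.refs[nextHop]
	if !ok {
		return false // unknown next hop, nothing to remove
	}
	if count <= 1 {
		delete(n.refs, nextHop)
		return true
	}
	n.refs[nextHop] = count - 1
	return false
}

func main() {
	refs := newNeighRefs()
	gw := "192.168.1.1"

	// Two remote nodes are reachable through the same gateway.
	fmt.Println("node A added, arping needed:", refs.acquire(gw))
	fmt.Println("node B added, arping needed:", refs.acquire(gw))

	// The entry is only removed once the last node is gone.
	fmt.Println("node A removed, delete entry:", refs.release(gw))
	fmt.Println("node B removed, delete entry:", refs.release(gw))
}
```

Because the entries are installed as permanent (see the NUD_PERMANENT remark above), skipping the arping for an already referenced next hop does not risk the entry expiring in the meantime.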

brb · 2020-11-30T12:20:54Z

Previously, all checks were green. I've just pushed the comment change to address https://github.com/cilium/cilium/pull/14201/files/4ce23957a6661d8f985651aca1dc00e3a7c0833a#diff-14a5efc51ce7ea3959385022d4800b355c766c60e8b7ba0e6052afced05ddbf0.

brb added the pending-review and sig/datapath (Impacts bpf/ or low-level forwarding details, including map management and monitor messages) labels Nov 27, 2020

brb requested review from a team and kkourt November 27, 2020 14:20

maintainer-s-little-helper bot added the dont-merge/needs-release-note-label (The author needs to describe the release impact of these changes) label Nov 27, 2020

maintainer-s-little-helper bot assigned kkourt Nov 27, 2020

maintainer-s-little-helper bot added this to In progress in 1.10.0 Nov 27, 2020

maintainer-s-little-helper bot added this to Needs backport from master in 1.9.1 Nov 27, 2020

maintainer-s-little-helper bot added this to Needs backport from master in 1.8.6 Nov 27, 2020

brb added the release-note/minor (This PR changes functionality that users may find relevant to operating Cilium) label Nov 27, 2020

maintainer-s-little-helper bot removed the dont-merge/needs-release-note-label (The author needs to describe the release impact of these changes) label Nov 27, 2020

borkmann approved these changes Nov 27, 2020

borkmann added the feature/lb-only (Impacts cilium running in lb-only datapath mode) label Nov 27, 2020

brb added the area/kube-proxy-free label Nov 27, 2020

This was referenced Nov 27, 2020

CI: RuntimePrivilegedUnitTests: arping failed in TestArpPingHandling #14125

Closed

Add Cilium API for adding / removing node to trigger arping in lb-only mode #14203

Closed

kkourt approved these changes Nov 30, 2020

maintainer-s-little-helper bot added the ready-to-merge (This PR has passed all tests and received consensus from code owners to merge) label Nov 30, 2020

pchaigno reviewed Nov 30, 2020

pkg/datapath/linux/node.go

brb added 2 commits November 30, 2020 13:20

node: Refcount neighbour entries

bec216a

To avoid redundant pings and neigh entry removals when the entry is still used by another node (this happens when two or more nodes can be accessed through the same gateway).

Signed-off-by: Martynas Pumputis <m@lambda.lt>

brb force-pushed the pr/brb/arping-gw branch from b554271 to bec216a Compare November 30, 2020 12:20

joestringer merged commit b78b3b7 into master Dec 1, 2020

joestringer deleted the pr/brb/arping-gw branch December 1, 2020 00:31

borkmann mentioned this pull request Dec 2, 2020

v1.9 backports 2020-12-02 #14246

Merged

borkmann added backport-pending/1.9 and removed needs-backport/1.9 labels Dec 2, 2020

maintainer-s-little-helper bot moved this from Needs backport from master to Backport pending to v1.9 in 1.9.1 Dec 2, 2020

nathanjsweet mentioned this pull request Dec 2, 2020

v1.8 backports 2020-12-02 #14249

Merged

nathanjsweet added backport-pending/1.8 and removed needs-backport/1.8 labels Dec 2, 2020

maintainer-s-little-helper bot moved this from Needs backport from master to Backport pending to v1.8 in 1.8.6 Dec 2, 2020

pchaigno added backport-done/1.9 and removed backport-pending/1.9 labels Dec 3, 2020

maintainer-s-little-helper bot moved this from Backport pending to v1.9 to Backport done to v1.9 in 1.9.1 Dec 3, 2020

aanm mentioned this pull request Dec 4, 2020

Prepare for release v1.8.6 #14275

Merged

aanm added backport-done/1.8 and removed backport-pending/1.8 labels Dec 4, 2020

aanm mentioned this pull request Dec 4, 2020

Prepare for release v1.9.1 #14280

Merged

jaffcheng mentioned this pull request Dec 9, 2020

Add a mechanism in neighbor management to allow refreshing ARP cache #14322

Closed

pchaigno mentioned this pull request Dec 10, 2020

Cilium complains "IP is not L2 reachable" on startup #14340

Closed

borkmann mentioned this pull request Sep 30, 2021

Cilium in EKS without kube-proxy #10462

Closed
