New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add gARP capability to L2 announcer feature #25933
Add gARP capability to L2 announcer feature #25933
Conversation
7cc36ed
to
7f7daf8
Compare
The garp package was sending out ARP requests, while valid in some situations, what we actually need is to send out ARP replies which tell nodes on the network what the new MAC address is for a given IP instead of asking the nodes. This commit also changes the `SendOnInterface` to `SendOnInterfaceIdx` since the only call site for this function has indices not names available. Signed-off-by: Dylan Reimerink <dylan.reimerink@isovalent.com>
When failover occurs a different node will start responding to ARP. But other nodes on the network will not know this has happened and thus keep using the ARP entry from their local cache. This commit makes it so we broadcast a gARP reply with the new MAC address of the IP any time we reconcile a IP+ifidx tuple that was not yet present in the L2 responder map. Nodes that honor gARP replies will update their ARP cache to minimize downtime. Those who do not honer these replies will have more downtime until their ARP cache entry expires and they perform a ARP request themselves. Signed-off-by: Dylan Reimerink <dylan.reimerink@isovalent.com>
7f7daf8
to
92e24f2
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
vendor changes lgtm
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
/test Job 'Cilium-PR-K8s-1.26-kernel-net-next' failed: Click to show.Test Name
Failure Output
Jenkins URL: https://jenkins.cilium.io/job/Cilium-PR-K8s-1.26-kernel-net-next/517/ If it is a flake and a GitHub issue doesn't already exist to track it, comment Then please upload the Jenkins artifacts to that issue. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
/test-1.26-net-next |
When failover occurs a different node will start responding to ARP.
But other nodes on the network will not know this has happened and thus
keep using the ARP entry from their local cache.
This PR makes it so we broadcast a gARP reply with the new MAC
address of the IP any time we reconcile a IP+ifidx tuple that was not
yet present in the L2 responder map.
Nodes that honor gARP replies will update their ARP cache to minimize
downtime. Those who do not honer these replies will have more downtime
until their ARP cache entry expires and they perform a ARP request
themselves.