
docs: Fix egress gateway getting started guide #15984

Merged

Conversation

gandro
Member

@gandro gandro commented May 3, 2021

This commit fixes various issues discovered while testing the egress
gateway getting started guide:

  - Use the correct Helm value (`egressGateway.enabled`) to enable the
    feature.
  - Use the correct field names in the example `CiliumEgressNATPolicy`.
  - Set the Kubernetes namespace in `helm install` as we do in other
    `helm install` invocations.
  - Ensure that the `egress-ip-assign` deployment is always co-located
    with the example workload via pod affinity.
  - The examples use `curl` to access the external service, so use `curl`
    in the access log as well.

With these fixes applied, I was able to successfully validate this guide.


Signed-off-by: Sebastian Wicki <sebastian@isovalent.com>
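
As a rough illustration of the first three fixes in the list, the corrected installation step presumably looks something like the following. This is a sketch only: the repository setup, chart version, and any additional values come from the guide itself, not from this PR, and the last two `--set` values are assumptions based on the feature's usual requirements.

```
helm install cilium cilium/cilium \
    --namespace kube-system \
    --set egressGateway.enabled=true \
    --set bpf.masquerade=true \
    --set kubeProxyReplacement=strict
```

Here `--namespace kube-system` corresponds to the "set the Kubernetes namespace" fix, and `egressGateway.enabled=true` is the corrected Helm value mentioned above.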
@gandro gandro added area/documentation Impacts the documentation, including textual changes, sphinx, or other doc generation code. release-note/misc This PR makes changes that have no direct user impact. needs-backport/1.10 labels May 3, 2021
@gandro gandro requested a review from a team as a code owner May 3, 2021 15:18
@gandro gandro requested a review from qmonnet May 3, 2021 15:18
@maintainer-s-little-helper maintainer-s-little-helper bot added this to Needs backport from master in 1.10.0-rc2 May 3, 2021
@pchaigno pchaigno self-requested a review May 3, 2021 21:26
Member

@pchaigno pchaigno left a comment


🚀

@pchaigno pchaigno added the ready-to-merge This PR has passed all tests and received consensus from code owners to merge. label May 4, 2021
@ti-mo ti-mo merged commit f9ef751 into cilium:master May 5, 2021
@brb brb mentioned this pull request May 7, 2021
@maintainer-s-little-helper maintainer-s-little-helper bot moved this from Needs backport from master to Backport pending to v1.10 in 1.10.0-rc2 May 7, 2021
@maintainer-s-little-helper maintainer-s-little-helper bot moved this from Backport pending to v1.10 to Backport done to v1.10 in 1.10.0-rc2 May 13, 2021
@julianwiedmann
Member

/me casts resurrect spell

Hey @gandro, do you still remember why this part was needed:

> Ensure that the `egress-ip-assign` deployment is always co-located
> with the example workload via pod affinity.

Discussing with @jibi, having the workload on the same node as the Egress Gateway shouldn't be necessary at all.

@gandro
Member Author

gandro commented Apr 1, 2022

> /me casts resurrect spell
>
> Hey @gandro, do you still remember why this part was needed:
>
> > Ensure that the `egress-ip-assign` deployment is always co-located
> > with the example workload via pod affinity.
>
> Discussing with @jibi, having the workload on the same node as the Egress Gateway shouldn't be necessary at all.

It's been a while, so I might be wrong here - but from what I can reconstruct this was not about the egress gateway itself, but the node configuration. My understanding is that the node where egress gateway is running needs the egress IP to be set, and this change made it such that we're setting the IP on the gateway node, and not just a random node.

Looking at the change, I guess the affinity should not be based on the workload, but on the gateway itself, though I'm not sure how to achieve that. The docs here also refer to the OSS version of egress gateway, which might behave differently from what Cilium Enterprise offers; I honestly lack the knowledge here. @jibi would know better.

@pchaigno
Member

pchaigno commented Apr 1, 2022

Assigning the IP to the node makes it the gateway node.

But maybe what we'd want here is the opposite of the current action: to ensure that the egress gateway is not the node on which the client pod is running. That way users can see the full effect of the egress gateway feature (i.e., redirect to egress gateway + SNAT instead of just SNAT currently).
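
For illustration, the behaviour suggested above could presumably be expressed with a pod anti-affinity rule along these lines. This is a sketch only: `app: test-app` is a placeholder for whatever label the guide's client workload actually uses, and the container image and command are stand-ins.

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: egress-ip-assign
spec:
  affinity:
    podAntiAffinity:
      # Never schedule this pod on the node running the example client pod,
      # so the client's traffic has to be redirected to the gateway node.
      requiredDuringSchedulingIgnoredDuringExecution:
        - labelSelector:
            matchLabels:
              app: test-app        # placeholder: label of the example client pod
          topologyKey: kubernetes.io/hostname
  containers:
    - name: assign-egress-ip
      image: busybox               # placeholder image
      command: ["sleep", "infinity"]
```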

@julianwiedmann
Member

For reference, the test-VM instead suggests using a nodeSelector for pinning the Egress Gateway to a specific node. Which feels reasonable, so that the Gateway doesn't potentially bounce around in the cluster during a re-deployment.

@jibi
Member

jibi commented Apr 1, 2022

> My understanding is that the node where egress gateway is running needs the egress IP to be set

correct

> and this change made it such that we're setting the IP on the gateway node, and not just a random node.

we are setting the IP on the node where the workload is running, which works I guess, but at the same time it was confusing us a bit (the comment seems to imply that we specifically need the node with the workload, while we actually don't)

> But maybe what we'd want here is the opposite of the current action: to ensure that the egress gateway is not the node on which the client pod is running. That way users can see the full effect of the egress gateway feature (i.e., redirect to egress gateway + SNAT instead of just SNAT currently).

agree with this 👍

> For reference, the test-VM instead suggests using a nodeSelector for pinning the Egress Gateway to a specific node. Which feels reasonable, so that the Gateway doesn't potentially bounce around in the cluster during a re-deployment.

yep, we should probably change the example egress-ip-assign pod's manifest to use a node selector, but I'd say to wait: I have a PR almost ready with some other egress gateway changes (which will also allow using a node selector to specify the egress gateway node in the CiliumEgressNATPolicy object), and those changes will require updating the docs anyway

@pchaigno
Member

pchaigno commented Apr 1, 2022

> yep, we should probably change the example egress-ip-assign pod's manifest to use a node selector

Won't this be hard to achieve in an arbitrary customer environment? We don't know the names and labels of nodes. It makes sense for the CI where we control all this, but not sure it does for a user guide.

@jibi
Member

jibi commented Apr 1, 2022

I mean, we should just use the same example selector (example-label: test or something similar) both in the policy:

apiVersion: cilium.io/v2alpha1
kind: CiliumEgressNATPolicy
metadata:
  name: egress-test
spec:
#[..]
  egressGateway:
    nodeSelector:
      matchLabels:
        example-label: test

and then in the egress-ip-assign pod manifest:

apiVersion: v1
kind: Pod
metadata:
  name: egress-ip-assign
spec:
#[..]
  nodeSelector:
    example-label: test

(also not sure why we are currently suggesting to use a Deployment for that 🤔)

@pchaigno
Member

pchaigno commented Apr 1, 2022

That requires the user to add the appropriate label on the node. One more step that isn't really necessary in this context IMO.

Why do we care which exact node the egress gateway ends up on? I understand for tests (reduce randomness) but I don't see the point for a user guide. An antiaffinity rule seems enough to me.

@jibi
Member

jibi commented Apr 1, 2022

> An antiaffinity rule seems enough to me.

with the current state of the feature: yes, but once we add support for nodeSelectors in the egressgw policies you'll have to assign the egress IP to the same node that is selected by the policy (perhaps we can document/add examples for both cases)
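
Once that nodeSelector support exists, the two manifests sketched earlier in this thread would presumably have to agree on the label, and the node itself would have to carry it first, e.g. with `kubectl label node <node-name> example-label=test` (where `<node-name>` is whichever node should act as the gateway). A combined sketch, assuming the field names from the earlier snippets:

```yaml
# Sketch: policy and IP-assignment pod selecting the same labelled node.
apiVersion: cilium.io/v2alpha1
kind: CiliumEgressNATPolicy
metadata:
  name: egress-test
spec:
  egressGateway:
    nodeSelector:
      matchLabels:
        example-label: test
---
apiVersion: v1
kind: Pod
metadata:
  name: egress-ip-assign
spec:
  nodeSelector:
    example-label: test
```

If the two selectors diverge, the egress IP would end up assigned on a node other than the one the policy routes through, which is exactly the confusion discussed above.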
