Network design idea: use a private network. #9

rbo · 2021-06-04T10:49:25Z

Additional to #8, it's hard to secure the public interfaces of the Hetzner servers.

Idea A: Move SDN / OpenShift traffic to a private network. Add Hetzner you can attach the dedicated server to a vSwitch.

This would end up in this network setup:

Initially, I tried this kind of setup and failed.

Because of openshift installation use interfaces with a default gateway for main interface decision. We changed the default gw to 172.22.2.1 at vlan4000 interface to force to use the VLAN interface IP.

Additional we decided to complete disable public IP because bootstrap pick the first IP and this is the public one so bootstrap etcd member uses public API but all other nodes do not have access anymore to public IPs.

Source commit #4b7523ec11161c20e4a2e851e4f2e732185e96f1

rbo · 2021-06-04T11:59:48Z

Moving SDN/OVN to the secondary interface is not possible, RFE: Support Migration of OVN to a Secondary Cluster Host Interface planned for 4.9

Another related RFE: Multiple NIC Support for OVN-Kubernetes Deployments

rbo · 2021-06-04T12:49:51Z

Idea B:

It is close to, but with an own default router

sesheta · 2021-10-15T12:00:45Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

/lifecycle stale

rbo · 2021-10-18T07:55:20Z

/remove-lifecycle stale

4.9 will be released this month then we can plan to use the secondary interface thing.

rbo · 2021-10-29T15:54:21Z

Bring up Idea A at our internal slack with sdn engineering, and ask for a solution. https://coreos.slack.com/archives/CDCP2LA9L/p1635522685150000

/cc @durandom

durandom · 2021-11-01T14:27:37Z

@rbo since 4.9 is out now. Would you recommend upgrading rick to 4.9 and try this solution?
Or would we rather deploy another 4.9 test cluster and try the solution there?

rbo · 2021-11-02T08:57:02Z

It looks like SDN-1813 is the wrong approach, we are not the only one with that kind of problem: https://issues.redhat.com/browse/OCPBUGSM-27829

@rbo since 4.9 is out now. Would you recommend upgrading rick to 4.9 and try this solution? Or would we rather deploy another 4.9 test cluster and try the solution there?

As far as I know, the upgrade path is only available in the unsupported candidate-4.9 channel. First, we have to cleanup the rick cluster, there are some operators in "in progressing" state.

durandom · 2021-11-04T10:29:16Z

@larsks can you open a ticket with RH support for this and add @rbo to it?

larsks · 2021-11-04T11:40:15Z

@durandom I think someone else should probably manage the support ticket for this issue. It doesn't seem to touch on the MOC/BU environment, which is generally how I masquerade as a customer w/r/t support, and I'm not familiar with the hetzner environment at all. There needs to be someone else who is able to interact with the support system (or we should be pursuing this by directly interacting with the sdn team, rather than trying to treat it as a support issue -- which might be for the best because openshift support has been a mixed bag so far).

rbo · 2021-11-18T09:26:09Z

Mh it looks like we have an official documented solution to select the NIC for the kubelet: https://docs.openshift.com/container-platform/4.9/support/troubleshooting/troubleshooting-network-issues.html#nw-how-nw-iface-selected_troubleshooting-network-issues

sesheta · 2022-02-16T09:28:18Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

/lifecycle stale

rbo · 2022-02-25T08:56:10Z

The private network works pretty well:

PR #17 contains everything. I guess we can close this issue.

rbo added the kind/design Design decision label Jun 4, 2021

rbo added the priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. label Jun 4, 2021

rbo removed the priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. label Jun 17, 2021

sesheta added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 15, 2021

sesheta removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 18, 2021

rbo self-assigned this Oct 18, 2021

rbo mentioned this issue Oct 19, 2021

Limited firewall rules at Hetzner - only 10 rules per server #8

Closed

rbo mentioned this issue Oct 29, 2021

Get hardware with avx2 or avx512 capabilities in smaug instance operate-first/support#419

Closed

1 task

sesheta added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 16, 2022

rbo closed this as completed Feb 25, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Network design idea: use a private network. #9

Network design idea: use a private network. #9

rbo commented Jun 4, 2021 •

edited

Loading

rbo commented Jun 4, 2021 •

edited

Loading

rbo commented Jun 4, 2021 •

edited

Loading

sesheta commented Oct 15, 2021

rbo commented Oct 18, 2021

rbo commented Oct 29, 2021

durandom commented Nov 1, 2021

rbo commented Nov 2, 2021

durandom commented Nov 4, 2021

larsks commented Nov 4, 2021

rbo commented Nov 18, 2021

sesheta commented Feb 16, 2022

rbo commented Feb 25, 2022

Network design idea: use a private network. #9

Network design idea: use a private network. #9

Comments

rbo commented Jun 4, 2021 • edited Loading

rbo commented Jun 4, 2021 • edited Loading

rbo commented Jun 4, 2021 • edited Loading

sesheta commented Oct 15, 2021

rbo commented Oct 18, 2021

rbo commented Oct 29, 2021

durandom commented Nov 1, 2021

rbo commented Nov 2, 2021

durandom commented Nov 4, 2021

larsks commented Nov 4, 2021

rbo commented Nov 18, 2021

sesheta commented Feb 16, 2022

rbo commented Feb 25, 2022

rbo commented Jun 4, 2021 •

edited

Loading

rbo commented Jun 4, 2021 •

edited

Loading

rbo commented Jun 4, 2021 •

edited

Loading