
feat: make hostNetwork configurable #107

Merged
merged 4 commits into from
Jan 12, 2024

Conversation

vadasambar
Contributor

@vadasambar vadasambar commented Jan 11, 2024

  • disable it by default (fixes the port conflict on the node when another pod tries to acquire the same port)

Why

We use hostNetwork: true by default, which causes problems in the following scenario:

  1. another-csi-driver pod is running on the same node A
  2. another-csi-driver uses port 8080
  3. warm-metal pod is scheduled on node A
  4. warm-metal pod tries to use port 8080 as well (for metrics endpoint) but fails because the port is already in use

This happens because we use hostNetwork: true. Based on my tests and analysis (along with integration tests), we don't really need hostNetwork because we are not accessing anything on the node nor is the node accessing anything on the warm-metal pod directly.

More info

hostNetwork has been a part of the initial commit: 515cec1

The liveness probe is a sidecar container that exposes an HTTP /healthz endpoint, which serves as kubelet's livenessProbe hook to monitor health of a CSI driver.

The liveness probe uses the Probe() call to check that the CSI driver is healthy. See the CSI spec for more information about the Probe API call.

https://github.com/kubernetes-csi/livenessprobe/tree/master?tab=readme-ov-file#liveness-probe

So kubelet executes the livenessProbe, which pings the /healthz endpoint to check if the CSI driver is alive, and the livenessprobe server forwards that request to the CSI driver's Probe() endpoint to check if the driver is healthy:
https://github.com/kubernetes-csi/livenessprobe/blob/ebd49b57031ab90f6f376f1472884948f381558c/cmd/livenessprobe/main.go#L169 -> https://github.com/kubernetes-csi/livenessprobe/blob/ebd49b57031ab90f6f376f1472884948f381558c/cmd/livenessprobe/main.go#L73
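
The wiring between kubelet, the livenessprobe sidecar, and the driver can be sketched as a pod spec fragment (the container names, port, and image tag below are illustrative, not the chart's actual values):

```yaml
containers:
- name: liveness-probe            # sidecar serving HTTP /healthz
  image: registry.k8s.io/sig-storage/livenessprobe:v2.12.0  # illustrative tag
  args:
  - --csi-address=/csi/csi.sock   # driver's gRPC socket, shared via socket-dir
  - --health-port=9809
- name: csi-plugin                # the CSI driver itself
  livenessProbe:                  # kubelet probes the sidecar over HTTP...
    httpGet:
      path: /healthz
      port: 9809
  # ...and the sidecar translates each /healthz hit into a gRPC Probe()
  # call on /csi/csi.sock
```

Note that this probe traffic stays inside the pod's own network namespace, so it does not depend on hostNetwork.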

I don't see a need for warm-metal to use hostNetwork since it doesn't need to access anything on the node.

warm-metal can access all the unix sockets via volumeMounts:

...
    volumeMounts:
    - mountPath: /csi
      name: socket-dir
...
    - mountPath: /run/containerd/containerd.sock
      name: runtime-socket
...
  volumes:
  - hostPath:
      path: /var/lib/kubelet/plugins/csi-image.warm-metal.tech
      type: DirectoryOrCreate
    name: socket-dir
...
  - hostPath:
      path: /run/containerd/containerd.sock
      type: Socket
    name: runtime-socket
...
  - name: kube-api-access-xst7n
...

ref: https://github.com/warm-metal/csi-driver-image/blob/master/charts/warm-metal-csi-driver/templates/nodeplugin.yaml
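
Making hostNetwork configurable in the chart could look roughly like the sketch below (the hostNetwork value key is an assumption; check the chart's values.yaml for the key this PR actually introduces):

```yaml
# values.yaml (hypothetical key)
hostNetwork: false   # disabled by default; opt in only if the driver
                     # really must share the node's network namespace

# templates/nodeplugin.yaml (template fragment)
spec:
  template:
    spec:
      hostNetwork: {{ .Values.hostNetwork }}
```

With the default false, the metrics port is bound inside the pod's network namespace and no longer collides with other pods on the node.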

@vadasambar vadasambar marked this pull request as ready for review January 11, 2024 12:26
@vadasambar vadasambar requested a review from a team as a code owner January 11, 2024 12:26
@@ -102,7 +102,8 @@ func (m *PullExecutor) StartPulling(o *PullOptions) error {
 	c, cancel := context.WithTimeout(context.Background(), pullCtxTimeout)
 	defer cancel()
 
-	if pullstatus.Get(o.NamedRef) == pullstatus.StillPulling {
+	if pullstatus.Get(o.NamedRef) == pullstatus.StillPulling ||
+		pullstatus.Get(o.NamedRef) == pullstatus.Pulled {
Contributor Author

This is a fix for what appears to be a race condition when I was testing #81
Basically, when two requests are sent back-to-back with very little time between them, one of them succeeds while the other enters the mutex-protected section and pulls the same image again (if pullAlways: true). I haven't seen this issue outside #81, but I thought it would be good to get the fix in before #81 is merged.

Collaborator
Good catch!

Collaborator

@mugdha-adhav mugdha-adhav left a comment
It would be good if we could also update other references for hostNetwork and set it to false for maintaining consistency.

@vadasambar
Contributor Author

It would be good if we could also update other references for hostNetwork and set it to false for maintaining consistency.

I have replaced the occurrences.

@vadasambar
Contributor Author

Will ping you here once the CI finishes successfully.

@vadasambar
Contributor Author

@mugdha-adhav can you review the PR again?

@vadasambar
Contributor Author

@mugdha-adhav can I ask for a review again 🙏

@mugdha-adhav mugdha-adhav merged commit 2da2ca1 into warm-metal:master Jan 12, 2024
6 checks passed