Skip to content

Gateway crash on v0.0.16: k3s fatal error 'flag provided but not defined: -resolv-conf' #696

@yanggf8

Description

@yanggf8

Description

OpenShell gateway v0.0.16 fails to start. The k3s process inside the cluster container crashes immediately with:

time="2026-03-31T06:17:23Z" level=fatal msg="Error: flag provided but not defined: -resolv-conf"

The container exits before the openshell namespace is ever created, causing nemoclaw onboard to fail with "K8s namespace not ready" after all retry attempts.

Environment

  • OS: Ubuntu (WSL2 on Windows)
  • Docker: 29.3.0
  • OpenShell CLI: 0.0.16
  • OpenShell image: ghcr.io/nvidia/openshell/cluster:0.0.16
  • k3s version in image: v1.35.2+k3s1

Steps to reproduce

docker run -d --name test --privileged ghcr.io/nvidia/openshell/cluster:0.0.16
docker logs test
# Container exits immediately with the -resolv-conf fatal error

Or via NemoClaw:

node bin/nemoclaw.js onboard
# Fails at step 2/7 "Starting OpenShell gateway"

Root cause

The cluster entrypoint passes -resolv-conf to k3s, but k3s v1.35.2 no longer recognizes this flag. It was likely renamed or removed in a recent k3s release.

Workaround

Downgrading to v0.0.15 works:

OPENSHELL_CLUSTER_IMAGE=ghcr.io/nvidia/openshell/cluster:0.0.15 openshell gateway start --name nemoclaw

Expected behavior

Gateway should start successfully and create the openshell k8s namespace.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions