
fix(ci): e2e nested bootstrap cluster add d8 v ssh timeout #2301

Merged
universal-itengineer merged 2 commits into main from fix/ci/e2e-nested-bootstrap-cluster-add-d8vssh-timeout
Apr 29, 2026


@universal-itengineer universal-itengineer commented Apr 29, 2026

Description

Add SSH keepalive options to d8 v ssh calls used in nested-cluster CI and the e2e d8 helper.
This keeps long-lived SSH sessions more stable when the VM connection is proxied through virtualization-api port-forwarding.
The following SSH options were added:

  • ServerAliveInterval=15 sends a keepalive probe after 15 seconds without data from the server, so long-running sessions are checked periodically.
  • ServerAliveCountMax=8 lets the SSH client tolerate up to 8 consecutive unanswered probes before it gives up.
  • ConnectTimeout=10 fails a stalled connection attempt after 10 seconds, so retries do not hang.
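
As a rough illustration of how the three options above combine, the sketch below assembles them as standard OpenSSH `-o` arguments and computes how long an unresponsive link is tolerated before the client disconnects. The function and variable names are hypothetical and not taken from this PR. How the options are forwarded to `d8 v ssh` is deliberately left out.

```shell
# Hypothetical sketch: names are illustrative, not taken from the PR.
SERVER_ALIVE_INTERVAL=15   # seconds without server data before a probe is sent
SERVER_ALIVE_COUNT_MAX=8   # unanswered probes tolerated before disconnecting
CONNECT_TIMEOUT=10         # seconds allowed for the initial connection attempt

# Print the options as OpenSSH "-o Key=Value" arguments on one line.
ssh_keepalive_opts() {
  printf '%s %s %s\n' \
    "-o ServerAliveInterval=${SERVER_ALIVE_INTERVAL}" \
    "-o ServerAliveCountMax=${SERVER_ALIVE_COUNT_MAX}" \
    "-o ConnectTimeout=${CONNECT_TIMEOUT}"
}

# Approximate time an unresponsive connection survives:
# probe interval multiplied by the number of tolerated missed probes.
keepalive_window_seconds() {
  echo $(( SERVER_ALIVE_INTERVAL * SERVER_ALIVE_COUNT_MAX ))
}

ssh_keepalive_opts
keepalive_window_seconds
```

With the values from this PR, the client should ride out roughly two minutes of silence on an established session, while a new connection attempt is abandoned after ten seconds.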

Why do we need it, and what problem does it solve?

Nested cluster bootstrap and kubeconfig generation use d8 v ssh over a websocket-backed port-forward path.
When that path is interrupted, CI may fail with errors like websocket: close 1006, Broken pipe, or exit status 255.
Adding ServerAliveInterval, ServerAliveCountMax, and a fixed ConnectTimeout makes these SSH sessions more resilient to transient transport issues and short interruptions.
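
For context, `ssh` exits with status 255 when the connection itself fails, and otherwise passes through the exit status of the remote command. A minimal sketch of how a caller might retry only on that transport-level status is shown below. `retry_on_transport_error` and its parameters are hypothetical and are not code from this PR.

```shell
# Hypothetical helper: retry a command only when it exits with 255,
# the status OpenSSH reports for connection-level failures.
# Usage: retry_on_transport_error <max_attempts> <delay_seconds> <command...>
retry_on_transport_error() {
  max_attempts=$1
  delay=$2
  shift 2
  attempt=1
  while :; do
    "$@"
    rc=$?
    # Success or a non-transport failure: hand the status back unchanged.
    [ "$rc" -ne 255 ] && return "$rc"
    # Transport failure: stop once the attempt budget is spent.
    [ "$attempt" -ge "$max_attempts" ] && return "$rc"
    attempt=$((attempt + 1))
    sleep "$delay"
  done
}
```

Retrying only on 255 keeps genuine command failures visible instead of masking them behind retries. One caveat: a remote command that itself exits with 255 cannot be told apart from a dropped connection this way.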

What is the expected result?

d8 v ssh sessions in CI and e2e helpers should better survive short-lived connection drops.
This should reduce flaky failures during long-running nested-cluster operations without changing the functional behavior of the tests.

Checklist

  • The code is covered by unit tests.
  • e2e tests passed.
  • Documentation updated according to the changes.
  • Changes were tested in the Kubernetes cluster manually.

Changelog entries

section: ci
type: fix
summary: add SSH keepalive options to d8-based nested cluster access in CI and e2e helpers
impact_level: low

Signed-off-by: Nikita Korolev <nikita.korolev@flant.com>
@universal-itengineer universal-itengineer added this to the v1.9.0 milestone Apr 29, 2026
@universal-itengineer universal-itengineer marked this pull request as ready for review April 29, 2026 12:41
@universal-itengineer universal-itengineer changed the title Fix/ci/e2e nested bootstrap cluster add d8vssh timeout fix(ci): e2e nested bootstrap cluster add d8vssh timeout Apr 29, 2026
@universal-itengineer universal-itengineer changed the title fix(ci): e2e nested bootstrap cluster add d8vssh timeout fix(ci): e2e nested bootstrap cluster add d8 v ssh timeout Apr 29, 2026
@universal-itengineer universal-itengineer merged commit 813a48d into main Apr 29, 2026
33 of 36 checks passed
@universal-itengineer universal-itengineer deleted the fix/ci/e2e-nested-bootstrap-cluster-add-d8vssh-timeout branch April 29, 2026 13:03
3 participants