Skip to content

fix: make health probes configurable with better defaults#202

Merged
vishnu-narayanan merged 4 commits intomainfrom
fix/configurable-health-probes
Apr 2, 2026
Merged

fix: make health probes configurable with better defaults#202
vishnu-narayanan merged 4 commits intomainfrom
fix/configurable-health-probes

Conversation

@vishnu-narayanan
Copy link
Copy Markdown
Member

@vishnu-narayanan vishnu-narayanan commented Apr 2, 2026

Summary

  • Add timeoutSeconds: 5 to liveness, readiness, and startup probes (was defaulting to K8s implicit 1s)
  • Increase startup probe failureThreshold from 30 to 60 (~10 min budget instead of ~5.5 min)
  • Move probe configuration to values.yaml so self-hosted customers can tune without forking the chart

Context

Self-hosted customer reported pods stuck in restart loops due to health check failures. Root cause was CPU-starved pods taking too long to boot Rails, combined with the implicit 1s timeoutSeconds killing pods that were slow to respond after startup.

Test plan

  • helm template renders probes correctly with default values
  • Verify existing deployments are not affected (defaults are strictly more permissive)
  • Verify customers can override probe values via values.yaml

vishnu-narayanan and others added 4 commits April 1, 2026 17:45
nslookup in BusyBox v1.37+ (shipped in the chatwoot image) exits 1
when intermediate search domain lookups return NXDOMAIN, even if the
final lookup succeeds. This causes the init-redis until loop to hang
indefinitely on clusters with ndots:5 (the Kubernetes default).

getent hosts uses the system C library resolver which returns the first
successful result and exits 0, matching the expected behavior.

Fixes #200
- Add timeoutSeconds: 5 to liveness, readiness, and startup probes (was defaulting to 1s)
- Increase startup probe failureThreshold from 30 to 60 (~10 min budget)
- Move probe configuration to values.yaml so customers can tune without forking
@vishnu-narayanan vishnu-narayanan merged commit ab9f574 into main Apr 2, 2026
1 check passed
@vishnu-narayanan vishnu-narayanan deleted the fix/configurable-health-probes branch April 2, 2026 11:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant