ci: convert k8s charts deployment -> statefulset #3642

conorsch · 2024-01-20T01:17:14Z

Updates the Helm charts used for testnet deployments to use a StatefulSet [0], rather than a Deployment [1], as the representation for a Penumbra fullnode/validator. The goal is to make the best use of the k8s API for our workloads, which are stateful in the sense that they require attached storage and cannot maintain their identity without that storage.

We also benefit from ordered rollouts, meaning that future minor version bumps will be applied sequentially, and paused if any node fails to become ready. This will ensure more predictable behavior as we move toward chain upgrades.
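For reviewers less familiar with StatefulSets, the fragment below is a minimal sketch of the fields that give the behavior described above. It is not copied from the charts in this PR: the names, image, replica count, and storage size are placeholders chosen for illustration. With `RollingUpdate`, Kubernetes replaces pods one at a time, from the highest ordinal down, and waits for each to be Running and Ready before moving on.

```yaml
# Illustrative only: names, image, replicas, and sizes are assumptions,
# not values taken from the Penumbra charts.
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: penumbra-node              # hypothetical name
spec:
  serviceName: penumbra-node       # hypothetical headless Service
  replicas: 3
  podManagementPolicy: OrderedReady   # the default: create pods in ordinal order
  updateStrategy:
    type: RollingUpdate               # the default: update one pod at a time
  selector:
    matchLabels:
      app: penumbra-node
  template:
    metadata:
      labels:
        app: penumbra-node
    spec:
      containers:
        - name: pd
          image: example.invalid/pd:placeholder   # placeholder image
  # Each pod gets its own PersistentVolumeClaim, which is what ties
  # a node's identity to its attached storage.
  volumeClaimTemplates:
    - metadata:
        name: data
      spec:
        accessModes: ["ReadWriteOnce"]
        resources:
          requests:
            storage: 10Gi
```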

When performing a chain upgrade, the manual steps taken by a human operator are now significantly simpler. In addition to the conversion to StatefulSets, the relevant charts now have a new feature called "maintenanceMode", defaulting to false, which places nodes in a suspended state so that a human operator can run `pd migrate`. This mode encapsulates a number of finicky manual steps: overriding the command to be "sleep infinity" for both pd and cometbft, altering the securityContext to run as the root user for volume permissions, and then undoing all of that in reverse order.
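As a rough sketch of what this mode amounts to, the two YAML documents below show a values override and the kind of pod spec it might render to. Both are assumptions for illustration: the position of the `maintenanceMode` key in the values file, the container names, and the pod-level placement of `securityContext` may differ from the actual templates in this PR.

```yaml
# Hypothetical values override; the key's location in the real
# values.yaml is an assumption.
maintenanceMode: true
---
# Hypothetical fragment of the rendered pod spec while maintenanceMode is on.
# Only the fields relevant to the steps described above are shown.
spec:
  securityContext:
    runAsUser: 0                      # run as root for volume permissions
  containers:
    - name: pd
      command: ["sleep", "infinity"]  # suspend instead of starting the node
    - name: cometbft
      command: ["sleep", "infinity"]
```

Setting the value back to false would be expected to restore the normal commands and security context on the next rollout.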

[0] https://kubernetes.io/docs/concepts/workloads/controllers/statefulset/
[1] https://kubernetes.io/docs/concepts/workloads/controllers/deployment/


conorsch temporarily deployed to smoke-test January 20, 2024 01:17 — with GitHub Actions Inactive

conorsch force-pushed the statefulset-deployments branch from 8ca2c7e to 1423b35 Compare January 22, 2024 22:52

conorsch temporarily deployed to smoke-test January 22, 2024 22:52 — with GitHub Actions Inactive

conorsch marked this pull request as ready for review January 22, 2024 22:58

conorsch merged commit 1d688c2 into main Jan 22, 2024
7 checks passed

conorsch deleted the statefulset-deployments branch January 22, 2024 23:12
