Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Anti-affinity for k8s nodes (control-plane and workers) #226

Closed
2 tasks done
garloff opened this issue Dec 5, 2022 · 1 comment · Fixed by SovereignCloudStack/standards#336
Closed
2 tasks done
Assignees
Labels
Container Issues or pull requests relevant for Team 2: Container Infra and Tooling SCS is standardized SCS is standardized SCS-VP10 Related to tender lot SCS-VP10 Sprint Ljubljana Sprint Ljubljana (2023, cwk 38+39) standardization Standards & Certification standards Issues / ADR / pull requests relevant for standardization & certification

Comments

@garloff
Copy link
Contributor

garloff commented Dec 5, 2022

As a user, I want to be sure that having containers on different nodes are not going down simultaneously because they are actually VMs (servers) on the same host (hypervisor). I need this to be the default behavior.

The control plane must survive for the cluster to survive. This requires hard anti-affinity (or in a shared control plane enough redundancy guaranteed otherwise).
For workers, we still want to avoid co-scheduling, but it does not necessarily need to be a hard anti-affinity.

This could be made opt-out, which could be useful for very small environments.

Definition of Done:

  • A standardization document is written (we only produced a Decision Record with this issue in the end)
  • A follow up issue is created.
@garloff
Copy link
Contributor Author

garloff commented Dec 7, 2022

In previous meetings we had agreed that using

  • hard anti-affinity on control plane nodes
    (ignore for shared control plane but require enough redundancy in a different way then)
  • soft anti-affinity on work nodes
    is the right default.

@tibeer tibeer mentioned this issue Mar 29, 2023
@cah-hbaum cah-hbaum self-assigned this Aug 24, 2023
@cah-hbaum cah-hbaum added Container Issues or pull requests relevant for Team 2: Container Infra and Tooling SCS is standardized SCS is standardized standardization Standards & Certification standards Issues / ADR / pull requests relevant for standardization & certification SCS-VP10 Related to tender lot SCS-VP10 labels Aug 24, 2023
@jschoone jschoone added the Sprint Ljubljana Sprint Ljubljana (2023, cwk 38+39) label Feb 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Container Issues or pull requests relevant for Team 2: Container Infra and Tooling SCS is standardized SCS is standardized SCS-VP10 Related to tender lot SCS-VP10 Sprint Ljubljana Sprint Ljubljana (2023, cwk 38+39) standardization Standards & Certification standards Issues / ADR / pull requests relevant for standardization & certification
Projects
Archived in project
Development

Successfully merging a pull request may close this issue.

3 participants