[RFC] Define Flux tenancy models #2086

stefanprodan · 2021-11-15T13:19:10Z

The main goal of this RFC is to define the Kubernetes tenancy models supported by Flux.

This PR attempts to document the status quo, and should provide clarity of what multi-tenancy capabilities Flux has. It also functions as a base for rewriting the loose proposal in #582 into well scoped RFCs.

phillebaba · 2021-11-15T13:32:35Z

rfcs/0001-multi-tenancy/README.md

+
+The platform admins have unrestricted, cluster-scoped access to Kubernetes API.
+They are responsible for installing Flux and granting Flux
+access to the sources (Git, Helm, OCI repositories) that make up the cluster(s) control plane desired state.


Does this mean that tenants should not configure their own sources? In the tenants section it does however state "Register their sources with Flux". I might just be misinterpreting the meaning.

This is about the cluster control plane desired state as in cluster-wide resources, controllers, etc.

I've added a new section "Tenants Onboarding". Hopefully this clarifies that tenants can add their app repos to their main repo which is registered by admins.

I was initially confused by this as well. It might pay to spend some paras up front explaining which git repositories are assumed to exist, and how they are used (i.e., what they contain).

I wonder if it would make sense to introduce a definition of the "control plane" in a separate paragraph at the beginning of the RFC somewhere. I'm thinking of everything that is either shared among tenants or created as part of the on-boarding of a tenant; the Flux instance itself, components such as Gatekeeper and resources such as ServiceAccounts.

It might also be helpful to explain the repo hierarchy: Each tenant has a root repo that's created by cluster admins and as many subsequent repos maintained by themselves.

rfcs/0001-multi-tenancy/README.md

souleb · 2021-11-22T15:32:24Z

rfcs/0001-multi-tenancy/README.md

+
+Example of operations performed by tenants:
+
+- Register their sources with Flux (`GitRepositories`, `HelmRepositories` and `Buckets`).


It is said here above that cluster-admin Onboard tenants by registering their Git repositories with Flux. This might need a clarification on separation of concern

I've added a new section "Tenants Onboarding". Hopefully this clarifies that tenants can add their app repos to their main repo which is registered by admins.

yebyen

This is a great overview of the current state-of-the-art with lots of good references for follow-up education. 👍 LGTM with changes, a few typos corrected.

rfcs/0001-multi-tenancy/README.md

squaremo · 2021-11-29T13:48:22Z

It's not uncommon to have a "memorandum" RFC which describes the status quo, rather than proposing a new design. It seems like a needless indirection to use an RFC to propose new documentation, giving the content verbatim, though. I would expect either

an RFC which will stand itself as the description of the status quo at some point (and, e.g., documentation can point at it, or paraphrase it while still relevant); or,
a PR with the new documentation.

Given the goal of building up to new designs, I think the first is the appropriate form here (and would require only a little adaptation).

jonathan-innis

LGTM 🚀

nikkomiu

Looks good! (I'm with the MS/Azure group)

squaremo

Overall comment: 💯 👏 for the effort to definitively set out tenancy models for Flux. I think the content could be more pointed in how it does that, by

being clear about which bits are definitions or assumptions;
describing the models more directly -- in some places, the text lapses into being "how to" rather than being definitive.

rfcs/0001-multi-tenancy/README.md

squaremo · 2021-11-29T18:12:34Z

rfcs/0001-multi-tenancy/README.md

+- List the tenancy models supported by Flux.
+- Explain the differences between tenancy models.


These aren't the only ways to set up a multi-tenant Flux system though, are they? This feels like it's partly a guide to good practice, rather than a reference. In which case, the language could be more like

Define two models for multi-tenancy, "soft multi-tenancy" and "hard multi-tenancy"

Explain when each is appropriate

Describe a reference implementation of each model with Flux

(this distinguishes between definitions, which are normative; and implementations, of which there may be variations).

rfcs/0001-multi-tenancy/README.md

squaremo · 2021-11-30T12:04:53Z

rfcs/0001-multi-tenancy/README.md

+
+### Hard Multi-Tenancy
+
+With hard multi-tenancy, the platform admins use Kubernetes Cluster API to create dedicated clusters for each tenant.


Is this a strict requirement of the model? Or could the kubeConfig secrets come from some other mechanism, e.g., if clusters are constructed with terraform, or with clicking buttons.

I'm not clear on whether applying things remotely is required for the hard multi-tenancy, or kind of a mixed-in concern (if you're giving each tenant a cluster, you probably have a management cluster, so let's base that model on that assumption ...). Could you provide some justification in the text for this approach? Or explicitly give it as an assumption.

D2iQ actually currenty implements hard multi-tenancy without kubeConfig but instead we have controllers install Flux and create the sync resources on each tenant cluster. So I suppose kubeConfig is one of several ways to enforce hard multi-tenancy.

squaremo · 2021-11-30T12:11:27Z

rfcs/0001-multi-tenancy/README.md

+When onboarding tenants, platform admins have the option to assign namespaces, set
+permissions and register the tenants main repositories onto clusters in a declarative manner.
+
+The Flux CLI offers an easy way of generating all the Kubernetes manifests needed to onboard tenants:


This and the examples following are "how to set up multi-tenancy" rather than describing the model or implementation. Demonstrating how to set it up is not a goal, in the text as it stands -- neither are describing the model or its implementation, but according to the PR title, perhaps they should be.

I suggest reworking this section to describe what the soft-tenancy model requires of RBAC (things like "each tenant namespace has a service account, with these bindings"). Telling people how to make it so conveniently, as you have here, is useful as extra information, but informative rather than definitive.

Expanding the RBAC recommendations here would be really useful.
It would be good to ensure we cater for protecting the tenant's service account from being misused.
Here's some ideas:

A) Vanila K8S

The Platform Admin would pre-create all namespaces the tenant will use ahead of time, setting access via rolebindings for all the tenant's namespaces.
All Flux objects are created at the tenant flux namespace.

flux-tenant-alpha ├── flux-tenant-alpha-serviceaccount ├── flux-tenant-alpha-rolebinding ├── podinfo-helmrepository └── podinfo-helmrelease apps ├── flux-tenant-alpha-rolebinding └── podinfo

B) HNC

The Platform Admin pre-creates the tenant top level namespace, with its service account and rolebindings.
All Flux objects are created at the tenant top level namespace.
Tenants can create subnamespaces and deploy apps to it.

flux-tenant-alpha ├── flux-tenant-alpha-serviceaccount ├── flux-tenant-alpha-rolebinding ├── podinfo-helmrepository ├── podinfo-helmrelease └── [ns] apps ├── flux-tenant-alpha-rolebinding └── podinfo

In both cases, the "deployment" service account is never placed on a namespace that is shared with other applications.

If a tenant's flux namespace needs to have mixed use (shared between applications and flux components), it would require admission controllers to block the misuse of the tenant's service account.

C) Vanila K8S + Admission Controllers

flux-tenant-alpha ├── flux-tenant-alpha-serviceaccount (i.e. kyverno policy to block misuse of this service account) ├── flux-tenant-alpha-rolebinding ├── podinfo-helmrepository └── podinfo-helmrelease └── podinfo

squaremo · 2021-11-30T12:15:47Z

rfcs/0001-multi-tenancy/README.md

+- [EKS multi-tenancy best practices](https://aws.github.io/aws-eks-best-practices/security/docs/multitenancy/)
+
+### Soft Multi-Tenancy
+


The paras here are a nice, and concise, explanation 💟

squaremo · 2021-11-30T12:17:33Z

rfcs/0001-multi-tenancy/README.md

+make use of it without any manual actions. For clusters created by other means than Cluster API, the
+platform team has to create the `kubeConfig` secrets to allow Flux access to the remote clusters.
+
+As of Flux v0.23.0, we don't provide any guidance for cluster admins on how to generate the `kubeConfig` secrets.


The text above says they come from Cluster API.

makkes · 2021-12-03T16:38:56Z

rfcs/0001-multi-tenancy/README.md

+
+## Motivation
+
+The documentation [here](https://fluxcd.io/docs/) describes the security model of Flux.


Suggested change

The documentation [here](https://fluxcd.io/docs/) describes the security model of Flux.

The documentation [here](https://fluxcd.io/docs/security/) describes the security model of Flux.

Isn't this the more concrete page? The main one doesn't mention security.

makkes · 2021-12-03T16:43:02Z

rfcs/0001-multi-tenancy/README.md

+
+## Introduction
+
+Flux allows different organizations and/or teams to share the same Kubernetes control plane.


I recall someone (maybe it was @stefanprodan) telling me there shouldn't be multiple instances of Flux running on a single cluster (which could help in isolating tenants). Maybe that notion should be part of this doc as some kind of "official guidance"?

There are configuration options in which this theoretically still is a solution, but need to adhere to a set of rules that do not apply to most.

makkes · 2021-12-03T16:47:19Z

rfcs/0001-multi-tenancy/README.md

+
+## User Roles
+
+The tenancy models assume two types of user: platform admins and tenants.


Suggested change

The tenancy models assume two types of user: platform admins and tenants.

The existing Flux tenancy models assume two types of user: platform admins and tenants.

Not sure if that's the intention here but I figure a bit of clarification of which tenancy model we're talking about here might be helpful.

makkes · 2021-12-03T16:57:33Z

rfcs/0001-multi-tenancy/README.md

+
+The platform admins have unrestricted, cluster-scoped access to Kubernetes API.
+They are responsible for installing Flux and granting Flux
+access to the sources (Git, Helm, OCI repositories) that make up the cluster(s) control plane desired state.


I wonder if it would make sense to introduce a definition of the "control plane" in a separate paragraph at the beginning of the RFC somewhere. I'm thinking of everything that is either shared among tenants or created as part of the on-boarding of a tenant; the Flux instance itself, components such as Gatekeeper and resources such as ServiceAccounts.

It might also be helpful to explain the repo hierarchy: Each tenant has a root repo that's created by cluster admins and as many subsequent repos maintained by themselves.

makkes · 2021-12-06T15:31:41Z

rfcs/0001-multi-tenancy/README.md

+
+### Hard Multi-Tenancy
+
+With hard multi-tenancy, the platform admins use Kubernetes Cluster API to create dedicated clusters for each tenant.


D2iQ actually currenty implements hard multi-tenancy without kubeConfig but instead we have controllers install Flux and create the sync resources on each tenant cluster. So I suppose kubeConfig is one of several ways to enforce hard multi-tenancy.

makkes · 2021-12-06T15:32:36Z

rfcs/0001-multi-tenancy/README.md

+Note that with hard multi-tenancy, tenants have full access to cluster-wide resources, so they have the option
+to manage Flux independently of platform admins, by deploying a Flux instance on each cluster.


We should mention here that hard multi-tenancy can be combined with soft multi-tenancy to get around this limitation.

csand-msft · 2021-12-06T20:00:23Z

rfcs/0001-multi-tenancy/README.md

+
+The Kubernetes tenancy models supported by Flux are: soft multi-tenancy and hard multi-tenancy.
+
+For an overview of the Kubernetes multi-tenant architecture please consult the following documentation:


AKS multi-tenancy doc: https://docs.microsoft.com/azure/aks/operator-best-practices-cluster-isolation

pjbgf · 2021-12-13T09:47:42Z

rfcs/0001-multi-tenancy/README.md

+
+## Tenancy Models
+
+The Kubernetes tenancy models supported by Flux are: soft multi-tenancy and hard multi-tenancy.


On the four sources below (Kubernetes, GCP, Azure and AWS) only AWS uses the terms soft and hard for multi-tenancy. It would be useful to expand slightly here to clarify what we mean by it, which may speak to the RFC's goal of "Explain when each model is appropriate.".

Some ideas:

Soft Multi-tenancy Hard Multi-tenancy

Tenants may share cluster with other tenants Yes No

Tenants may share cluster with the flux management instance Yes No

Tenants access to cluster-wide resources Limited Unrestricted

pjbgf · 2021-12-13T09:48:19Z

rfcs/0001-multi-tenancy/README.md

+- [EKS multi-tenancy best practices](https://aws.github.io/aws-eks-best-practices/security/docs/multitenancy/)
+
+### Soft Multi-Tenancy
+


pjbgf · 2021-12-13T10:17:02Z

rfcs/0001-multi-tenancy/README.md

+Note that with soft multi-tenancy, true tenant isolation requires security measures beyond Kubernetes RBAC.
+Please refer to the Kubernetes [security considerations documentation](https://kubernetes.io/blog/2021/04/15/three-tenancy-models-for-kubernetes/#security-considerations)
+for more details on how to harden shared clusters.


I wonder whether we need a small multi-tenancy security section on its own, as similar points may be valid for hard multi-tenancy - although at a lower level of the stack.

The key point being that flux support several multi-tenancy use cases, but the Platform Admin is ultimately the responsible for ensuring the correct level of isolation is enforced between the tenants, based on their own security requirements.

pjbgf · 2021-12-13T11:57:34Z

rfcs/0001-multi-tenancy/README.md

+When onboarding tenants, platform admins have the option to assign namespaces, set
+permissions and register the tenants main repositories onto clusters in a declarative manner.
+
+The Flux CLI offers an easy way of generating all the Kubernetes manifests needed to onboard tenants:


Expanding the RBAC recommendations here would be really useful.
It would be good to ensure we cater for protecting the tenant's service account from being misused.
Here's some ideas:

A) Vanila K8S

The Platform Admin would pre-create all namespaces the tenant will use ahead of time, setting access via rolebindings for all the tenant's namespaces.
All Flux objects are created at the tenant flux namespace.

flux-tenant-alpha ├── flux-tenant-alpha-serviceaccount ├── flux-tenant-alpha-rolebinding ├── podinfo-helmrepository └── podinfo-helmrelease apps ├── flux-tenant-alpha-rolebinding └── podinfo

B) HNC

The Platform Admin pre-creates the tenant top level namespace, with its service account and rolebindings.
All Flux objects are created at the tenant top level namespace.
Tenants can create subnamespaces and deploy apps to it.

flux-tenant-alpha ├── flux-tenant-alpha-serviceaccount ├── flux-tenant-alpha-rolebinding ├── podinfo-helmrepository ├── podinfo-helmrelease └── [ns] apps ├── flux-tenant-alpha-rolebinding └── podinfo

In both cases, the "deployment" service account is never placed on a namespace that is shared with other applications.

If a tenant's flux namespace needs to have mixed use (shared between applications and flux components), it would require admission controllers to block the misuse of the tenant's service account.

C) Vanila K8S + Admission Controllers

flux-tenant-alpha ├── flux-tenant-alpha-serviceaccount (i.e. kyverno policy to block misuse of this service account) ├── flux-tenant-alpha-rolebinding ├── podinfo-helmrepository └── podinfo-helmrelease └── podinfo

These were adapted from the multi-tenancy RFC: #2086 Signed-off-by: Michael Bridgen <michael@weave.works>

Signed-off-by: Stefan Prodan <stefan.prodan@gmail.com>

The multi-tenancy implementations described rely on impersonation and remote apply; to make this RFC stand by itself, those need to be explained, along with the authorisation model (how Flux "decides" what it's allowed to do). This commit adds a summary of the authorisation model, impersonation, and remote apply, and rejigs the headings a little to make space. Signed-off-by: Michael Bridgen <michael@weave.works>

Signed-off-by: Stefan Prodan <stefan.prodan@gmail.com>

This gives a baseline for future changes, e.g., expanding where namespace ACLs are used, switching access control to untrusted-by-default. The "Security considerations" section was adapted from #2086 Signed-off-by: Michael Bridgen <michael@weave.works>

This gives a baseline for future changes, e.g., expanding where namespace ACLs are used, switching access control to untrusted-by-default. The "Security considerations" section was adapted from fluxcd#2086 Signed-off-by: Michael Bridgen <michael@weave.works>

stefanprodan requested review from squaremo, makkes, darkowlzz, relu, hiddeco and phillebaba November 15, 2021 13:19

stefanprodan added the area/rfc Feature request proposals in the RFC format label Nov 15, 2021

phillebaba reviewed Nov 15, 2021

View reviewed changes

stefanprodan force-pushed the rfc-0001 branch 5 times, most recently from e3d2c9e to df82c40 Compare November 18, 2021 13:18

hiddeco reviewed Nov 18, 2021

View reviewed changes

rfcs/0001-multi-tenancy/README.md Outdated Show resolved Hide resolved

stefanprodan force-pushed the rfc-0001 branch 2 times, most recently from 2c1c1c7 to dcc754b Compare November 18, 2021 16:16

souleb reviewed Nov 22, 2021

View reviewed changes

yebyen suggested changes Nov 23, 2021

View reviewed changes

rfcs/0001-multi-tenancy/README.md Outdated Show resolved Hide resolved

rfcs/0001-multi-tenancy/README.md Outdated Show resolved Hide resolved

jonathan-innis reviewed Nov 24, 2021

View reviewed changes

rfcs/0001-multi-tenancy/README.md Outdated Show resolved Hide resolved

stefanprodan force-pushed the rfc-0001 branch 2 times, most recently from 76a8325 to a417992 Compare November 25, 2021 15:16

jonathan-innis approved these changes Nov 29, 2021

View reviewed changes

nikkomiu approved these changes Nov 29, 2021

View reviewed changes

stefanprodan force-pushed the rfc-0001 branch 2 times, most recently from d12d812 to befbd52 Compare November 30, 2021 11:37

squaremo reviewed Nov 30, 2021

View reviewed changes

makkes reviewed Dec 6, 2021

View reviewed changes

csand-msft reviewed Dec 6, 2021

View reviewed changes

pjbgf reviewed Dec 13, 2021

View reviewed changes

pjbgf mentioned this pull request Dec 13, 2021

[RFC] Flux Multi-Tenancy Mode #2093

Closed

squaremo added a commit that referenced this pull request Dec 16, 2021

Add security considerations

48eaaab

These were adapted from the multi-tenancy RFC: #2086 Signed-off-by: Michael Bridgen <michael@weave.works>

stefanprodan force-pushed the rfc-0001 branch from f884db8 to 18091b4 Compare December 17, 2021 09:44

stefanprodan changed the title ~~[RFC-0001] Define Flux tenancy models~~ [RFC-0004] Define Flux tenancy models Dec 17, 2021

stefanprodan and others added 4 commits December 17, 2021 11:58

Define Flux tenancy models

d23d87a

Signed-off-by: Stefan Prodan <stefan.prodan@gmail.com>

Incorporate Michael's suggestions

dc7cb18

Signed-off-by: Stefan Prodan <stefan.prodan@gmail.com>

Refer to authorisation model in RFC-0001

e0bc754

Signed-off-by: Stefan Prodan <stefan.prodan@gmail.com>

stefanprodan force-pushed the rfc-0001 branch from 18091b4 to e0bc754 Compare December 17, 2021 09:58

stefanprodan mentioned this pull request Dec 17, 2021

[RFC-0001] Memorandum on the authorization model #2212

Merged

kingdonb mentioned this pull request Jan 26, 2022

No alerts for HelmRelease in slack fluxcd/helm-controller#350

Closed

1 task

stefanprodan mentioned this pull request Feb 3, 2022

Propose security model for impersonation/tenancy #582

Closed

stefanprodan changed the title ~~[RFC-0004] Define Flux tenancy models~~ [RFC] Define Flux tenancy models Apr 12, 2022

stefanprodan marked this pull request as draft April 12, 2022 12:00

pjbgf mentioned this pull request Apr 20, 2022

Multi-tenancy Improvements #2655

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC] Define Flux tenancy models #2086

[RFC] Define Flux tenancy models #2086

stefanprodan commented Nov 15, 2021 •

edited

phillebaba Nov 15, 2021

stefanprodan Nov 15, 2021

stefanprodan Nov 25, 2021

squaremo Nov 30, 2021

makkes Dec 3, 2021

souleb Nov 22, 2021

stefanprodan Nov 25, 2021

yebyen left a comment

squaremo commented Nov 29, 2021 •

edited

jonathan-innis left a comment

nikkomiu left a comment

squaremo left a comment

squaremo Nov 29, 2021

squaremo Nov 30, 2021

squaremo Nov 30, 2021

makkes Dec 6, 2021

squaremo Nov 30, 2021

pjbgf Dec 13, 2021

squaremo Nov 30, 2021

pjbgf Dec 13, 2021

squaremo Nov 30, 2021

makkes Dec 3, 2021

makkes Dec 3, 2021

hiddeco Dec 6, 2021

makkes Dec 3, 2021

makkes Dec 3, 2021

makkes Dec 6, 2021

makkes Dec 6, 2021

csand-msft Dec 6, 2021

pjbgf Dec 13, 2021

pjbgf Dec 13, 2021

pjbgf Dec 13, 2021

pjbgf Dec 13, 2021


		Example of operations performed by tenants:

		- Register their sources with Flux (`GitRepositories`, `HelmRepositories` and `Buckets`).

		- List the tenancy models supported by Flux.
		- Explain the differences between tenancy models.


		### Hard Multi-Tenancy

		With hard multi-tenancy, the platform admins use Kubernetes Cluster API to create dedicated clusters for each tenant.

		- [EKS multi-tenancy best practices](https://aws.github.io/aws-eks-best-practices/security/docs/multitenancy/)

		### Soft Multi-Tenancy


		## Motivation

		The documentation [here](https://fluxcd.io/docs/) describes the security model of Flux.


		## Introduction

		Flux allows different organizations and/or teams to share the same Kubernetes control plane.


		## User Roles

		The tenancy models assume two types of user: platform admins and tenants.

	The tenancy models assume two types of user: platform admins and tenants.
	The existing Flux tenancy models assume two types of user: platform admins and tenants.

		Note that with hard multi-tenancy, tenants have full access to cluster-wide resources, so they have the option
		to manage Flux independently of platform admins, by deploying a Flux instance on each cluster.


		The Kubernetes tenancy models supported by Flux are: soft multi-tenancy and hard multi-tenancy.

		For an overview of the Kubernetes multi-tenant architecture please consult the following documentation:


		## Tenancy Models

		The Kubernetes tenancy models supported by Flux are: soft multi-tenancy and hard multi-tenancy.

	Soft Multi-tenancy	Hard Multi-tenancy
Tenants may share cluster with other tenants	Yes	No
Tenants may share cluster with the flux management instance	Yes	No
Tenants access to cluster-wide resources	Limited	Unrestricted

[RFC] Define Flux tenancy models #2086

Are you sure you want to change the base?

[RFC] Define Flux tenancy models #2086

Conversation

stefanprodan commented Nov 15, 2021 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yebyen left a comment

Choose a reason for hiding this comment

squaremo commented Nov 29, 2021 • edited

jonathan-innis left a comment

Choose a reason for hiding this comment

nikkomiu left a comment

Choose a reason for hiding this comment

squaremo left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stefanprodan commented Nov 15, 2021 •

edited

squaremo commented Nov 29, 2021 •

edited