Author: Matheus Pimenta <matheuscscp@gmail.com>


RFC-0010 Multi-Tenant Workload Identity

Status: implementable

Creation date: 2025-02-22

Last update: 2025-04-14

Summary

In this RFC we aim to add support for multi-tenant workload identity in Flux, i.e. the ability to specify at the object level which set of cloud provider permissions must be used for interacting with the respective cloud provider on behalf of the reconciliation of the object. In this process, credentials must be obtained automatically, i.e. this feature must not involve the use of secrets. This would be useful in a number of Flux APIs that need to interact with cloud providers, spanning all the Flux controllers except for helm-controller.

Multi-Tenancy Model

In the context of this RFC, multi-tenancy refers to the ability of a single Flux instance running inside a Kubernetes cluster to manage Flux objects belonging to all the tenants in the cluster while still ensuring that each tenant has access only to their own resources according to the Least Privilege Principle. In this scenario a tenant is often a team inside an organization, so the reader can consider the multi-team tenancy model. Each team has their own namespaces, which are not shared with other teams.

Motivation

Flux has strong multi-tenancy features. For example, the Kustomization and HelmRelease APIs support the field spec.serviceAccountName for specifying the Kubernetes ServiceAccount to impersonate when interacting with the Kubernetes API on behalf of a tenant, e.g. when applying resources. This allows tenants to be constrained under the Kubernetes RBAC permissions granted to this ServiceAccount, and therefore have access only to the specific subset of resources they should be allowed to use.

Besides the Kubernetes API, Flux also interacts with cloud providers, e.g. container registries, object storage, pub/sub services, etc. In these cases, Flux currently supports two modes of authentication:

  • Secret-based multi-tenant authentication: Objects have the field spec.secretRef for specifying the Kubernetes Secret containing the credentials to use when interacting with the cloud provider. This is similar to the spec.serviceAccountName field, but for cloud providers. The problem with this approach is that secrets are a security risk and operational burden, as they must be managed and rotated.
  • Workload-identity-based single-tenant authentication: Flux offers single-tenant workload identity support by configuring the ServiceAccount of the Flux controllers to impersonate a cloud identity. This eliminates the need for secrets, as the credentials are obtained automatically by the cloud provider Go libraries used by the Flux controllers when they are running inside the respective cloud environment. The problem with this approach is that it is single-tenant, i.e. all objects are reconciled using the same cloud identity, the one associated with the respective controller.

For delivering the high level of security and multi-tenancy support that Flux aims for, it is necessary to extend the workload identity support to be multi-tenant. This means that each object must be able to specify which cloud identity must be impersonated when interacting with the cloud provider on behalf of the reconciliation of the object. This would allow tenants to be constrained under the cloud provider permissions granted to this identity, and therefore have access only to the specific subset of resources they are allowed to manage.

Goals

Provide multi-tenant workload identity support in Flux, i.e. the ability to specify at the object level which cloud identity must be impersonated to interact with the respective cloud provider on behalf of the reconciliation of the object, without the need for secrets.

Non-Goals

It's not a goal to provide multi-tenant workload identity federation support. The (small) difference between workload identity and workload identity federation is that the former assumes that the workloads are running inside the cloud environment, while the latter assumes that the workloads are running outside the cloud environment. All the major cloud providers support both, as the majority of the underlying technology is the same, but the configuration is slightly different. Because the differences are small we may consider workload identity federation support in the future, but it's not a goal for this RFC.

Proposal

For supporting multi-tenant workload identity at the object level for the Flux APIs, we propose associating the Flux objects with Kubernetes ServiceAccounts. The controller would need to create a token for the ServiceAccount associated with the object in the Kubernetes API, and then exchange it for a short-lived access token for the cloud provider. This would require the controller ServiceAccount to have RBAC permission to create tokens for any ServiceAccount in the cluster.

User Stories

Story 1

As a cluster administrator, I want to allow tenant A to pull OCI artifacts from the Amazon ECR repository belonging to tenant A, but only from this repository. At the same time, I want to allow tenant B to pull OCI artifacts from the Amazon ECR repository belonging to tenant B, but only from this repository.

For example, I would like to have the following configuration:

apiVersion: source.toolkit.fluxcd.io/v1beta2
kind: OCIRepository
metadata:
  name: tenant-a-repo
  namespace: tenant-a
spec:
  ...
  provider: aws
  serviceAccountName: tenant-a-ecr-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-a-ecr-sa
  namespace: tenant-a
  annotations:
    eks.amazonaws.com/role-arn: arn:aws:iam::123456789123:role/tenant-a-ecr
---
apiVersion: source.toolkit.fluxcd.io/v1beta2
kind: OCIRepository
metadata:
  name: tenant-b-repo
  namespace: tenant-b
spec:
  ...
  provider: aws
  serviceAccountName: tenant-b-ecr-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-b-ecr-sa
  namespace: tenant-b
  annotations:
    eks.amazonaws.com/role-arn: arn:aws:iam::123456789123:role/tenant-b-ecr

Story 2

As a cluster administrator, I want to allow tenant A to pull and push to the Git repository in Azure DevOps belonging to tenant A, but only this repository. At the same time, I want to allow tenant B to pull and push to the Git repository in Azure DevOps belonging to tenant B, but only this repository.

For example, I would like to have the following configuration:

apiVersion: source.toolkit.fluxcd.io/v1
kind: GitRepository
metadata:
  name: tenant-a-repo
  namespace: tenant-a
spec:
  ...
  provider: azure
  serviceAccountName: tenant-a-azure-devops-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-a-azure-devops-sa
  namespace: tenant-a
  annotations:
    azure.workload.identity/client-id: d6e4fc00-c5b2-4a72-9f84-6a92e3f06b08 # client ID for my tenant A
    azure.workload.identity/tenant-id: 72f988bf-86f1-41af-91ab-2d7cd011db47 # azure tenant for the cluster (optional, defaults to the env var AZURE_TENANT_ID set in the controller)
---
apiVersion: image.toolkit.fluxcd.io/v1beta2
kind: ImageUpdateAutomation
metadata:
  name: tenant-a-image-update
  namespace: tenant-a
spec:
  ...
  sourceRef:
    kind: GitRepository
    name: tenant-a-repo
---
apiVersion: source.toolkit.fluxcd.io/v1
kind: GitRepository
metadata:
  name: tenant-b-repo
  namespace: tenant-b
spec:
  ...
  provider: azure
  serviceAccountName: tenant-b-azure-devops-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-b-azure-devops-sa
  namespace: tenant-b
  annotations:
    azure.workload.identity/client-id: 4a7272f9-f186-41af-9f84-6a92e32d7cd0 # client ID for my tenant B
    azure.workload.identity/tenant-id: 72f988bf-86f1-41af-91ab-2d7cd011db47 # azure tenant for the cluster (optional, defaults to the env var AZURE_TENANT_ID set in the controller)
---
apiVersion: image.toolkit.fluxcd.io/v1beta2
kind: ImageUpdateAutomation
metadata:
  name: tenant-b-image-update
  namespace: tenant-b
spec:
  ...
  sourceRef:
    kind: GitRepository
    name: tenant-b-repo

Story 3

As a cluster administrator, I want to allow tenant A to pull manifests from the GCS bucket belonging to tenant A, but only from this bucket. At the same time, I want to allow tenant B to pull manifests from the GCS bucket belonging to tenant B, but only from this bucket.

For example, I would like to have the following configuration:

apiVersion: source.toolkit.fluxcd.io/v1
kind: Bucket
metadata:
  name: tenant-a-bucket
  namespace: tenant-a
spec:
  ...
  provider: gcp
  serviceAccountName: tenant-a-gcs-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-a-gcs-sa
  namespace: tenant-a
  annotations:
    iam.gke.io/gcp-service-account: tenant-a-bucket@my-org-project.iam.gserviceaccount.com
---
apiVersion: source.toolkit.fluxcd.io/v1
kind: Bucket
metadata:
  name: tenant-b-bucket
  namespace: tenant-b
spec:
  ...
  provider: gcp
  serviceAccountName: tenant-b-gcs-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-b-gcs-sa
  namespace: tenant-b
  annotations:
    iam.gke.io/gcp-service-account: tenant-b-bucket@my-org-project.iam.gserviceaccount.com

Story 4

As a cluster administrator, I want to allow tenant A to decrypt secrets using the AWS KMS key belonging to tenant A, but only this key. At the same time, I want to allow tenant B to decrypt secrets using the AWS KMS key belonging to tenant B, but only this key.

For example, I would like to have the following configuration:

apiVersion: kustomize.toolkit.fluxcd.io/v1
kind: Kustomization
metadata:
  name: tenant-a-aws-kms
  namespace: tenant-a
spec:
  ...
  decryption:
    provider: sops
    serviceAccountName: tenant-a-aws-kms-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-a-aws-kms-sa
  namespace: tenant-a
  annotations:
    eks.amazonaws.com/role-arn: arn:aws:iam::123456789123:role/tenant-a-kms
---
apiVersion: kustomize.toolkit.fluxcd.io/v1
kind: Kustomization
metadata:
  name: tenant-b-aws-kms
  namespace: tenant-b
spec:
  ...
  decryption:
    provider: sops
    serviceAccountName: tenant-b-aws-kms-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-b-aws-kms-sa
  namespace: tenant-b
  annotations:
    eks.amazonaws.com/role-arn: arn:aws:iam::123456789123:role/tenant-b-kms

Story 5

As a cluster administrator, I want to allow tenant A to publish notifications to the tenant-a topic in Google Cloud Pub/Sub, but only to this topic. At the same time, I want to allow tenant B to publish notifications to the tenant-b topic in Google Cloud Pub/Sub, but only to this topic. I want to do so without creating any GCP IAM Service Accounts.

For example, I would like to have the following configuration:

apiVersion: notification.toolkit.fluxcd.io/v1beta3
kind: Provider
metadata:
  name: tenant-a-google-pubsub
  namespace: tenant-a
spec:
  ...
  type: googlepubsub
  serviceAccountName: tenant-a-google-pubsub-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-a-google-pubsub-sa
  namespace: tenant-a
---
apiVersion: notification.toolkit.fluxcd.io/v1beta3
kind: Provider
metadata:
  name: tenant-b-google-pubsub
  namespace: tenant-b
spec:
  ...
  type: googlepubsub
  serviceAccountName: tenant-b-google-pubsub-sa
---
apiVersion: v1
kind: ServiceAccount
metadata:
  name: tenant-b-google-pubsub-sa
  namespace: tenant-b

Alternatives

An alternative for identifying Flux resources in cloud providers

Instead of issuing ServiceAccount tokens in the Kubernetes API, we could come up with a username naming scheme for Flux resources and issue tokens for these usernames instead, e.g. flux:<resource type>:<namespace>:<name>. This would give each Flux object its own identity instead of using ServiceAccounts for this purpose, and would prevent other Flux objects created by malicious actors in the same namespace from abusing the permissions granted to the object's ServiceAccount. This choice, however, would provide a worse user experience, as Flux and Kubernetes users are already used to the ServiceAccount resource being the identity for resources in the cluster, not only in the context of plain RBAC but also in the context of workload identity. It would also require the introduction of new APIs for configuring the respective cloud identities in the Flux objects, when such APIs already exist in the form of the annotations the cloud providers themselves define on ServiceAccount resources. We therefore choose to stick with the well-known pattern of using ServiceAccounts for configuring the identities of the Flux resources. Furthermore, as mentioned in the Multi-Tenancy Model section, the tenant trust domains are namespaces, so a tenant is expected to control, and have access to, all the resources that ServiceAccounts in their namespaces are allowed to access.

Alternatives for modifying controller RBAC to create ServiceAccount tokens

In this section we discuss alternatives for changing the RBAC of controllers for creating ServiceAccount tokens cluster-wide, as it has a potential impact on the security posture of Flux.

  1. We grant RBAC permissions to the ServiceAccounts of the Flux controllers (that would implement multi-tenant workload identity) for creating tokens for any other ServiceAccounts in the cluster.
  2. We require users to grant "self-impersonation" to the ServiceAccounts so they can create tokens for themselves. The controller would then impersonate the ServiceAccount when creating a token for it. This operation would then only succeed if the ServiceAccount has been correctly granted permission to create a token for itself.

In both alternatives the controller ServiceAccount would require some form of cluster-wide impersonation permission. Alternative 2 requires impersonation permission to be granted directly to the controller ServiceAccount, while in alternative 1 the impersonation permission is granted indirectly through the ability to create tokens for other ServiceAccounts. By creating a token for another ServiceAccount, the controller ServiceAccount effectively gains the same permissions as that ServiceAccount, as it could simply use the token to impersonate it. It is therefore reasonable to conclude that both alternatives are equivalent in terms of security.

To break the tie, we note that alternative 1 eliminates operational burden on users. Indeed, native workload identity for pods does not require users to grant this self-impersonation permission to the ServiceAccounts of the pods.

We therefore choose alternative 1.
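As a concrete illustration of alternative 1, the cluster-wide RBAC granted to a controller's ServiceAccount could look like the following (the ClusterRole name is hypothetical, and the exact RBAC shipped by Flux may differ):

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: flux-serviceaccount-token-creator # hypothetical name
rules:
  - apiGroups: [""]
    resources: ["serviceaccounts/token"]
    verbs: ["create"]
```

Note that the permission is scoped to the token subresource only; the controller does not need to read or modify the ServiceAccounts themselves through this rule.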

Design Details

For detailing the proposal we need to first introduce the technical background on how workload identity is implemented by the managed Kubernetes services from the cloud providers.

Technical Background

Workload identity in Kubernetes is based on OpenID Connect (OIDC) Discovery. The Kubernetes ServiceAccount token issuer, included as the iss JWT claim in the issued tokens and represented by the default URL https://kubernetes.default.svc.cluster.local, implements the OIDC discovery protocol. Essentially, this means that the Kubernetes API will respond to requests at the URL https://kubernetes.default.svc.cluster.local/.well-known/openid-configuration with a JSON document similar to the one below:

{
  "issuer": "https://kubernetes.default.svc.cluster.local",
  "jwks_uri": "https://172.18.0.2:6443/openid/v1/jwks",
  "response_types_supported": [
    "id_token"
  ],
  "subject_types_supported": [
    "public"
  ],
  "id_token_signing_alg_values_supported": [
    "RS256"
  ]
}

And at the URL https://172.18.0.2:6443/openid/v1/jwks, discovered through the field .jwks_uri in the JSON response above, the Kubernetes API will respond with a JSON document similar to the following:

{
  "keys": [
    {
      "use": "sig",
      "kty": "RSA",
      "kid": "NWm3YKmazJPVP7tttzkmSxUn0w8LGGp7yS2CanEF-A8",
      "alg": "RS256",
      "n": "lV2tbw9hnz1mseah2kMQNe5sRju4mPLlK0F7np97lLNC49G8yc5TMjyciLF3qsDNFCfWyYmsuGlcRg2BIBBX_jkpIUUjlsktdHhuqO2RnOqyRtNuljlT_b0QJgpgxCqq0DHI31EBc0JALOVd6EjjlhsVvVzZOw_b9KBXVS3D3RENuT0_FWauDq5NYbyYnjlvk-vUXCRMNDQSDNwx6X6bktwsmeDRXtM_bP3DokmnMYc4n0asTEg14L6VKky0ByF88Wi1-y0Pm0BHdobDGt1cIeUDeThk4E79JCHxkT5urAyYHcNwcfU4q-tnD6bTpNkFVsk3cqqK2nF7R_7ac5arSQ",
      "e": "AQAB"
    }
  ]
}

This JSON document contains the public keys for verifying the signature of the issued tokens.

By querying these two URLs in sequence, cloud providers are able to fetch the information required for verifying and trusting the tokens issued by the Kubernetes API. More specifically, for trusting the sub JWT claim, which contains the reference (name and namespace) of the Kubernetes ServiceAccount for which the token was issued, i.e. the ServiceAccount itself.

By allowing permissions to be granted to Kubernetes ServiceAccounts, the cloud provider can then let those ServiceAccounts access its resources. This is usually done by a Security Token Service (STS) that exchanges the Kubernetes token for a short-lived cloud provider access token, which is then used to access the cloud provider resources.

It's important to mention that the Kubernetes ServiceAccount token issuer URL must be trusted by the cloud provider, i.e. users must configure this URL as a trusted identity provider.

This process forms the basis for workload identity in Kubernetes. As long as the issuer URL can be reached by the cloud provider, this process can take place successfully.

The reachability of the issuer URL by the cloud provider is where the implementation of workload identity starts to differ between cloud providers. For example, in GCP one can configure the content of the JWKS document directly in the GCP IAM console, which eliminates the need for network calls to the Kubernetes API. In AWS, on the other hand, this is not possible; the process has to be followed strictly, i.e. the issuer URL must be reachable by the AWS STS service.

Furthermore, GKE automatically creates the necessary trust relationship between the Kubernetes issuer and the GCP STS service (i.e. automatically injects the JWKS document of the GKE cluster in the STS database), while in EKS this must be done manually by users (an OIDC provider must be created for each EKS cluster).

Another difference is that the issuer URL remains the default/private one in GKE, while in EKS it is automatically set to a public one through the --service-account-issuer flag in the kube-apiserver command line arguments. This is a nice feature, as it allows external systems to federate access for workloads running in EKS clusters, e.g. EKS workloads can have federated access to GCP resources.

Yet another difference between cloud providers that sheds light on our proposal is how applications running inside pods in the managed Kubernetes services obtain the short-lived cloud provider access tokens. In GCP, the GCP libraries used by the applications attempt to retrieve tokens from the metadata server, which is reachable by all pods running in GKE. This server creates a token for the ServiceAccount of the calling pod in the Kubernetes API, exchanges it for a short-lived GCP access token, and returns it to the application. In AKS, on the other hand, pods are mutated to include a token volume projection: the kubelet mounts and automatically rotates a volume with a token file inside the pod, and the Azure libraries used by the applications read this file periodically to perform the token exchange with the Azure STS service.

Another aspect of workload identity that is important for this RFC is how the cloud identities are associated with the Kubernetes ServiceAccounts. In most cases, an identity from the IAM service of the cloud provider (e.g. a GCP IAM Service Account, or an AWS IAM Role) is associated with a Kubernetes ServiceAccount by the process of impersonation. Permission to impersonate the cloud identity is granted to the ServiceAccount through a configuration that points to the fully qualified name of the Kubernetes ServiceAccount, i.e. the name and namespace of the ServiceAccount and which cluster it belongs to in the name/address system of the cloud provider.

Because the cloud provider already needs to support granting this impersonation permission, some cloud providers go further and remove the impersonation requirement altogether by allowing permissions to be granted directly to Kubernetes ServiceAccounts (if a provider can support granting the impersonation permission, it can probably also support granting arbitrary permissions, depending on the implementation). GCP, for example, has implemented this feature recently: a GCP IAM Service Account is no longer required for workload identity, i.e. GCP IAM permissions can now be granted directly to Kubernetes ServiceAccounts. This is a significant improvement in the user experience, as it considerably reduces the required configuration steps. AWS implemented a similar feature called EKS Pod Identity, but it still requires an IAM Role to be associated with the ServiceAccount. The minor improvement from the user experience perspective is that this association is implemented entirely in the AWS EKS/IAM APIs; no annotations are required on the Kubernetes ServiceAccount. Another improvement of this EKS feature compared to IAM Roles for Service Accounts is that users no longer need to create an OIDC Provider for the EKS cluster in the IAM API.

In light of the technical background presented above, our proposal becomes simpler. The only solution for supporting multi-tenant workload identity at the object level for the Flux APIs is to associate the Flux objects with Kubernetes ServiceAccounts. We propose to build the ServiceAccount token creation and exchange logic into the Flux controllers through a library in the github.com/fluxcd/pkg repository.

API Changes

For all the Flux APIs interacting with cloud providers (except Kustomization, see the paragraph below), we propose introducing the field spec.serviceAccountName (if not already present) for specifying the Kubernetes ServiceAccount in the same namespace as the object that must be used for getting access to the respective cloud resources. This field would be optional, and when not present the original behavior would be observed, i.e. the feature only activates when the field is present and a cloud provider among aws, azure or gcp is specified in the spec.provider field. So if only the spec.provider field is present and set to a cloud provider, the controller would use single-tenant workload identity as it did prior to the implementation of this RFC, i.e. it would use its own identity for the operation.

Note that this RFC does not seek to change the behavior when spec.provider is set to generic (or left empty, when it defaults to generic), in which case the field spec.secretRef can be used for specifying the Kubernetes Secret containing the credentials (or spec.serviceAccountName in the case of the APIs dealing with container registries, through the imagePullSecrets field of the ServiceAccount).

The Kustomization API uses Key Management Services (KMS) for decrypting SOPS-encrypted secrets. We propose adding the dedicated optional field spec.decryption.serviceAccountName for multi-tenant workload identity when interacting with the KMS service. We choose a dedicated field for the Kustomization API because the field spec.serviceAccountName already exists and is used for a major part of the functionality, namely authenticating with the Kubernetes API when applying resources. If we used the same field for both purposes, users would be forced to use multi-tenancy for both cloud and Kubernetes API interactions. Furthermore, the cloud provider in the Kustomization API is detected by the SOPS SDK itself while decrypting the secrets, so we don't need to introduce a new field for this purpose.

Workload Identity Library

We propose using the Go package github.com/fluxcd/pkg/auth for implementing a workload identity library that can be used by all the Flux controllers that need to interact with cloud providers. This library would be responsible for creating the ServiceAccount tokens in the Kubernetes API and exchanging them for short-lived access tokens for the cloud provider. The library would also be responsible for caching the tokens when configured by users.

The library should support both single-tenant and multi-tenant workload identity, because single-tenant implementations are already supported in GA APIs and hence must remain available for backwards compatibility. Furthermore, it is easier to support both use cases in a single library than to graft new logic onto the currently existing ones, so this new library becomes the definitive unified solution for workload identity in Flux.

The library should automatically detect whether the workload identity is single-tenant or multi-tenant by checking if a ServiceAccount was configured for the operation. If a ServiceAccount was configured, then the operation is multi-tenant, otherwise it is single-tenant and the granted access token must represent the identity associated with the controller.

The directory structure would look like this:

.
└── auth
    ├── aws
    │   └── aws.go
    ├── azure
    │   └── azure.go
    ├── gcp
    │   └── gcp.go
    ├── get_token.go
    ├── options.go
    ├── provider.go
    └── token.go

The file auth/get_token.go would contain the main algorithm:

package auth

// GetToken returns an access token for accessing resources in the given cloud provider.
func GetToken(ctx context.Context, provider Provider, opts ...Option) (Token, error) {
	//  1. Check if a ServiceAccount is configured and return the controller access token if not (single-tenant WI).
	//  2. Get the provider audience for creating the OIDC token for the ServiceAccount in the Kubernetes API.
	//  3. Get the ServiceAccount using the configured controller-runtime client.
	//  4. Get the provider identity from the ServiceAccount annotations and add it to the options.
	//  5. Build the cache key using the configured options.
	//  6. Get the token from the cache. If present, return it, otherwise continue.
	//  7. Create an OIDC token for the ServiceAccount in the Kubernetes API using the provider audience.
	//  8. Exchange the OIDC token for an access token through the Security Token Service of the provider.
	//  9. If an image repository is configured, exchange the access token for a registry token.
	// 10. Add the final token to the cache and return it.
}
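Step 5 above could, for instance, derive the cache key by hashing every input that influences the resulting token, so that distinct tenants or repositories never share a cache entry. The helper below is a hypothetical sketch; the actual key composition in github.com/fluxcd/pkg/auth is an implementation detail:

```go
package main

import (
	"crypto/sha256"
	"fmt"
)

// buildCacheKey derives a deterministic cache key from the inputs that
// influence the resulting access token. Hashing keeps the key bounded in
// size and avoids leaking identities into cache key listings.
func buildCacheKey(provider, saNamespace, saName, identity, imageRepository string) string {
	s := fmt.Sprintf("provider=%s,sa=%s/%s,identity=%s,repo=%s",
		provider, saNamespace, saName, identity, imageRepository)
	return fmt.Sprintf("%x", sha256.Sum256([]byte(s)))
}

func main() {
	k := buildCacheKey("aws", "tenant-a", "tenant-a-ecr-sa",
		"arn:aws:iam::123456789123:role/tenant-a-ecr",
		"123456789123.dkr.ecr.us-east-1.amazonaws.com/app")
	fmt.Println(len(k)) // 64 hex characters (SHA-256)
}
```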

The file auth/token.go would contain the token abstractions:

package auth

// Token is an interface that represents an access token that can be used to
// authenticate with a cloud provider. The only common method is for getting the
// duration of the token, because different providers have different ways of
// representing the token. For example, Azure and GCP use a single string,
// while AWS uses three strings: access key ID, secret access key and token.
// Consumers of this interface should know what type to cast it to.
type Token interface {
	// GetDuration returns the duration for which the token is valid relative to
	// approximately time.Now(). This is used to determine when the token should
	// be refreshed.
	GetDuration() time.Duration
}

// RegistryCredentials is a particular type implementing the Token interface
// for credentials that can be used to authenticate with a container registry
// from a cloud provider. This type is compatible with all the cloud providers
// and should be returned when the image repository is configured in the options.
type RegistryCredentials struct {
	Username  string
	Password  string
	ExpiresAt time.Time
}

func (r *RegistryCredentials) GetDuration() time.Duration {
	return time.Until(r.ExpiresAt)
}

The file auth/provider.go would contain the Provider interface:

package auth

// Provider contains the logic to retrieve an access token for a cloud
// provider from a ServiceAccount (OIDC/JWT) token.
type Provider interface {
	// GetName returns the name of the provider.
	GetName() string

	// NewDefaultToken returns a token that can be used to authenticate with the
	// cloud provider retrieved from the default source, i.e. from the pod's
	// environment, e.g. files mounted in the pod, environment variables,
	// local metadata services, etc. In this case the method would implicitly
	// use the ServiceAccount associated with the controller pod, and not one
	// specified in the options.
	NewDefaultToken(ctx context.Context, opts ...Option) (Token, error)

	// GetAudience returns the audience the OIDC tokens issued representing
	// ServiceAccounts should have. This is usually a string that represents
	// the cloud provider's STS service, or some entity in the provider for
	// which the OIDC tokens are targeted to.
	GetAudience(ctx context.Context, sa corev1.ServiceAccount) (string, error)

	// GetIdentity takes a ServiceAccount and returns the identity which the
	// ServiceAccount wants to impersonate, by looking at annotations.
	GetIdentity(sa corev1.ServiceAccount) (string, error)

	// NewTokenForServiceAccount takes a ServiceAccount and its OIDC token and returns a token
	// that can be used to authenticate with the cloud provider. The OIDC token is
	// the JWT token that was issued for the ServiceAccount by the Kubernetes API.
	// The implementation should exchange this token for a cloud provider access
	// token through the provider's STS service.
	NewTokenForServiceAccount(ctx context.Context, oidcToken string,
		sa corev1.ServiceAccount, opts ...Option) (Token, error)

	// GetImageCacheKey extracts the part of the image repository that must be
	// included in cache keys when caching registry credentials for the provider.
	GetImageCacheKey(imageRepository string) string

	// NewRegistryToken takes an image repository and a Token and returns a token
	// that can be used to authenticate with the container registry of the image.
	NewRegistryToken(ctx context.Context, imageRepository string,
		token Token, opts ...Option) (Token, error)
}

The file auth/options.go would contain the following options:

package auth

// Options contains options for configuring the behavior of the provider methods.
// Not all providers/methods support all options.
type Options struct {
	ServiceAccount  *client.ObjectKey
	Client          client.Client
	Cache           *cache.TokenCache
	InvolvedObject  *cache.InvolvedObject
	Scopes          []string
	ImageRepository string
	STSEndpoint     string
	ProxyURL        *url.URL
}

// WithServiceAccount sets the ServiceAccount reference for the token
// and a controller-runtime client to fetch the ServiceAccount and
// create an OIDC token for it in the Kubernetes API.
func WithServiceAccount(saRef client.ObjectKey, client client.Client) Option {
	// ...
}

// WithCache sets the token cache and the involved object for recording events.
func WithCache(cache cache.TokenCache, involvedObject cache.InvolvedObject) Option {
	// ...
}

// WithScopes sets the scopes for the token.
func WithScopes(scopes ...string) Option {
	// ...
}

// WithImageRepository sets the image repository the token will be used for.
// In most cases container registry credentials require an additional
// token exchange at the end. This option allows the library to implement
// this exchange and cache the final token.
func WithImageRepository(imageRepository string) Option {
	// ...
}

// WithSTSEndpoint sets the endpoint for the STS service.
func WithSTSEndpoint(stsEndpoint string) Option {
	// ...
}

// WithProxyURL sets the URL of an HTTP/S proxy to use for acquiring the token.
func WithProxyURL(proxyURL url.URL) Option {
	// ...
}

The auth/aws/aws.go, auth/azure/azure.go and auth/gcp/gcp.go files would contain the implementations for the respective cloud providers:

package aws

import (
	"context"
	"time"

	"github.com/aws/aws-sdk-go-v2/aws"
	"github.com/aws/aws-sdk-go-v2/credentials"
	"github.com/aws/aws-sdk-go-v2/service/sts/types"

	"github.com/fluxcd/pkg/auth"
)

const ProviderName = "aws"

type Provider struct{}

type Token struct{ types.Credentials }

// GetDuration implements auth.Token.
func (t *Token) GetDuration() time.Duration {
	return time.Until(*t.Expiration)
}

type credentialsProvider struct {
	opts []auth.Option
}

// NewCredentialsProvider creates an aws.CredentialsProvider for the aws provider.
func NewCredentialsProvider(opts ...auth.Option) aws.CredentialsProvider {
	return &credentialsProvider{opts}
}

// Retrieve implements aws.CredentialsProvider.
func (c *credentialsProvider) Retrieve(ctx context.Context) (aws.Credentials, error) {
	// Use auth.GetToken() to get the token.
}

package azure

import (
	"context"
	"time"

	"github.com/Azure/azure-sdk-for-go/sdk/azcore"
	"github.com/Azure/azure-sdk-for-go/sdk/azcore/policy"

	"github.com/fluxcd/pkg/auth"
)

const ProviderName = "azure"

type Provider struct{}

type Token struct{ azcore.AccessToken }

// GetDuration implements auth.Token.
func (t *Token) GetDuration() time.Duration {
	return time.Until(t.ExpiresOn)
}

type tokenCredential struct {
	opts []auth.Option
}

// NewTokenCredential creates an azcore.TokenCredential for the azure provider.
func NewTokenCredential(opts ...auth.Option) azcore.TokenCredential {
	return &tokenCredential{opts}
}

// GetToken implements azcore.TokenCredential.
// The options argument is ignored, any options should be
// specified in the constructor.
func (t *tokenCredential) GetToken(ctx context.Context, _ policy.TokenRequestOptions) (azcore.AccessToken, error) {
	// Use auth.GetToken() to get the token.
}

package gcp

import (
	"context"
	"sync"
	"time"

	"golang.org/x/oauth2"

	"github.com/fluxcd/pkg/auth"
)

const ProviderName = "gcp"

type Provider struct{}

type Token struct{ oauth2.Token }

// GetDuration implements auth.Token.
func (t *Token) GetDuration() time.Duration {
	return time.Until(t.Expiry)
}

type tokenSource struct {
	ctx  context.Context
	opts []auth.Option
}

// NewTokenSource creates an oauth2.TokenSource for the gcp provider.
func NewTokenSource(ctx context.Context, opts ...auth.Option) oauth2.TokenSource {
	return &tokenSource{ctx, opts}
}

// Token implements oauth2.TokenSource.
func (t *tokenSource) Token() (*oauth2.Token, error) {
	// Use auth.GetToken() to get the token.
}

var gkeMetadata struct {
	projectID string
	location  string
	name      string
	mu        sync.Mutex
	loaded    bool
}

As detailed above, each cloud provider implementation defines a simple wrapper around the cloud provider access token type. This wrapper implements the auth.Token interface, which essentially consists of the GetDuration() method used by the cache library to manage token lifetimes. Each wrapper also comes with a helper function that creates a token source for the respective cloud provider SDK. These helpers have different names and signatures because the SDKs define different types, but they all implement the same concept of a token source.

The aws provider needs to read the environment variable AWS_REGION for configuring the STS client. Even though a specific STS endpoint may be configured, the AWS SDKs require the region to be set regardless. This variable is usually set automatically in EKS pods, and can be manually set by users otherwise (e.g. in Fargate pods).

An important detail in the azure provider implementation is reusing the custom implementation of azidentity.NewDefaultAzureCredential() found in kustomize-controller for SOPS decryption. This custom implementation avoids shelling out to the Azure CLI, which is something we strive to avoid in the Flux codebase. Today we avoid the Azure CLI in some APIs but not in others, so implementing this in a single place and using it everywhere will be a significant improvement.

The gcp provider needs to load the cluster metadata from the gke-metadata-server in order to create tokens. This must be done lazily, when the first token is requested: if it were done on controller startup, a controller running outside GKE would crash and enter CrashLoopBackOff because the gke-metadata-server would never become available. The cluster metadata doesn't change during the lifetime of the controller pod, so we use a sync.Mutex and a bool to load it only once into a package variable.
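The lazy, load-once pattern can be sketched as follows; the fetch callback stands in for the real call to the gke-metadata-server, and a plain mutex plus boolean (rather than sync.Once) lets a failed load be retried on the next token request:

```go
package main

import (
	"fmt"
	"sync"
)

// gkeMetadata caches the GKE cluster metadata. It is loaded lazily on
// the first token request, never on controller startup: outside GKE
// the gke-metadata-server is unreachable and an eager load would crash
// the controller into CrashLoopBackOff.
type gkeMetadata struct {
	projectID string
	location  string
	name      string
	mu        sync.Mutex
	loaded    bool
}

// load runs fetch at most once; concurrent callers block until the
// first successful load completes. On error the loaded flag stays
// false, so the next token request retries.
func (m *gkeMetadata) load(fetch func() (projectID, location, name string, err error)) error {
	m.mu.Lock()
	defer m.mu.Unlock()
	if m.loaded {
		return nil
	}
	p, l, n, err := fetch()
	if err != nil {
		return err
	}
	m.projectID, m.location, m.name = p, l, n
	m.loaded = true
	return nil
}

func main() {
	var m gkeMetadata
	calls := 0
	fetch := func() (string, string, string, error) {
		calls++
		return "my-project", "us-central1", "my-cluster", nil
	}
	m.load(fetch)
	m.load(fetch) // second call is a no-op
	fmt.Println(calls, m.projectID) // prints "1 my-project"
}
```

sync.Once would memoize a failed load forever, which is why the mutex-and-bool variant is the better fit here.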

Cache Key

The cache key must include the following components:

  • The cloud provider name.
  • The provider audience used for issuing the Kubernetes ServiceAccount token.
  • The optional ServiceAccount reference and cloud provider identity. The identity is the string representing the identity which the ServiceAccount is impersonating, e.g. for gcp this would be a GCP IAM Service Account email, for aws this would be an AWS IAM Role ARN, etc. When there is no identity configured for impersonation, only the ServiceAccount reference is included.
  • The optional scopes added to the token.
  • The cache key extracted from the optional image repository.
  • The optional STS endpoint used for issuing the token.
  • The optional proxy URL when the STS endpoint is present.

Justification

When single-tenant workload identity is being used, the identity associated with the controller is the one represented by the token, so there is no identity or ServiceAccount to identify in the cache key besides the implicit ones associated with the controller. In this case, including only the cloud provider name in the cache key is enough.

The provider audience used for issuing the ServiceAccount token is included in the cache key because it may depend on the ServiceAccount annotations. For example, in AWS if an IAM Role ARN is not specified we assume that users are attempting to use EKS Pod Identity instead of IAM Roles for Service Accounts. Each feature has its own audience string and its own way of issuing tokens, so the audience string must be included in the cache key.

In multi-tenant workload identity, the reason for including both the ServiceAccount and the identity in the cache key is to establish the fact that the ServiceAccount had permission to impersonate the identity at the time when the token was issued. This is very important. For the sake of argument, suppose we included only the identity. Then a malicious actor could specify any identity in their ServiceAccount and get a token cached for that identity even if their ServiceAccount did not have permission to impersonate it. Conversely, the identity must also be included: if the cache key contained only the ServiceAccount, changing the ServiceAccount annotations to impersonate a different identity would not change the cache key, and hence no new token impersonating the new identity would be issued.

In most cases container registry credentials require an additional token exchange at the end. In order to benefit from caching the final token and freeing the library consumers from this responsibility, we allow an image repository to be included in the options and implement the exchange. Depending on the cloud provider, a part of the image repository string is extracted and used to issue the token, e.g. for ECR the region is extracted and used to configure the client, and in the case of ACR the registry host is included in the resulting token. Those parts of the image repository must be included in the cache key. This is accomplished by the Provider.GetImageCacheKey() method. In the case of GCP container registries the image repository does not influence how the token is issued.
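A hedged sketch of what Provider.GetImageCacheKey() could compute for each provider follows; the host patterns are illustrative assumptions, not the exact ones the library would ship:

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// ecrRegionRegex matches the region segment of an ECR registry host,
// e.g. 012345678901.dkr.ecr.us-east-1.amazonaws.com. The pattern is an
// illustrative assumption.
var ecrRegionRegex = regexp.MustCompile(`^[0-9]+\.dkr\.ecr\.([^.]+)\.amazonaws\.com`)

// getImageCacheKey sketches Provider.GetImageCacheKey() for the three
// providers: ECR tokens are issued per region, ACR tokens embed the
// registry host, and GCP tokens are independent of the repository.
func getImageCacheKey(provider, imageRepository string) string {
	switch provider {
	case "aws":
		if m := ecrRegionRegex.FindStringSubmatch(imageRepository); m != nil {
			return m[1] // region, e.g. "us-east-1"
		}
		return ""
	case "azure":
		return strings.Split(imageRepository, "/")[0] // registry host
	case "gcp":
		return "gcp" // repository does not influence token issuance
	}
	return ""
}

func main() {
	fmt.Println(getImageCacheKey("aws", "012345678901.dkr.ecr.us-east-1.amazonaws.com/my-app"))   // us-east-1
	fmt.Println(getImageCacheKey("azure", "myregistry.azurecr.io/my-app"))                        // myregistry.azurecr.io
	fmt.Println(getImageCacheKey("gcp", "us-docker.pkg.dev/my-project/my-repo/my-app"))           // gcp
}
```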

The scopes are included in the cache key because they delimit the permissions that the token has. They don't grant the permissions, they just set an upper bound for the permissions that the token can have. Providers requiring scopes unfortunately benefit less from caching, e.g. a token issued for an Azure identity can't be seamlessly used for both Azure DevOps and the Azure Container Registry, because the respective scopes are different, so the issued tokens are different.

The STS endpoint and proxy URL are included in the cache key because they could influence how the token is fetched and ultimately issued. The proxy URL is included only when the STS endpoint is present, because all the default STS endpoints are HTTPS and belong to cloud providers, so they are all well-known, unique, and the proxy is guaranteed not to tamper with the issuance of the token since it only sees an opaque TLS session passing through.

Format

The cache key would be the SHA256 hash of the following string (breaking lines after commas for readability):

Single-tenant/controller-level:

provider=<cloud-provider-name>,
scopes=<comma-separated-scopes>,
imageRepositoryKey=<'gcp'-for-gcp|registry-region-for-aws|registry-host-for-azure>,
stsEndpoint=<sts-endpoint>,
proxyURL=<proxy-url>

Multi-tenant/object-level:

provider=<cloud-provider-name>,
providerAudience=<cloud-provider-audience>,
serviceAccountName=<service-account-name>,
serviceAccountNamespace=<service-account-namespace>,
cloudProviderIdentity=<cloud-provider-identity>,
scopes=<comma-separated-scopes>,
imageRepositoryKey=<'gcp'-for-gcp|registry-region-for-aws|registry-host-for-azure>,
stsEndpoint=<sts-endpoint>,
proxyURL=<proxy-url>
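The assembly and hashing of the multi-tenant key can be sketched as follows; buildCacheKey() is an illustrative helper, not the proposed API:

```go
package main

import (
	"crypto/sha256"
	"fmt"
	"strings"
)

// buildCacheKey sketches how the multi-tenant/object-level cache key
// could be assembled and hashed. Field names follow the format above.
func buildCacheKey(provider, providerAudience, saName, saNamespace,
	identity string, scopes []string, imageRepoKey, stsEndpoint, proxyURL string) string {
	parts := []string{
		"provider=" + provider,
		"providerAudience=" + providerAudience,
		"serviceAccountName=" + saName,
		"serviceAccountNamespace=" + saNamespace,
		"cloudProviderIdentity=" + identity,
		"scopes=" + strings.Join(scopes, ","),
		"imageRepositoryKey=" + imageRepoKey,
		"stsEndpoint=" + stsEndpoint,
		"proxyURL=" + proxyURL,
	}
	// Hashing keeps keys fixed-size and avoids leaking identities into
	// cache internals, logs or metrics labels.
	return fmt.Sprintf("%x", sha256.Sum256([]byte(strings.Join(parts, ","))))
}

func main() {
	key := buildCacheKey("gcp", "gcp-audience", "tenant-sa", "tenant-ns",
		"tenant@my-project.iam.gserviceaccount.com",
		[]string{"https://www.googleapis.com/auth/cloud-platform"},
		"gcp", "", "")
	fmt.Println(key) // 64 hex characters
}
```

Note how any change to the ServiceAccount reference, the impersonated identity, the scopes or the image repository key yields a different hash, which is exactly the isolation property argued for above.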

Security Considerations and Controls

As mentioned previously, a ServiceAccount must have permission to impersonate the identity it is configured to impersonate. Once a token for the impersonated identity is issued, that token would be valid for a while even if immediately after issuing it the ServiceAccount loses permission to impersonate that identity. In our cache key design, the token would remain available for the ServiceAccount to use until it expires. If the impersonation permission was revoked to mitigate an attack, the attacker could still get a valid token from the cache for a while after the revocation, and hence still exercise the permissions they had prior to the revocation.

There are a few mitigations for this scenario:

  • Users that revoke impersonation permissions for a ServiceAccount must also change the annotations of the ServiceAccount to impersonate a different identity, or delete the ServiceAccount altogether, or restart the Flux controllers so the cache is purged. Any of these actions would effectively prevent the attack, but they represent an additional step after revoking the impersonation permission.

  • In the Flux controllers users can specify the --token-cache-max-duration flag, which can be used to limit the maximum duration for which a token can be cached. By reducing the default maximum duration of one hour to a smaller value, users can limit the time window during which a token would be available for a ServiceAccount to use after losing permission to impersonate the identity.

  • Disable the cache entirely by setting the flag --token-cache-max-size=0, or by omitting the flag altogether, since the default is already zero, i.e. no tokens are cached in the Flux controllers. This mitigation suits extreme security requirements where any risk of such an attack must be avoided. It is the most effective mitigation, but it comes at the cost of many API calls for issuing tokens in the cloud provider, which could cause a performance bottleneck and/or throttling/rate-limiting, as tokens would have to be issued on every reconciliation.

A similar situation could occur in the single-tenant scenario, when the permission to impersonate the configured identity is revoked from the controller ServiceAccount. In this case, the attacker would have access to the cloud provider resources that the controller had access to prior to the revocation of the impersonation permission. Most of the mitigations mentioned above apply to this scenario as well, except for the one that involves changing the annotations of the ServiceAccount to impersonate a different identity or deleting the ServiceAccount altogether, as the controller ServiceAccount should not be deleted. The best mitigation in this case is to restart the Flux controllers so the cache is purged.

EKS Pod Identity: In EKS Pod Identity the association between a ServiceAccount and an IAM Role is not configured on the ServiceAccount annotations, nor anywhere else inside the Kubernetes cluster. The association is established entirely through the EKS/IAM APIs. In this case, all the mitigations mentioned above apply, except for the one that involves changing the annotations of the ServiceAccount, as there are no annotations to change.

Library Integration

When reconciling an object, the controller must use the auth.GetToken() function passing a controller-runtime client that has permission to create ServiceAccount tokens in the Kubernetes API, the desired cloud provider by name, and all the remaining options according to the configuration of the controller and of the object. The provider names match the ones used for spec.provider in the Flux APIs, i.e. aws, azure and gcp.

Because different cloud providers have different ways of representing their access tokens (e.g. Azure and GCP tokens are a single opaque string while AWS has three strings: access key ID, secret access key and token), consumers of the auth.Token interface would need to cast it to *<provider>.Token.
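This cast can be sketched with minimal stand-ins for the interface and the wrappers (the field names are illustrative, not the proposed types):

```go
package main

import (
	"fmt"
	"time"
)

// Token is a minimal stand-in for the auth.Token interface.
type Token interface{ GetDuration() time.Duration }

// awsToken illustrates that AWS credentials are a triple of strings,
// while azureToken carries a single opaque string.
type awsToken struct{ AccessKeyID, SecretAccessKey, SessionToken string }
type azureToken struct{ Value string }

func (t *awsToken) GetDuration() time.Duration   { return time.Hour }
func (t *azureToken) GetDuration() time.Duration { return time.Hour }

// describe shows the cast consumers would perform: auth.GetToken()
// returns the interface, and the consumer type-asserts the concrete
// wrapper for the provider it requested.
func describe(token Token) string {
	switch t := token.(type) {
	case *awsToken:
		return fmt.Sprintf("aws key id: %s", t.AccessKeyID)
	case *azureToken:
		return fmt.Sprintf("azure bearer token: %s", t.Value)
	default:
		return "unknown provider token"
	}
}

func main() {
	fmt.Println(describe(&awsToken{AccessKeyID: "AKIA..."}))
	fmt.Println(describe(&azureToken{Value: "eyJ..."}))
}
```

In practice the consumer knows which provider it asked for, so a direct assertion like token.(*aws.Token) suffices; the type switch is shown only to contrast the shapes.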

The following subsections show details of how the integration would look like.

GitRepository and ImageUpdateAutomation APIs

For these APIs the only provider we have so far that supports workload identity is azure. In this case we would simply replace AzureOpts []azure.OptFunc in the fluxcd/pkg/git.ProviderOptions struct with []fluxcd/pkg/auth.Option and would modify fluxcd/pkg/git.GetCredentials() to use auth.GetToken(). The token interface would be cast to *azure.Token and the token string would be assigned to fluxcd/pkg/git.Credentials.BearerToken. A GitRepository object configured with the azure provider and a ServiceAccount would then go through this code path.

OCIRepository, ImageRepository, HelmRepository and HelmChart APIs

The HelmRepository API only supports a cloud provider for OCI repositories, so for all these APIs we would only need to support OCI authentication.

All these APIs currently use *fluxcd/pkg/oci/auth/login.Manager to get container registry credentials. The new library would replace login.Manager entirely: login.Manager mostly handles single-tenant workload identity, while the new library covers both single-tenant and multi-tenant workload identity, making it a drop-in replacement.

In the case of the source-controller APIs, all of them use the function OIDCAuth() from the internal package internal/oci. We would replace the use of login.Manager with auth.GetToken() in this function. The token interface would be cast to *auth.RegistryCredentials and then fed to authn.FromConfig() from the package github.com/google/go-containerregistry/pkg/authn.

In the case of ImageRepository, we would replace login.Manager with auth.GetToken() in the setAuthOptions() method of the ImageRepositoryReconciler, cast the token to *auth.RegistryCredentials and then feed it to authn.FromConfig().

The beauty of this particular integration is that we would no longer require branching code paths for each cloud provider; we would just configure the options for the auth.GetToken() function and the library would take care of the rest.

Bucket API

Provider aws

A Bucket object configured with the aws provider and a ServiceAccount would cause the internal minio.MinioClient of source-controller to be created with the following new options:

  • minio.WithTokenClient(controller-runtime/pkg/client.Client)
  • minio.WithTokenCache(*fluxcd/pkg/cache.TokenCache)

The constructor would then use auth.GetToken() to get the cloud provider access token. When doing so, the minio.MinioClient would cast the token interface to *aws.Token and feed it to credentials.NewStatic() from the package github.com/minio/minio-go/v7/pkg/credentials.

Provider azure

A Bucket object configured with the azure provider and a ServiceAccount would cause the internal azure.BlobClient of source-controller to be created with the following new options:

  • azure.WithTokenClient(controller-runtime/pkg/client.Client)
  • azure.WithTokenCache(*fluxcd/pkg/cache.TokenCache)
  • azure.WithServiceAccount(controller-runtime/pkg/client.ObjectKey)
  • azure.WithInvolvedObject(*fluxcd/pkg/cache.InvolvedObject)

The constructor would then use azure.NewTokenCredential() to create a token credential and feed it to azblob.NewClient().

Provider gcp

A Bucket object configured with the gcp provider and a ServiceAccount would cause the internal gcp.GCSClient of source-controller to be created with the following new options:

  • gcp.WithTokenClient(controller-runtime/pkg/client.Client)
  • gcp.WithTokenCache(*fluxcd/pkg/cache.TokenCache)
  • gcp.WithServiceAccount(controller-runtime/pkg/client.ObjectKey)
  • gcp.WithInvolvedObject(*fluxcd/pkg/cache.InvolvedObject)

The constructor would then use gcp.NewTokenSource() to create a token source, wrap it in option.WithTokenSource() and pass it to cloud.google.com/go/storage.NewClient().

Kustomization API

The Kustomization API uses Key Management Services (KMS) for decrypting SOPS secrets. The internal packages internal/decryptor and internal/sops of kustomize-controller already use interfaces compatible with the new library in the case of aws and azure, i.e. *awskms.CredentialsProvider and *azkv.TokenCredential respectively, so we could easily use the helper functions for creating the respective token sources to configure the KMS credentials for SOPS. This is thanks to the respective SOPS libraries github.com/getsops/sops/v3/kms and github.com/getsops/sops/v3/azkv. For GCP we can introduce the equivalent interface that was recently added in this pull request. This new interface introduced in SOPS upstream can also be used for the current JSON credentials method that we use via google.CredentialsFromJSON().TokenSource. This would allow us to use only the respective token source interfaces for all three providers when using either workload identity or secrets.

Provider API

The constructor of the internal notifier.Factory of notification-controller would now accept the following new options:

  • notifier.WithTokenClient(controller-runtime/pkg/client.Client)
  • notifier.WithTokenCache(*fluxcd/pkg/cache.TokenCache)
  • notifier.WithServiceAccount(controller-runtime/pkg/client.ObjectKey)
  • notifier.WithInvolvedObject(*fluxcd/pkg/cache.InvolvedObject)

The cloud provider types that support workload identity would then use these options. See the following subsections for details.

Type azuredevops

The notifier.NewAzureDevOps() constructor would use the existing and new options to call auth.GetToken() and use it to get the cloud provider access token. When doing so, the notifier.AzureDevOps would cast the token interface to *azure.Token and feed the token string to NewPatConnection() from the package github.com/microsoft/azure-devops-go-api/azuredevops/v6.

Type azureeventhub

The notifier.NewAzureEventHub() constructor would use the existing and new options to call auth.GetToken() and use it to get the cloud provider access token. When doing so, the notifier.AzureEventHub would cast the token interface to *azure.Token and feed the token string to newJWTHub().

Type googlepubsub

The notifier.NewGooglePubSub() constructor would use the existing and new options to call gcp.NewTokenSource(), wrap the resulting token source in option.WithTokenSource() and pass it to cloud.google.com/go/pubsub.NewClient().

Implementation History

A realistic estimate for implementing this proposal is two to three Flux minor releases, so we can work on more pressing priorities while still making progress towards this milestone. The core library would be implemented in the first release, and the integration with the Flux APIs would be spread across all of these releases. All three cloud providers should be implemented for each API that gets this feature in any given release. Our first priority should be the Kustomization API, as it deals with secrets and is therefore where security matters most.