GEP-2907: TLS Configuration Placement and Terminology¶

Issue: #2907
Status: Memorandum

TLDR¶

This GEP aims to define high level TLS terminology and structure within Gateway API to ensure that all our independent proposals related to TLS result in a coherent set of APIs. This will result in some adjustments to provisional and experimental TLS-related GEPs, specifically BackendTLSPolicy and Client Certificate Verification.

Goals¶

Define high-level terminology for how we refer to TLS in Gateway API.
Define top level fields where TLS configuration can live.

Non-Goals¶

Add or change fields directly. (This may inspire changes in other GEPs though).
Commit to including specific parts of TLS configuration in Gateway API. (This is merely to provide space for future configuration, not a commitment that we will add it to the API.)

Out of Scope¶

There are a variety of related TLS concepts in Gateway API that are not currently in scope for this GEP. In the future, this GEP may be expanded to include:

Automatic mTLS (often associated with Service mesh)
TLSRoute

Introduction¶

Where TLS could be configured¶

We have three different places where we might want to configure TLS:

A. Gateways¶

Config attached at this layer will apply to everything behind a single address such as a virtual IP.

B. Gateway Listeners¶

Each Gateway contains multiple “Listeners”. Each HTTPS Listener in a Gateway must have a unique combination of protocol, port, and SNI (Server Name Indication). TLS configuration attached at this layer should be isolated via SNI.

C. BackendTLSPolicy¶

This policy allows us to attach unique TLS configuration per Backend. Depending on the organization, this policy may be owned by the application owner or the cluster operator. Note that this configuration will be used by all Gateways (potentially from different implementations) that are connecting to the backend.

"Frontend" and "Backend"¶

A guiding principle in this naming is to use consistent naming for “Downstream” (1+2) and “Upstream” (3+4), similar to Envoy. To avoid the confusion with what is upstream and downstream, Gateway API will use “Frontend” (1+2) and “Backend” (3+4).

Frontend: The entity connecting to a Gateway, typically a client application and/or web browser.
Backend: The entity a Gateway is routing traffic to, typically the endpoints behind a Service.

There are essentially 4 different segments that could be relevant to TLS configuration in Gateway API:

flowchart LR
    Frontend -.->|1. Client Cert| Gateway
    Gateway -.->|2. Server Cert| Frontend
    Gateway -.->|3. Client Cert| Backend
    Backend -.->|4. Server Cert| Gateway

The above diagram depicts these four segments as edges in a graph.

TLS mode¶

TLS can be configured with two distinct modes:

Terminate: the TLS connection is instantiated between the frontend and the Gateway. The connection between the Gateway and the backend is left unencrypted unless a new TLS connection between the two entities is configured via BackendTLSPolicy.
Passthrough: the TLS connection is instantiated between the frontend and the backend. The traffic flows through the Gateway encrypted, and the Gateway is not able to decrypt or inspect the encrypted portions of the TLS stream.

Proposed Segments¶

Note that this does not represent any form of commitment that any of these fields or concepts will exist within Gateway API. If or when they do, we propose the following naming structure:

1. Validate Client Certificate provided by Frontend¶

Proposed Placement	Name	Status
Gateway Listener	`Listener.TLS.FrontendValidation`	Proposed

Rationale¶

Use FrontendValidation to leave room for concepts like trust anchors and trust domains. Anything not strictly tied to validation would belong to Listener.TLS which is now reserved exclusively for “Frontend” TLS configuration (1+2).

Why Not Frontend.Validation?¶

Part of the idea behind this GEP is that Listener.TLS is really entirely focused on 1+2 - the bits of TLS between frontend and Gateway. That means that if we were to add any additional TLS configuration in the Listener.TLS struct, it would tied to that limited scope, and thus we don't really need a separate Frontend struct.

One could make an argument that Listener.TLS.Validation should be the field name here to avoid any ambiguity, but in this specific context, we think that it's probably helpful to specifically spell out frontend.

Why Not FrontendTLSPolicy?¶

It could be reasonable to try to mirror BackendTLSPolicy for Frontend TLS. As we considered that possibility, we evaluated why BackendTLSPolicy exists as a separate policy resource:

Some parts of the config, such as the SNI that should be used to connect to the Service/backend only make sense when attached per Service - it's very unlikely that you'd want to use the same SNI to connect to all backends from a Gateway.
The Service API is overloaded and very slow to change, Gateway API has taken the approach of exposing config at that level via policy resources that are tied to specific personas (why Session Persistence config and Backend TLS are in different resources).

We don't think either of those reasons really apply to frontend TLS. Instead, frontend TLS could theoretically be configured either for an entire Gateway, or a specific Gateway listener. Given that many implementations already support distinguishing this config per SNI, it seems to make sense to start with listener level attachment. We think that the persona that would be responsible for a Gateway is not sufficiently different than the persona that would be responsible for frontend TLS, so the current proposal is likely the best option available to us.

2. Configure TLS Termination, including Server Certificate¶

Proposed Placement	Name	Status
Gateway Listener	`Listener.TLS`	GA

Rationale¶

This is already finalized in the API and so we're stuck with this name. In hindsight a name that was more clearly tied to frontend TLS would have been ideal here.

3. Configure Client Certificate that Gateway should use to connect to Backend¶

Proposed Placement	Name	Status
Gateway	`Gateway.Spec.BackendTLS.ClientCertificateRef`	Not Proposed
BackendTLSPolicy	`BackendTLSPolicy.Spec.ClientCertificateRef`	Not Proposed

Rationale¶

It's not yet obvious which of the above options are preferable, but a case could be made for either or both. If we add a BackendTLS struct to Gateway it would leave room for concepts like TLS version, ALPN, cipher suites, etc.

Why Not Listener level to match FrontendValidation?¶

In general, we'd expect this identity to represent a Gateway as a whole, and thus per-Listener configuration probably does not make sense here. Having FrontendValidation at the Listener level allows a Gateway to accept different client certs per SNI (a matching attribute of the frontend -> Gateway connection). On the other hand, when determining the identity a Gateway should use when connecting to a backend, it should likely either be tied directly to the Gateway or Backend, but the Listener is not particularly relevant in this context.

4. Validate Server Certificate that is provided by Backend¶

Proposed Placement	Name	Status
Gateway	`Gateway.Spec.BackendTLS.Validation`	Not Proposed
BackendTLSPolicy	`BackendTLSPolicy.Spec.Validation`	Experimental (Different Name)

Rationale¶

Rename BackendTLSPolicy.TLS to BackendTLSPolicy.Validation. Because this is already clearly backend-focused, config like TLS Version, ALPN, and cipher suites, could live at the same level without confusion. It's also plausible that some Gateways may want to express Gateway-wide CAs that are trusted.

Why BackendTLS on Gateways?¶

Although this GEP is intentionally not committing to adding new fields or features, it's possible that at some point in the future we may want to have some kind of Backend TLS config at the Gateway level. For example, it may be useful to configure a set of CAs that a Gateway trusts or a client cert that a Gateway should use to connect to all backends. If this existed, there would likely be some overlap in config with BackendTLSPolicy, and if a future GEP proposed this, it would need to include how overlapping config should be handled.

Why the inconsistency between Frontend and Backend TLS config on Gateways?¶

If we were to populate all the possible fields described in this GEP, we'd end up with the following config on Gateways:

Gateway.Spec.Listeners.TLS.FrontendValidation
Gateway.Spec.Listeners.TLS
Gateway.Spec.BackendTLS.ClientCertificateRef
Gateway.Spec.BackendTLS.Validation

This is all tied to the need to provide backwards compatibility with the TLS config that is already GA on Listeners (Listener.TLS). If we were naming that field today with the broader vision of TLS configuration throughout Gateway API we would almost certainly choose a more descriptive name that was more clearly tied to Frontend TLS. Unfortunately we're too late to change that name, but we can try to make the rest of the terminology clearly tied to frontend or backend TLS.

One suggestion by @candita would be to introduce a Listener like resource for BackendTLS, resulting in a more consistent naming scheme within Gateway TLS configuration. Although it's not yet clear if we need this additional layer, we should reconsider it as we're further developing Backend TLS.

5. Configure TLS mode¶

Proposed Placement	Name	Status
Gateway Listener	`Listener.TLS.Mode`	GA

Rationale¶

Similarly to the broader TLS termination segment, this field is already finalized in the API. In hindsight, along with a different naming of Listener.TLS, we could have moved this field out of the broader frontend TLS configuration, as it is not bound to it.

How the TLS configuration is affected¶

The TLS mode affects how the frontend TLS should be configured. In case Terminate is set, the Listener.TLS.CertificateRefs field has to be populated, as the connection is intended to be terminated at the Gateway level, while for Passthrough, the Certificate does not need to be provided, as the TLS termination is handled by the backend.

Routes and TLS¶

Multiple routes can be attached to listeners specifying TLS configuration. This section intends to clearly state how and when Routes can attach to listeners with TLS configuration.

Context¶

The *Route objects refer the Gateway (or a specific Listener of the Gateway) they want to be attached to. A successful attachment can be granted by one of the two following conditions:

the Listener.AllowedRoutes field allows that specific Route GroupKind to be attached to it, or
the Listener.Protocol field allows that specific Route to be attached to the Listener. This applies only in case the Gateway's AllowedRoutes field is not set.

In case the First condition is not satisfied (the Route references a Gateway that does not explicitly allow such an attachment), the Route Accepted condition is set to False with reason NotAllowedByListeners. In case the Route references a Gateway with no Listener matching the protocol needed by the Route, the Route ResolvedRefs condition is set to False with reason UnsupportedProtocol.

What Routes are allowed by Listeners with TLS¶

The following is a summary of all the Routes and the TLS termination mode they support, along with the compatible protocol.

Protocol	Routes	TLS Terminate	TLS Passthough
HTTP	`HTTPRoute`/`GRPCRoute`	no	no
HTTPS	`HTTPRoute`/`GRPCRoute`	yes	no
TLS	`TLSRoute`	yes	yes
TCP	`TCPRoute`	yes	no
UDP	`UDPRoute`	no	no

[!NOTE] When the traffic is routed to the backend via a listener configured with TLS Passthrough and a compatible route, the packets are left untouched by the gateway. In order to terminate the TLS connection to the gateway and forward the traffic unencrypted to the backend, a listener configured with TLS Terminate and a compatible route must be used.