Introduce (Cluster)EventSource CRD to subscribe to CloudEvents emitted by KEDA #3533

tomkerkhove · 2022-08-09T07:41:02Z

Use-Case

Goal

Allow end-users to subscribe to events emitted by KEDA allowing them to gain insights into what is going on and how their applications are being scaled.

These should serve several purposes such as:

Allow for autoscaling awareness (scaling)
Allow for automated notifications for incidents (errors, misconfiguration)
Allow for extensibility to enable end-users and our community to build on top of KEDA

Events that are being emitted must be CloudEvent-compliant and support pushing to destination endpoints inside/outside of the cluster.

With this proposal, our intention is to extend the event capabilities of KEDA with CloudEvents.

From a high-level perspective, KEDA will be doing three things in this area:

Scale workloads across one or more namespaces
Provide Kubernetes events inside the cluster that can be used through typical tooling (CLI, UIs, …)
Emit events to endpoints inside/outside the cluster that are CloudEvents-compliant

What are we doing

With the introduction of CloudEvents, we are targeting two audiences:

Cluster operators who are interested to subscribe to events across all namespaces to have a holistic overview of everything going on in the cluster
App developers who want to subscribe to events of their own application by using. This allows them to reduce the noise of other teams and save compute for filtering them out

With the introduction of EventSource & ClusterEventSource CRD, we cover both scenarios and give end-users the controls that they need. The goal is that they can define an endpoint, authentication, and optional filter to which KEDA will emit its events.

Similar to TriggerAuthentication & ClusterTriggerAuthentication CRDs, the idea is that EventSource is scoped to a single namespace while ClusterEventSource is cluster-wide.

End-users can optionally define event types they want to exclude so that the destination will never receive them.

While the event subscription configuration opens up a lot of opportunities, we should strive to keep the control as minimal as possible so that we don’t implement our own eventing engine.

End-users who need more robust filtering capabilities have to use another tool that is a better fit in this scenario.

Do we need a new CRD?

In order to build a scalable way of emitting events, across various teams and parties it is not an option to configure an endpoint directly on our KEDA control plane, but rather rely on a new CRD.

Otherwise, it would quickly become a bottleneck for end-users that have numerous namespaces and want to have more control over who is allowed to receive what events:

Cluster admins/operators will want to have events for scaling in all namespaces
App devs/operators will only be interested in the events for their applications

While every team could do the filtering on their own, this is a waste of compute.

Requirements

Introduce a new (Cluster)EventSource CRD that allows people to subscribe for CloudEvents:

apiVersion: events.keda.sh/v1alpha1
kind: EventSource # Or ClusterEventSource
metadata:
  name: operations-cross-cluster-events
spec:
  destination:
    # Support regular webhook endpoints over HTTP(S)
    http:
      uri: http://foo.bar
      authentication:
        apiKey:
          headerName: x-api-key
          valueFrom:
            secretKeyRef:
            name: secrets-operations-events
            key: webhook-api-key
  eventSubscription:
    includedEventTypes:
      - keda.example.Event
    excludedEventTypes:
      - keda.example.Event

Once these are created by end-users; KEDA will automatically push the events to the configured sink(s).

Issues for events are created separately.

Anything else?

Relates to #479

The text was updated successfully, but these errors were encountered:

tomkerkhove · 2022-08-09T07:42:10Z

@kedacore/keda-maintainers I don't have a preference on the implementation - We can start doing things in our current operator and later on introduce a dedicated container (if we really have to)

tomkerkhove · 2023-09-05T06:27:48Z

@zroubalik @JorTurFer We had some discussion on what the best model for source & subject is so I reached out to @duglin and landed on this:

source: Who is emitting the event? In our case KEDA and more specifically it should be the CRD that subscribes to it <cluster-name>/<namespace>/keda (<cluster-name>/keda if it's clustercloudevent)
subject: For what/whom was the event emitted? In most of the cases this is the workload or so that we are scaling cluster/namespace/workload/resource-name

Make sense?

SpiritZhou · 2023-09-05T07:01:51Z

Hi all, here's the high-level idea of how I'd like to implement CloudEvents in KEDA:

To acheive this, we'll need to

Introduce new CRD and start watching it
Refactor current event emitting and add internal adapter in code to handle normal k8s event emitting and CloudEvent emitting.

Operations

To help operate this at scale, we should offer new Prometheus & OTEL metrics:

keda_event_emitted_error_totals - Provides an indication of all the errors related to pushing events, per event sink
keda_event_emitted_totals - Provides an indication of all the events that have been emitted, per event sink
keda_event_sinks_totals - Provides an indication of all the event sinks created, per event sink type, per type (namespace/cluster)
keda_event_queue_status - Provides an indication of how many events are droped or still queue

Proposed Action plan Estimation:

The proposal is to implement the whole scope in multiple phases to be more agile and merge changes faster.

For the MVP, it feels best to implement the following features with one event example:

Introduce new CRD
Implement the logic of emitting CloudEvent.
Emit event when authentication fails

The following features can be implemented with follow-up PRs (in order):

Add Prometheus & OTEL metrics about CloudEvent
Filter events in KEDA side
Support Azure Event Grid
Introduce ClusterCloudEvent
Fulfill all current events to be emitted.

tomkerkhove · 2023-09-05T07:19:43Z

LGTM

JorTurFer · 2023-09-12T07:23:00Z

LGTM but I have one question. How are we going to take the cluster name? I mean, IIRC we don't have that info inside the pod, we have to request it to the users (it's not a problem IMHO, but just to be sure)

tomkerkhove · 2023-09-12T07:28:32Z

That's a valid question but no need to worry. This is configurable and when not specified "default" will be used (AFAIK, need to check design doc)

zroubalik · 2023-09-13T09:40:18Z

Hi, sorry for the delay. I like the proposal and the proposed direction. Great job @SpiritZhou and @tomkerkhove!

tomkerkhove · 2023-09-13T14:22:59Z

All the work was done by @SpiritZhou :)

SpiritZhou · 2023-09-21T08:50:30Z

There may be a scenario where the user needs to send one CloudEvent to multiple destinations, and there are two possible solutions:

Let the user create multiple CloudEvent resources, with each resource having only one destination.
Change the destination specification in CloudEvent from an object type to an array type so that the user can add different destinations in one CloudEvent resource.

I think it would be convenient for users to create one CloudEvent resource, but I am not sure if there are any drawbacks or if the concept of CloudEvents would be better served by creating multiple CloudEvent resources. What do you think? Which one is better? @tomkerkhove @JorTurFer @zroubalik

tomkerkhove · 2023-09-21T08:55:29Z

There is actually a 3rd option which is what i originally had in mind:

Allow to use multiple destinations, but only 1 per type (ie HTTP and to Azure Event Grid)

I personally think creating 1 resource per scenario (aka send all events when auth fails for my app) is what we should strive for and avoid having monolithical subscriptions. That's why I prefer to keep the model simple and have multiple event source resources.

Does that mean I want to avoid using an array? No. But if that is at the cost of having a good schema which we can validate against then I'd say yes.

I'd love to know what business scenario there is for creating 1 EventSource resource that pushes to 3 HTTP endpoints?

JorTurFer · 2023-09-22T21:08:21Z

I'm not an expert in CloudEvents (and neither a noob, I'm a real ignorant), so maybe I'm saying something stupid, but where is the problem of supporting multiple CloudEvent resources with multiple targets of the same type?
Wouldn't I want to emit the event to multiple receivers for different stuffs? I see this (with a lot of imagination) as an audit trail about what has happened in KEDA. I see this as something that different departments could configure for different stuff as self-service. Maybe SRE team wants some events on any targets, and security team wants other events on other target, etc

I'm not saying that we have to support it now (we could just wait to get feedback before doing a lot of things), it's just a question. Maybe in the beginning we could start with a single CloudEvent with multiple targets or even with a single CloudEvent with a single target to get feedback from end users

JorTurFer · 2023-09-22T22:20:40Z

I have also reviewed the PR and I'm curious about how you will choose between HTTP and Event Grid. Will you include the extra CRD that @tomkerkhove proposed for it? Will it be part of the current CRD?
Once again, from my absoluto ignorance, IDK how many different targets we would have, but if there are several of them, maybe another CRD for defining them instead of adding all the options to current is more flexible

tomkerkhove · 2023-09-23T08:38:47Z

I see this as something that different departments could configure for different stuff as self-service. Maybe SRE team wants some events on any targets, and security team wants other events on other target, etc

Yes, and because it's owned by different departments they should create their own CRD instance and not bundle it in to a single "subscription" in my opinion. That's why for me 1 subscription is linked to 1 destination (or at least of the same type IMO). If you need multiple destinations, then it's because that's for a difference scenario and thus a separate resource, but that's just my opinion.

I have also reviewed the PR and I'm curious about how you will choose between HTTP and Event Grid. Will you include the extra CRD that @tomkerkhove proposed for it? Will it be part of the current CRD? Once again, from my absoluto ignorance, IDK how many different targets we would have, but if there are several of them, maybe another CRD for defining them instead of adding all the options to current is more flexible

The current PR is not according to what we specced out and I believe @SpiritZhou is going to update the PR to align with it but he's waiting for our outcome on the single or multiple destination per type.

The original proposal which we agreed on is this:

apiVersion: events.keda.sh/v1alpha1
kind: EventSource # Or ClusterEventSource
metadata:
  name: operations-cross-cluster-events
spec:
  destination:
    http:
      uri: http://foo.bar/
      authentication:
        apiKey:
          headerName: x-api-key
          valueFrom:
            secretKeyRef:
            name: secrets-operations-events
            key: webhook-api-key
    azureEventgrid:
      topicEndpoint: https://{resource-name}.{region}.eventgrid.azure.net/api/events # Mandatory
      authentication: # End-users must use accessKey or activeDirectory
        accessKey:
          # Allow end-users to pull information from Kubernetes secret or from TriggerAuthentication resources
          valueFrom:
            secretKeyRef:
              name: secrets-operations-events
              key: webhook-api-key
            triggerAuthenticationRef:
              name: trigger-auth-sample
              parameterName: eventGridAuth
        activeDirectory:
          tenantId: xyz
          clientApplication:
            id: ABC
            secret:
              valueFrom:
                secretKeyRef:
                  name: secrets-operations-events
                  key: webhook-api-key
                triggerAuthenticationRef:
                  name: trigger-auth-sample
                  parameterName: eventGridAuth
          managedIdentity:
            valueFrom:
              triggerAuthenticationRef:
                name: trigger-auth-sample
            key: webhook-api-key
  eventSubscription:
    includedEventTypes:
      - keda.example.Event
    excludedEventTypes:
      - keda.example.Event

This allows end-users to use HTTP and/or Azure Event Grid, but only 1 destination and not an array.

The thing that is now brought up is:

Do we allow end-users to use HTTP, Azure Event Grid and future destinations to be mixed in 1 CRD instance?
Do we allow multiple destinations of the same type? (ie multiple HTTP endpoints)

Personally I'd say no to both and they need separate EventSource instances so that we can offer proper error metric per resource, log per resource, proper status on CRD, etc.

But curious about your thoughts @zroubalik & @JorTurFer.

JorTurFer · 2023-09-23T14:47:23Z

I think that we can start with 1-1 and over it, iterate. I mean, currently we can support 1 destination per EventSource, and then based on feedback, we could update it to support multiple destinations or not.

I have a question here. Does it make sense to have another CRD for destinations and link to it in the EventSource? I mean, could a situation like a cluster admin setting allowed destinations (and teams just using them inside their EventSources) be a real world scenario or it doesn't apply here?

tomkerkhove · 2023-09-23T15:25:14Z

That's a valid question, but instead I'd introduce crd in the future then for cluster admiks to define what is (not) allowed and use the for event source validation going forward.

I think that is nicer rather than adding more resources because of destinations?

zroubalik · 2023-10-10T14:26:54Z

These are valid points, we should think about what is the main usecase though for mutliple destinations.

Is it that admin would like to create a different types of eventSubscription and for each type, he would like to add multiple destinations? If so, then 1 CRD is probably better than creating a separate CRD for each relation.

But I agree that we can start with 1-1 relation, but don't block ourselves on adding more in the future (in this case we shouldn't probably change the CRD spec, ie going from a single field to array).

tomkerkhove · 2023-10-12T12:36:01Z

Based on @JorTurFer's remarks above and a chat I've had with @zroubalik we've agreed to go with:

CloudEventSource instead of EventSource in case a new format/pattern comes in the future
Allow 1 destination per type, and not use an array (ie 1 HTTP destination, 1 for Azure Event Grid, 1 for AWS/GCP/...)

I have a question here. Does it make sense to have another CRD for destinations and link to it in the EventSource? I mean, could a situation like a cluster admin setting allowed destinations (and teams just using them inside their EventSources) be a real world scenario or it doesn't apply here?

We'll start simple and will evaluate this if it comes up in the future

tomkerkhove · 2023-12-11T06:58:18Z

This is done

tomkerkhove · 2023-12-11T07:08:33Z

Re-opening to better list events supported in our docs

tomkerkhove · 2024-01-19T10:44:08Z

@SpiritZhou did we add cluster-wide CRD already or should we open a separate issue for this?

tomkerkhove · 2024-02-26T07:30:12Z

@SpiritZhou did we add cluster-wide CRD already or should we open a separate issue for this?

@SpiritZhou any update on this?

neelanjan00 · 2024-03-16T08:09:08Z

Any ETA on this? I'd like to contribute to a few of the CloudEvents integrations, is this issue a blocker for them?

tomkerkhove · 2024-03-16T12:26:19Z

What are you planning on contributing? The crds are already in

neelanjan00 · 2024-03-16T15:02:47Z

What are you planning on contributing? The crds are already in

I can start with this one: #3527
It looks easy to begin with 😅

tomkerkhove · 2024-03-18T09:00:08Z

Sure, you should be able to get started - Thanks @neelanjan00!

tomkerkhove added needs-discussion feature-request All issues for new features that have not been committed to labels Aug 9, 2022

tomkerkhove modified the milestone: CloudEvents - Initial version Aug 9, 2022

tomkerkhove added planning:microsoft-engineering and removed planning:microsoft-engineering labels May 23, 2023

tomkerkhove assigned SpiritZhou Sep 5, 2023

SpiritZhou mentioned this issue Sep 13, 2023

feat: Introduce CloudEvents to KEDA #4968

Merged

6 tasks

tomkerkhove mentioned this issue Sep 21, 2023

Introduce CloudEvent to KEDA kedacore/keda-docs#1227

Merged

1 task

tomkerkhove mentioned this issue Nov 23, 2023

feat: Introduce CloudEventSources CRD and adding ClusterName parameter kedacore/charts#572

Merged

3 tasks

tomkerkhove closed this as completed Dec 11, 2023

tomkerkhove reopened this Dec 11, 2023

This was referenced Dec 11, 2023

Update supported CloudEvents list kedacore/keda-docs#1275

Merged

Fix: Update CloudEventType to correct format #5277

Merged

SpiritZhou mentioned this issue Jan 23, 2024

Introduce Filter CloudEvents Feature #5424

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce (Cluster)EventSource CRD to subscribe to CloudEvents emitted by KEDA #3533

Introduce (Cluster)EventSource CRD to subscribe to CloudEvents emitted by KEDA #3533

tomkerkhove commented Aug 9, 2022 •

edited

tomkerkhove commented Aug 9, 2022

tomkerkhove commented Sep 5, 2023

SpiritZhou commented Sep 5, 2023 •

edited by tomkerkhove

tomkerkhove commented Sep 5, 2023

JorTurFer commented Sep 12, 2023

tomkerkhove commented Sep 12, 2023

zroubalik commented Sep 13, 2023

tomkerkhove commented Sep 13, 2023

SpiritZhou commented Sep 21, 2023

tomkerkhove commented Sep 21, 2023

JorTurFer commented Sep 22, 2023

JorTurFer commented Sep 22, 2023 •

edited

tomkerkhove commented Sep 23, 2023 •

edited

JorTurFer commented Sep 23, 2023

tomkerkhove commented Sep 23, 2023

zroubalik commented Oct 10, 2023

tomkerkhove commented Oct 12, 2023

tomkerkhove commented Dec 11, 2023

tomkerkhove commented Dec 11, 2023

tomkerkhove commented Jan 19, 2024

tomkerkhove commented Feb 26, 2024

neelanjan00 commented Mar 16, 2024

tomkerkhove commented Mar 16, 2024

neelanjan00 commented Mar 16, 2024

tomkerkhove commented Mar 18, 2024

Introduce (Cluster)EventSource CRD to subscribe to CloudEvents emitted by KEDA #3533

Introduce (Cluster)EventSource CRD to subscribe to CloudEvents emitted by KEDA #3533

Comments

tomkerkhove commented Aug 9, 2022 • edited

Use-Case

Goal

What are we doing

Do we need a new CRD?

Requirements

Anything else?

tomkerkhove commented Aug 9, 2022

tomkerkhove commented Sep 5, 2023

SpiritZhou commented Sep 5, 2023 • edited by tomkerkhove

Operations

Proposed Action plan Estimation:

tomkerkhove commented Sep 5, 2023

JorTurFer commented Sep 12, 2023

tomkerkhove commented Sep 12, 2023

zroubalik commented Sep 13, 2023

tomkerkhove commented Sep 13, 2023

SpiritZhou commented Sep 21, 2023

tomkerkhove commented Sep 21, 2023

JorTurFer commented Sep 22, 2023

JorTurFer commented Sep 22, 2023 • edited

tomkerkhove commented Sep 23, 2023 • edited

JorTurFer commented Sep 23, 2023

tomkerkhove commented Sep 23, 2023

zroubalik commented Oct 10, 2023

tomkerkhove commented Oct 12, 2023

tomkerkhove commented Dec 11, 2023

tomkerkhove commented Dec 11, 2023

tomkerkhove commented Jan 19, 2024

tomkerkhove commented Feb 26, 2024

neelanjan00 commented Mar 16, 2024

tomkerkhove commented Mar 16, 2024

neelanjan00 commented Mar 16, 2024

tomkerkhove commented Mar 18, 2024

tomkerkhove commented Aug 9, 2022 •

edited

SpiritZhou commented Sep 5, 2023 •

edited by tomkerkhove

JorTurFer commented Sep 22, 2023 •

edited

tomkerkhove commented Sep 23, 2023 •

edited