Vector operator for Kubernetes #768

tlvenn · 2019-08-19T01:33:00Z

Hi,

Not an issue strictly with vector but hoping to spark some ideas regarding the integration with k8s.
An operator similar to what https://github.com/banzaicloud/logging-operator does for fluent-bit would be pretty awesome.

binarylogic · 2019-08-19T03:33:07Z

Thanks @tlvenn, the logging-operator project is very interesting. We'll dig in and see what we can do.

LucioFranco · 2019-10-18T19:50:09Z

Hi @tlvenn thanks for opening this. I did some research into this. This sounds like some that would be very helpful for our users.

From what I have been reading it looks like having a vector operator could provide a way to configure different Vector topologies running on top of kubernetes. We would be able to define a few CRD's that would allow users to express their Vector deployments ontop of kube with kube like configuration that integrates with kube very naturally.

Under the hood this Vector operator could deploy a daemonset of Vector onto each kube node. It could then supply each agent with a config that either allows vector to run in a distributed topology. On top of this, the operator could also then deploy a statefulset that will act as a central vector agent that allows the node agents to connect to itself and is in charge of providing a durable disk buffer and a way to output logs to some external service configuring TLS all the way. Also on top of this, this operator could provide a way to configure an entire log transform pipeline within the same cluster configuring kafka as an intermediate message broker. This would then provide the stream based topology setup.

An example of a simple deployment that fluentbit's logging operator does can be seen here. This configures fluentbit to write to s3 and shows that you can set up logging very easily and naturally.

Since this relates to how Vector is deployed, I think this might not belong in the current initial containers milestone. That said, I do think this could be extremely valuable to allow users to quickly get up and running with more complex vector deployments.

tlvenn · 2019-10-19T00:10:48Z

Yep that's pretty much it and the possibility are endless, beyond CRDs to describe the entire desired vector pipeline, the operator could also react to some annotations on a container to inject some logging vector sidecars but yes you definitely captured the gist of it.

This particular diagram illustrate best the whole idea:

tlvenn · 2019-10-19T00:14:27Z

Note that the operator approach also pave the way for some IoC where the application bundles some custom CRDs to declare its logging streams and how they should be routed. Very similarly to the ServiceMonitor CRD with the Prometheus operator.

tlvenn · 2020-05-21T18:04:32Z

Hi @binarylogic, was wondering where your current thinking is regarding the idea of having a K8S Operator to orchestrate / deploy vector ?

binarylogic · 2020-05-21T21:57:56Z

Hi @tlvenn, @MOZGIII should be able to answer that since he is heading up our k8s integration currently.

MOZGIII · 2020-05-21T22:46:22Z

Hey @tlvenn! I'm currently working on a new k8s integration outlined at this RFC: https://github.com/timberio/vector/blob/master/rfcs/2020-04-04-2221-kubernetes-integration.md
We don't yet have plans to implement Vector operator in the backlog, but we're working on the building blocks to enable that option for us in the future.

The scope of the Vector operator would be to use CRDs to describe and rollout complete logging topologies, i.e. deploy Vector for pod log collection as a DaemonSet on each node and to deploy another Vector as a Deployment to gather all the log streams from the whole cluster and do advanced processing. Another responsibility of the operator would be, effectively, to assemble Vector configuration from the CRDs (as opposed to user-supplied .toml configs). All that is very advanced functionality, and we're building the k8s integration with that in mind. We're currently working on prerequisites to make is possible.

Furthermore, instead of building our own operator, we might be able to work with https://github.com/banzaicloud/logging-operator to add Vector support there. We'll look into it after we complete the initial integration with k8s.

I'm personally looking forward to the scenario where apps can ship with their own logging configurations (i.e. transform specifications) as CRDs, and an operator dynamically configures Vector to process and ship the logs accordingly.

tlvenn · 2020-05-22T03:09:14Z

Awesome, thanks a lot for the feedback and looking forward for all of this as well !

aelbarkani · 2021-08-29T10:54:33Z

@binarylogic @MOZGIII @tlvenn I can start working on it to have a first version by the end of September.

jszwedko · 2021-08-30T13:59:19Z

cc/ @spencergilbert

spencergilbert · 2021-08-30T15:35:40Z

Hey @aelbarkani - I'm curious what you're looking to get out of an operator here. Is it going to solve a problem that the existing helm charts aren't?

aelbarkani · 2021-08-30T19:12:01Z

Hey @spencergilbert. Logging operator of Banzaicloud is a good example. We run multi-tenant clusters where each application team can have isolated namespaces in a cluster. Application teams have limited access to the cluster (they don't have access to cluster-wide resources nor privileged containers). And usually they don't care, the only thing the application teams want is to be admin in their namespaces, and leave the cluster management hassle to infrastructure team.
In a Kubernetes cluster Vector agents are privileged, and thus end users should not have access to them. However, the users would need some sort of API (a Kubernetes Custom Resource) in order to request sources, transforms and sinks. Basically the idea is to be able to offer to our users a fully managed Vector instance in multi-tenant Kubernetes clusters, and I don't think that would be possible with a Helm chart.

spencergilbert · 2021-08-30T20:46:31Z

@aelbarkani - would you be interested in having a call with us to discuss your plans and thoughts around the operator more fully? If you do, you can email me at spencer.gilbert (at) datadoghq.com

tlvenn · 2021-08-31T00:15:45Z

Hi @aelbarkani , please take a moment to read the material and articles linked above, the value props of the vector operator should be pretty apparent.

aelbarkani · 2021-08-31T00:53:57Z

Hi @tlvenn. I must say that value proposition of an operator is not always straightforward. In this case it is, and I can say that since we've been using Banzaicloud's operator in dozens of clusters in production and developed many for our clients. Now, my comment was about one use case (the one that I'm interested in), but of course there are many others. Don't hesitate to explain a little bit more your use cases.

aelbarkani · 2021-08-31T00:56:06Z

@spencergilbert yep, just sent you an email !

tlvenn · 2021-08-31T01:01:18Z

Dho my comment was for @spencergilbert , I made a mistake mentioning you 🤦

aelbarkani · 2021-08-31T01:19:57Z

@tlvenn no worries !

tmckeon · 2022-07-08T18:08:07Z

Hi All

I would just like to add my support for this issue. It would be great if dev teams could configure vector transforms by themselves.

xinbinhuang · 2022-10-16T19:36:34Z

Hi all,

I want to points out there is another use case to better integrate or replace prometheus for metrics collection. Currently, a lot of helm charts ship with PodMonitor/ServiceMonitor to allow scraping by Prometheus. However, Vector can't utilize these directly. It would be awesome if the vector operator can monitor existing PodMonitor/ ServiceMonitor to scrape metrics when configured to do so.

zvlb · 2022-11-03T12:03:40Z

Hi. Please try this one - https://github.com/kaasops/vector-operator
We released it for deploying and configuring Vector in Kubernetes. (Like how Logging Operator does it, but with some differences).

You can use CRDs:
Vector - for deploy Vector instance
VectorPipeline - for deploy sources/transforms/sinks in namespace scope
ClusterVectorPipeline - for deploy sources/transforms/sinks in cluster scope

vectordotdev/vector#768 https://github.com/kaasops/vector-operator/blob/main/helm/index.yaml

nabokihms · 2023-01-21T18:57:20Z

Deckhouse Kubernetes Platform has a log-shipper module, which is basically an operator that is built around vector.

Simple configuration example:

apiVersion: deckhouse.io/v1alpha1
kind: ClusterLoggingConfig
metadata:
  name: system-logs
spec:
  type: KubernetesPods
  kubernetesPods:
    namespaceSelector:
      matchNames:
        - kube-system
  destinationRefs:
    - loki-storage
---
apiVersion: deckhouse.io/v1alpha1
kind: ClusterLogDestination
metadata:
  name: loki-storage
spec:
  type: Loki
  loki:
    endpoint: http://loki.loki:3100

tlvenn mentioned this issue Aug 19, 2019

Kubernetes integration #260

Closed

6 tasks

binarylogic added meta: idea Anything in the idea phase. Needs further discussion and consensus before work can begin. Type: New Feature labels Aug 19, 2019

binarylogic assigned ktff Aug 27, 2019

binarylogic added this to the Initial containers support milestone Sep 7, 2019

binarylogic added the needs: approval Needs review & approval before work can begin. label Sep 23, 2019

LucioFranco unassigned ktff Oct 18, 2019

LucioFranco removed this from the Initial containers support milestone Oct 18, 2019

LucioFranco mentioned this issue Oct 22, 2019

Add deployment documentation around kubernetes #1070

Closed

2 tasks

binarylogic added type: feature A value-adding code addition that introduce new functionality. and removed type: new feature labels Jun 16, 2020

binarylogic added have: nice This feature is nice to have. It is low priority. platform: kubernetes Anything `kubernetes` platform related domain: administration Anything related to administration/operation domain: setup Anything related to setting up or installing Vector labels Aug 7, 2020

binarylogic added the needs: more demand Needs more demand before work can begin, +1 or comment to support. label Jan 18, 2021

binarylogic mentioned this issue Apr 30, 2021

Build a Vector Kubernetes operator #7282

Closed

vectordotdev deleted a comment from JeanMertz Apr 30, 2021

spencergilbert mentioned this issue Jul 25, 2022

Support operator to inject sidecar? vectordotdev/helm-charts#231

Closed

metacoma added a commit to metacoma/mindwm that referenced this issue Dec 29, 2022

Add vector operator

c14cc6a

vectordotdev/vector#768 https://github.com/kaasops/vector-operator/blob/main/helm/index.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vector operator for Kubernetes #768

Vector operator for Kubernetes #768

tlvenn commented Aug 19, 2019

binarylogic commented Aug 19, 2019

LucioFranco commented Oct 18, 2019

tlvenn commented Oct 19, 2019

tlvenn commented Oct 19, 2019

tlvenn commented May 21, 2020

binarylogic commented May 21, 2020

MOZGIII commented May 21, 2020 •

edited

Loading

tlvenn commented May 22, 2020

aelbarkani commented Aug 29, 2021 •

edited

Loading

jszwedko commented Aug 30, 2021

spencergilbert commented Aug 30, 2021 •

edited

Loading

aelbarkani commented Aug 30, 2021 •

edited

Loading

spencergilbert commented Aug 30, 2021

tlvenn commented Aug 31, 2021 •

edited

Loading

aelbarkani commented Aug 31, 2021

aelbarkani commented Aug 31, 2021

tlvenn commented Aug 31, 2021

aelbarkani commented Aug 31, 2021

tmckeon commented Jul 8, 2022

xinbinhuang commented Oct 16, 2022

zvlb commented Nov 3, 2022

nabokihms commented Jan 21, 2023

Vector operator for Kubernetes #768

Vector operator for Kubernetes #768

Comments

tlvenn commented Aug 19, 2019

binarylogic commented Aug 19, 2019

LucioFranco commented Oct 18, 2019

tlvenn commented Oct 19, 2019

tlvenn commented Oct 19, 2019

tlvenn commented May 21, 2020

binarylogic commented May 21, 2020

MOZGIII commented May 21, 2020 • edited Loading

tlvenn commented May 22, 2020

aelbarkani commented Aug 29, 2021 • edited Loading

jszwedko commented Aug 30, 2021

spencergilbert commented Aug 30, 2021 • edited Loading

aelbarkani commented Aug 30, 2021 • edited Loading

spencergilbert commented Aug 30, 2021

tlvenn commented Aug 31, 2021 • edited Loading

aelbarkani commented Aug 31, 2021

aelbarkani commented Aug 31, 2021

tlvenn commented Aug 31, 2021

aelbarkani commented Aug 31, 2021

tmckeon commented Jul 8, 2022

xinbinhuang commented Oct 16, 2022

zvlb commented Nov 3, 2022

nabokihms commented Jan 21, 2023

MOZGIII commented May 21, 2020 •

edited

Loading

aelbarkani commented Aug 29, 2021 •

edited

Loading

spencergilbert commented Aug 30, 2021 •

edited

Loading

aelbarkani commented Aug 30, 2021 •

edited

Loading

tlvenn commented Aug 31, 2021 •

edited

Loading