
Trivy operator behaviour when containers are injected into deployment pods by admission controllers #1745

Open
andriktr opened this issue Jan 5, 2024 · 15 comments
Labels
kind/feature Categorizes issue or PR as related to a new feature. priority/backlog Higher priority than priority/awaiting-more-evidence. target/kubernetes Issues relating to kubernetes cluster scanning

Comments

@andriktr

andriktr commented Jan 5, 2024

Hello,
When we run a deployment in k8s, trivy-operator looks at the ReplicaSet to get the images for vulnerability scanning. The problem I see is that this does not cover cases where a mutating admission controller injects containers into the pod. A good example is injecting service mesh sidecars into an application: one or more containers are added to the pod, but the ReplicaSet normally remains unchanged and contains only the application image in its definition. As a result, the running pods contain the application containers plus the containers injected by admission controllers, and since trivy-operator looks only at the ReplicaSet, only the application containers are scanned.

Here is a grep for the images in the ReplicaSet:

[screenshot: kubectl output listing a single image]

As you can see, there is only one image.

And here we grep the images from the pods controlled by this ReplicaSet:

[screenshot: kubectl output listing three images]

As you can see, there are two additional images there: consul-dataplane:1.2.3 is the Consul service mesh sidecar container image, and consul-k8s-control-plane:1.2.3 is an init container image. Both of these images are injected into the pod by an admission controller, and neither is scanned by trivy-operator.

Any suggestions or opinions?

Thank you.

@andriktr andriktr added the kind/bug Categorizes issue or PR as related to a bug. label Jan 5, 2024
@chen-keinan
Collaborator

chen-keinan commented Jan 8, 2024

@andriktr that is a very good point. Is there a way to know that the service mesh has added sidecars, e.g. via labels or annotations?

@chen-keinan chen-keinan added kind/feature Categorizes issue or PR as related to a new feature. priority/backlog Higher priority than priority/awaiting-more-evidence. target/kubernetes Issues relating to kubernetes cluster scanning and removed kind/bug Categorizes issue or PR as related to a bug. labels Jan 8, 2024
@andriktr
Author

andriktr commented Jan 8, 2024

I believe this depends on the mesh solution, but typically you enable the service mesh by adding an annotation to the deployment/pod.

apiVersion: apps/v1
kind: Deployment
metadata:
  name: static-client
  namespace: consul-test
spec:
  replicas: 1
  selector:
    matchLabels:
      app: static-client
  template:
    metadata:
      name: static-client
      labels:
        app: static-client
      annotations:
        'consul.hashicorp.com/connect-inject': 'true'
....

However, it is also possible to enable injection by default for all workloads in the cluster, and I'm not sure whether the annotations are added in that case as well, but most probably yes.

@chen-keinan
Collaborator

I'm looking for an indicator which will help decide when to scan the pod and when to scan the ReplicaSet.

@andriktr
Author

andriktr commented Jan 8, 2024

It will probably be hard to have one indicator for all possible injection cases. Maybe it is possible to compare the ReplicaSet and the final pod: if the container count in the ReplicaSet equals the pod's container count, scan the ReplicaSet, otherwise scan the pod.

@alfsch

alfsch commented Feb 27, 2024

Comparing image names between the ReplicaSet and the final pods seems to be the only way to avoid trouble with sidecars injected by service meshes or mutating webhooks.
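The comparison suggested above could be sketched roughly like this (an illustrative Go sketch, not trivy-operator code; the static-client:1.0 image name is hypothetical, the Consul images are the ones from the screenshots in this thread):

```go
package main

import "fmt"

// extraImages returns the images present in the live pod but absent from the
// ReplicaSet template. A non-empty result suggests an admission controller
// injected containers, so the pod rather than the ReplicaSet should drive
// the scan.
func extraImages(replicaSetImages, podImages []string) []string {
	inTemplate := make(map[string]bool)
	for _, img := range replicaSetImages {
		inTemplate[img] = true
	}
	var extra []string
	for _, img := range podImages {
		if !inTemplate[img] {
			extra = append(extra, img)
		}
	}
	return extra
}

func main() {
	rs := []string{"static-client:1.0"} // image from the ReplicaSet template
	pod := []string{ // images observed on the running pod
		"static-client:1.0",
		"consul-dataplane:1.2.3",
		"consul-k8s-control-plane:1.2.3",
	}
	fmt.Println(extraImages(rs, pod))
	// → [consul-dataplane:1.2.3 consul-k8s-control-plane:1.2.3]
}
```

Comparing full image names rather than just container counts also catches the case where a webhook replaces an image instead of adding one.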

@chen-keinan
Collaborator

chen-keinan commented Mar 18, 2024

@andriktr @alfsch I have created fix #1917, which will still scan the app container when the sidecar is failing.
It's not perfect, but at least you'll get a vulnerability report for the main container.

Let me know if that is sufficient.

@andriktr
Author

andriktr commented Mar 18, 2024

Hmm... this issue is not about failing containers, but about the fact that a sidecar container is not scanned when it is injected by admission controller(s). Does your fix cover this as well?

@chen-keinan
Collaborator

chen-keinan commented Mar 18, 2024

> Hmm... this issue is not about failing containers, but about the fact that a sidecar container is not scanned when it is injected by admission controller(s). Does your fix cover this as well?

No, unfortunately. I thought it might help because, until now, if a pod had more than one container and one was failing while the other passed, you would not get any report for either container.

@alfsch

alfsch commented Mar 18, 2024

@andriktr the above-mentioned annotations and labels aren't added in all cases. If you use Kyverno image mutations, the mutation lives in a Kyverno rule resource and there is no annotation that can give a hint.
In the case of the Istio service mesh, injection can be enabled globally without any annotation or label, with a label on the namespace, or with a label on the workload's pod template.

@andriktr
Author

> @andriktr the above-mentioned annotations and labels aren't added in all cases. If you use Kyverno image mutations, the mutation lives in a Kyverno rule resource and there is no annotation that can give a hint. In the case of the Istio service mesh, injection can be enabled globally without any annotation or label, with a label on the namespace, or with a label on the workload's pod template.

This is actually obvious and depends on the solution. Actually, the most effective way would be to just query for the unique running images in a namespace and scan them, not rely on the ReplicaSet definition at all.

@chen-keinan
Collaborator

> This is actually obvious and depends on the solution. Actually, the most effective way would be to just query for the unique running images in a namespace and scan them, not rely on the ReplicaSet definition at all.

trivy-operator does not perform queries; it follows the operator pattern (event-based), reacting to every resource creation, update, and deletion.

@andriktr
Author

> trivy-operator does not perform queries; it follows the operator pattern (event-based), reacting to every resource creation, update, and deletion.

Then either the ReplicaSet and the final pod should be compared, or the images to scan should be taken from the pod only; if more than one pod runs the same image, the report should be deduplicated to avoid repeated information.

Alternatively, the operator behaviour could be changed to track the running images in a namespace instead of the running ReplicaSets.
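The namespace-tracking idea above could look roughly like this (an illustrative Go sketch using a minimal stand-in Pod type, not trivy-operator or client-go code; pod names and the static-client image are hypothetical, the Consul image is from this thread):

```go
package main

import (
	"fmt"
	"sort"
)

// Pod is a minimal stand-in for a Kubernetes Pod (illustrative only).
type Pod struct {
	Name   string
	Images []string // images of init + regular containers, post-mutation
}

// uniqueImages returns the sorted set of distinct images running in a
// namespace, regardless of which controller or webhook added them.
// Deduplication ensures one report per image, not one per pod replica.
func uniqueImages(pods []Pod) []string {
	set := make(map[string]struct{})
	for _, p := range pods {
		for _, img := range p.Images {
			set[img] = struct{}{}
		}
	}
	out := make([]string, 0, len(set))
	for img := range set {
		out = append(out, img)
	}
	sort.Strings(out)
	return out
}

func main() {
	pods := []Pod{
		{Name: "static-client-abc", Images: []string{"static-client:1.0", "consul-dataplane:1.2.3"}},
		{Name: "static-client-def", Images: []string{"static-client:1.0", "consul-dataplane:1.2.3"}},
	}
	fmt.Println(uniqueImages(pods))
	// → [consul-dataplane:1.2.3 static-client:1.0]
}
```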

@alfsch

alfsch commented Mar 18, 2024

@andriktr #1872 (comment) describes a case which also has to be handled. The truth is only in the pods running in a namespace, or in the relation between the higher-level workload descriptions (deployments/ReplicaSets/...) and their pods.

@jutley

jutley commented Mar 27, 2024

I have a couple of thoughts on how this could be handled. Like @andriktr, I am also missing scans on some images that only get added to pods via mutations.

The brute-force approach would be to watch all pods (I think this is already happening), get the set of images in them, and compare that set to the controller's (the ReplicaSet's, for example). If there are additional images in the pod's set, add them to the scan.

A slightly more elegant (but still incomplete) approach would be to leverage Kubernetes v1.28's support for sidecar containers. The mechanism is to run sidecar containers as init containers with restartPolicy set to Always. This capability is protected by a feature flag and is enabled by default in v1.29. However, many clusters won't have this capability, and even when they do, many mutating webhooks will not use it. This may be a good approach a year from now, but not for today.
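For reference, a native sidecar under Kubernetes 1.28+ is declared as an init container with restartPolicy: Always, which makes it visible in the workload spec itself and therefore scannable without pod diffing. An illustrative manifest, reusing the image names from this thread (static-client:1.0 is hypothetical):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: static-client
spec:
  initContainers:
    - name: consul-dataplane
      image: consul-dataplane:1.2.3
      restartPolicy: Always   # marks this init container as a native sidecar
  containers:
    - name: static-client
      image: static-client:1.0
```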

@chen-keinan
Collaborator

> A slightly more elegant (but still incomplete) approach would be to leverage Kubernetes v1.28's support for sidecar containers.

@jutley thanks for this input. I'll have a look to see how it fits our operator.
