[feature request] add possibility to ignore differences in pod image in sync process #2622

Open · martin31821 opened this issue May 3, 2024 · 4 comments

@martin31821

Please answer some short questions which should help us to understand your problem / question better:

  • Which image of the operator are you using? registry.opensource.zalan.do/acid/postgres-operator:v1.11.0
  • Where do you run it - cloud or metal? Kubernetes or OpenShift? AWS EKS
  • Are you running Postgres Operator in production? yes
  • Type of issue? Bug report/feature request

We are experiencing the same issue as #1397, #2453, and #1955, and I'd like to propose a fix for it.
For us it is company policy to have every Docker image we use mirrored in our own private registry. We do this by running a Kubernetes mutating webhook that pulls the Docker images, pushes them to our own registry, and then swaps out the image reference on pod creation.

As a result, our pods always run a different image than the one set in the StatefulSet, so postgres-operator kills and recreates all of our clusters every sync interval.

I'd like to propose a new configuration option, ignoreImageDifference, which should default to false to keep the current behavior. When set to true, the sync process ignores differences between the image running in a pod and the image in the StatefulSet.
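
A minimal, self-contained sketch of the proposed semantics; the needsRecreate helper and the image names are illustrative assumptions, not actual operator code:

```go
package main

import "fmt"

// needsRecreate models the sync check that compares the image running in a
// pod against the image of the desired StatefulSet. The ignoreImageDifference
// parameter models the proposed option: when true, image drift alone never
// triggers a pod recreation.
func needsRecreate(podImage, stsImage string, ignoreImageDifference bool) bool {
	if podImage == stsImage {
		return false
	}
	return !ignoreImageDifference
}

func main() {
	// hypothetical images: the mutating webhook rewrote the pod image to our mirror
	podImage := "registry.example.internal/mirror/acid/spilo:latest"
	stsImage := "registry.opensource.zalan.do/acid/spilo:latest"

	fmt.Println(needsRecreate(podImage, stsImage, false)) // true: current behavior, pod gets recreated
	fmt.Println(needsRecreate(podImage, stsImage, true))  // false: the difference is tolerated
}
```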

@Jasper-Ben

FWIW, we currently work around this by setting enable_lazy_spilo_upgrade=true, thus not triggering a pod recreation; see:

```go
if len(podsToRecreate) == 0 && !c.OpConfig.EnableLazySpiloUpgrade {
	// even if the desired and the running statefulsets match
	// there still may be not up-to-date pods on condition
	// (a) the lazy update was just disabled
	// and
	// (b) some of the pods were not restarted when the lazy update was still in place
	for _, pod := range pods {
		effectivePodImage := getPostgresContainer(&pod.Spec).Image
		stsImage := getPostgresContainer(&desiredSts.Spec.Template.Spec).Image
		if stsImage != effectivePodImage {
			if err = c.markRollingUpdateFlagForPod(&pod, "pod not yet restarted due to lazy update"); err != nil {
				c.logger.Warnf("updating rolling update flag failed for pod %q: %v", pod.Name, err)
			}
			podsToRecreate = append(podsToRecreate, pod)
		} else {
			role := PostgresRole(pod.Labels[c.OpConfig.PodRoleLabel])
			if role == Master {
				continue
			}
			switchoverCandidates = append(switchoverCandidates, util.NameFromMeta(pod.ObjectMeta))
		}
	}
}
```
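
(For anyone replicating this workaround: assuming a default deployment, the option is set through the operator configuration, i.e. enable_lazy_spilo_upgrade: "true" in the operator ConfigMap, or the corresponding enable_lazy_spilo_upgrade key in the OperatorConfiguration CRD.)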

@FxKu (Member) commented May 17, 2024

Can you not specify the mirrored image in the global configuration? Not sure if I'm getting it.
For delaying pod replacement on image differences you've already found the right option: enable_lazy_spilo_upgrade

@FxKu added the question label May 17, 2024
@Jasper-Ben commented May 17, 2024

Can you not specify the mirrored image in the global configuration? Not sure if I'm getting it. For delaying pod replacement on image differences you've already found the right option: enable_lazy_spilo_upgrade

👋 @FxKu, to explain our workflow:

We use https://estahn.github.io/k8s-image-swapper/v1.5/index.html, which automatically mirrors the container images of upcoming pods into our own registry using a well-known naming scheme. The next time the same container image is used in an upcoming pod, image-swapper notices that the image already exists in our registry and transparently replaces the image reference via a mutating webhook. This ensures images are always available independently of the various upstream registries (in the past we had downtime due to an unfortunate combination of an unavailable upstream registry and newly spawned autoscaling instances that did not have the required image cached).
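
To illustrate the mechanism, here is a minimal sketch, not k8s-image-swapper's actual code; the mirror registry and naming scheme are assumptions. The webhook answers pod admission requests with an RFC 6902 JSONPatch that rewrites each container image to its mirrored counterpart:

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// hypothetical mirror registry; k8s-image-swapper's naming scheme may differ
const mirrorRegistry = "registry.example.internal/mirror"

// patchOp is a single RFC 6902 JSONPatch operation, the format mutating
// admission webhooks use to modify objects.
type patchOp struct {
	Op    string `json:"op"`
	Path  string `json:"path"`
	Value string `json:"value"`
}

// mirrorImagePatches rewrites each container image to the mirror registry,
// keeping the original repository path and tag so the mapping stays
// deterministic (the "well-known naming scheme" mentioned above).
func mirrorImagePatches(images []string) []patchOp {
	patches := make([]patchOp, 0, len(images))
	for i, img := range images {
		if strings.HasPrefix(img, mirrorRegistry) {
			continue // already mirrored, nothing to do
		}
		patches = append(patches, patchOp{
			Op:    "replace",
			Path:  fmt.Sprintf("/spec/containers/%d/image", i),
			Value: fmt.Sprintf("%s/%s", mirrorRegistry, img),
		})
	}
	return patches
}

func main() {
	// sample image taken from this issue's environment description
	p, _ := json.MarshalIndent(mirrorImagePatches([]string{
		"registry.opensource.zalan.do/acid/postgres-operator:v1.11.0",
	}), "", "  ")
	fmt.Println(string(p))
}
```

Since this runs cluster-wide, the StatefulSet spec (written by the operator) and the pod spec (mutated by the webhook) permanently disagree on the image, which is exactly the drift this issue is about.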

Using the webhook rather than hardcoding the mirror image into the Helm values has the advantage that it solves the availability issue at scale throughout our entire cluster, without any extra steps during workload setup.

As an added bonus, since we do IaC, we can use Renovate bot to easily manage dependencies without having to jump through any hoops due to changed image specifications.

@Jasper-Ben commented May 17, 2024

For delaying pod replacement on image differences you've already found the right option: enable_lazy_spilo_upgrade

Jep, this works for us, but it still isn't ideal since we now have to roll out updates manually. Thinking about it, though, I'm not sure it's even possible to have both using the operator 😅
