
panic: interface conversion: interface {} is cache.DeletedFinalStateUnknown, not *v1.Service #10015

Open
mintcckey opened this issue May 30, 2023 · 10 comments
Labels: lifecycle/frozen, needs-kind, needs-priority, needs-triage

Comments

@mintcckey

mintcckey commented May 30, 2023

What happened:

We have noticed, on several occasions, pods panicking with "interface conversion: interface {} is cache.DeletedFinalStateUnknown, not *v1.Service". Checking the original source code, it looks like the type assertion in the service event handler does not handle the case where obj is a cache.DeletedFinalStateUnknown. Apologies if this issue has already been reported by others.

Thanks.
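For context, cache.DeletedFinalStateUnknown is the tombstone that client-go delivers to OnDelete handlers when the informer missed the actual delete event (for example after a relist); its Obj field carries the last known state of the object, which may be stale. Roughly, the type in k8s.io/client-go/tools/cache looks like this (shown here for reference only):

// From k8s.io/client-go/tools/cache: the informer wraps a deletion it did
// not observe directly in this tombstone instead of delivering the object.
type DeletedFinalStateUnknown struct {
	Key string      // "namespace/name" key of the deleted object
	Obj interface{} // last known state, possibly stale
}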

E0525 04:03:38.226398 7 runtime.go:79] Observed a panic: &runtime.TypeAssertionError{_interface:(*runtime._type)(0x17c10e0), concrete:(*runtime._type)(0x18a3e00), asserted:(*runtime._type)(0x1a23800), missingMethod:""} (interface conversion: interface {} is cache.DeletedFinalStateUnknown, not *v1.Service)
goroutine 125 [running]:
k8s.io/apimachinery/pkg/util/runtime.logPanic({0x1822d40?, 0xc003232660})
k8s.io/apimachinery@v0.25.2/pkg/util/runtime/runtime.go:75 +0x99
k8s.io/apimachinery/pkg/util/runtime.HandleCrash({0x0, 0x0, 0x203020203030202?})
k8s.io/apimachinery@v0.25.2/pkg/util/runtime/runtime.go:49 +0x75
panic({0x1822d40, 0xc003232660})
runtime/panic.go:884 +0x212
k8s.io/ingress-nginx/internal/ingress/controller/store.New.func21({0x18a3e00?, 0xc001d820e0?})
k8s.io/ingress-nginx/internal/ingress/controller/store/store.go:772 +0xde
k8s.io/client-go/tools/cache.ResourceEventHandlerFuncs.OnDelete(...)
k8s.io/client-go@v0.25.2/tools/cache/controller.go:246
k8s.io/client-go/tools/cache.(*processorListener).run.func1()
k8s.io/client-go@v0.25.2/tools/cache/shared_informer.go:820 +0xaf
k8s.io/apimachinery/pkg/util/wait.BackoffUntil.func1(0x0?)
k8s.io/apimachinery@v0.25.2/pkg/util/wait/wait.go:157 +0x3e
k8s.io/apimachinery/pkg/util/wait.BackoffUntil(0xc000659f38?, {0x1ccc140, 0xc00056ea20}, 0x1, 0xc000520cc0)
k8s.io/apimachinery@v0.25.2/pkg/util/wait/wait.go:158 +0xb6
k8s.io/apimachinery/pkg/util/wait.JitterUntil(0x0?, 0x3b9aca00, 0x0, 0x0?, 0xc000659f88?)
k8s.io/apimachinery@v0.25.2/pkg/util/wait/wait.go:135 +0x89
k8s.io/apimachinery/pkg/util/wait.Until(...)
k8s.io/apimachinery@v0.25.2/pkg/util/wait/wait.go:92
k8s.io/client-go/tools/cache.(*processorListener).run(0xc0007b3a80?)
k8s.io/client-go@v0.25.2/tools/cache/shared_informer.go:812 +0x6b
k8s.io/apimachinery/pkg/util/wait.(*Group).Start.func1()
k8s.io/apimachinery@v0.25.2/pkg/util/wait/wait.go:75 +0x5a
created by k8s.io/apimachinery/pkg/util/wait.(*Group).Start
k8s.io/apimachinery@v0.25.2/pkg/util/wait/wait.go:73 +0x85
panic: interface conversion: interface {} is cache.DeletedFinalStateUnknown, not *v1.Service [recovered]
panic: interface conversion: interface {} is cache.DeletedFinalStateUnknown, not *v1.Service

What you expected to happen:

NGINX Ingress controller version (exec into the pod and run nginx-ingress-controller --version.): controller-v1.4.0 tag

Kubernetes version (use kubectl version): v1.25.7

Environment:

  • Cloud provider or hardware configuration: GCP

  • OS (e.g. from /etc/os-release):

  • Kernel (e.g. uname -a):

  • Install tools: it's a GKE-managed Kubernetes cluster.

    • Please mention how/where the cluster was created, e.g. kubeadm/kops/minikube/kind.
  • Basic cluster related info:

    • kubectl version
    • kubectl get nodes -o wide
  • How was the ingress-nginx-controller installed:

    • If helm was used then please show output of helm ls -A | grep -i ingress
    • If helm was used then please show output of helm -n <ingresscontrollernamespace> get values <helmreleasename>
    • If helm was not used, then copy/paste the complete precise command used to install the controller, along with the flags and options used
    • if you have more than one instance of the ingress-nginx-controller installed in the same cluster, please provide details for all the instances

We didn't use Helm to install the chart; we applied a YAML manifest instead.

  • Current State of the controller:

    • kubectl describe ingressclasses
    • kubectl -n <ingresscontrollernamespace> get all -A -o wide
    • kubectl -n <ingresscontrollernamespace> describe po <ingresscontrollerpodname>
    • kubectl -n <ingresscontrollernamespace> describe svc <ingresscontrollerservicename>
  • Current state of ingress object, if applicable:

    • kubectl -n <appnamespace> get all,ing -o wide
    • kubectl -n <appnamespace> describe ing <ingressname>
    • If applicable, your complete and exact curl/grpcurl command (redacted if required) and the response to the curl/grpcurl command with the -v flag
  • Others:

    • Any other related information, like:
      • copy/paste of the snippet (if applicable)
      • kubectl describe ... of any custom configmap(s) created and in use
      • Any other related information that may help

Below is the code snippet from ingress-nginx/internal/ingress/controller/store/store.go at tag controller-v1.4.0 where I believe the panic happens.

serviceHandler := cache.ResourceEventHandlerFuncs{
		AddFunc: func(obj interface{}) {
			svc := obj.(*corev1.Service)
			if svc.Spec.Type == corev1.ServiceTypeExternalName {
				updateCh.In() <- Event{
					Type: CreateEvent,
					Obj:  obj,
				}
			}
		},
		DeleteFunc: func(obj interface{}) {
			svc := obj.(*corev1.Service)
			if svc.Spec.Type == corev1.ServiceTypeExternalName {
				updateCh.In() <- Event{
					Type: DeleteEvent,
					Obj:  obj,
				}
			}
		},
		UpdateFunc: func(old, cur interface{}) {
			oldSvc := old.(*corev1.Service)
			curSvc := cur.(*corev1.Service)

			if reflect.DeepEqual(oldSvc, curSvc) {
				return
			}

			updateCh.In() <- Event{
				Type: UpdateEvent,
				Obj:  cur,
			}
		},
	}
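For what it's worth, below is a minimal sketch of how the DeleteFunc above could tolerate the tombstone, following the usual client-go pattern. The helper name svcFromEventObj is hypothetical and this is only an illustration, not necessarily how it was (or should be) fixed upstream; it assumes the same corev1 and cache imports as the snippet above.

// svcFromEventObj unwraps the object handed to an event handler. It accepts
// either a *corev1.Service directly or a cache.DeletedFinalStateUnknown
// tombstone wrapping one, and reports false for anything else.
// (Hypothetical helper, sketched here for illustration.)
func svcFromEventObj(obj interface{}) (*corev1.Service, bool) {
	if svc, ok := obj.(*corev1.Service); ok {
		return svc, true
	}
	if tombstone, ok := obj.(cache.DeletedFinalStateUnknown); ok {
		svc, ok := tombstone.Obj.(*corev1.Service)
		return svc, ok
	}
	return nil, false
}

The DeleteFunc would then begin with svc, ok := svcFromEventObj(obj); if !ok { return } instead of the bare type assertion, so a missed delete no longer panics the controller.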

How to reproduce this issue:

Anything else we need to know:

It's not a critical issue for us, as the pod recovers after crashing.

mintcckey added the kind/bug label on May 30, 2023
k8s-ci-robot added the needs-triage label on May 30, 2023
@k8s-ci-robot
Contributor

This issue is currently awaiting triage.

If Ingress contributors determine this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@longwuyuan
Contributor

/remove-kind bug

Hi,
The information you provided is limited to some log messages, and that is not enough to reproduce the problem for analysis.

Firstly, can you kindly answer all the questions in the template that you skipped? That information may provide context and details that will help the analysis.

Importantly, if you can write a step-by-step guide that anyone can copy/paste on a kind or minikube cluster, it will help make progress on this issue.
Thanks

k8s-ci-robot added the needs-kind label and removed the kind/bug label on May 30, 2023
@mintcckey
Author

> /remove-kind bug
>
> Hi, The information you provided is limited to some log messages, and that is not enough to reproduce the problem for analysis.
>
> Firstly, can you kindly answer all the questions in the template that you skipped? That information may provide context and details that will help the analysis.
>
> Importantly, if you can write a step-by-step guide that anyone can copy/paste on a kind or minikube cluster, it will help make progress on this issue. Thanks

Hi there, I tried to provide as much information as I could, and I will keep updating the issue as more information becomes available to me. Thanks.

@tao12345666333
Member

Could you please try to upgrade your ingress-nginx controller to the latest version?

@mintcckey
Author

> Could you please try to upgrade your ingress-nginx controller to the latest version?

Hi there, we do plan to upgrade, but we would also like to understand the root cause so we can make sure it has been fixed in more recent versions. Thank you.

@tao12345666333
Member

In fact, version 1.4 is no longer supported, so I hope you can upgrade before we confirm this again.
https://github.com/kubernetes/ingress-nginx#supported-versions-table

@github-actions

This is stale, but we won't close it automatically; just bear in mind that the maintainers may be busy with other tasks and will get to your issue as soon as possible. If you have any questions or want to request prioritization, please reach out to #ingress-nginx-dev on Kubernetes Slack.

The github-actions bot added the lifecycle/frozen label on Jun 30, 2023
@dvaldivia

I think this problem is related to the informer shutting down for some reason and pushing this type of message.

@Scusemua

I've also experienced this issue, though I don't have a good way to reproduce it.

@geberl

geberl commented Apr 20, 2024

I encountered something similar when my Pod's ClusterRole was missing the watch permission on Pods (here probably Services). The cache.DeletedFinalStateUnknown objects are gone once this permission is present.
