Retina Agent Pod Experiences CrashLoopBackOff After Installation on EKS #266

@jaeeyoungkim

1. Description:
I installed Retina by following the official installation guide (Retina Installation Setup) and ran the commands exactly as documented. The retina-agent pod then entered a CrashLoopBackOff state, and its logs showed a panic from the controller manager.

2. Steps to Reproduce:

1. Navigate to the Retina installation documentation page: https://retina.sh/docs/installation/setup
2. Run the following commands to set the version and install Retina via Helm:

VERSION=$( curl -sL https://api.github.com/repos/microsoft/retina/releases/latest | jq -r .name)
helm upgrade --install retina oci://ghcr.io/microsoft/retina/charts/retina \
    --version $VERSION \
    --namespace kube-system \
    --set image.tag=$VERSION \
    --set operator.tag=$VERSION \
    --set logLevel=info \
    --set enabledPlugin_linux="[dropreason,packetforward,linuxutil,dns]"

3. Observe that the retina-agent pod enters a CrashLoopBackOff state.
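
For step 2, it may help to confirm that `VERSION` was actually resolved before Helm runs, since `jq -r .name` prints the literal string `null` when the field is missing (for example, if the GitHub API request is rate limited). The following is only a sketch: `check_version` is a hypothetical helper, not part of the Retina docs, and an unresolved version is not known to be the cause of the crash reported here.

```shell
# Hypothetical helper (not from the Retina documentation): reject an empty
# or "null" version string before it is passed to helm.
check_version() {
    case "$1" in
        ""|null)
            # jq -r prints "null" when the .name field is absent
            echo "VERSION could not be resolved: '$1'" >&2
            return 1
            ;;
        *)
            echo "Using Retina version $1"
            ;;
    esac
}

# Usage sketch: resolve VERSION as in step 2, then validate it:
#   check_version "$VERSION" || exit 1
# and only then run the helm command from step 2.
```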

Expected Behavior:
The Retina agent should install without errors, and its pods should reach and stay in the Running state.

Actual Behavior:
The retina-agent pod fails to start and enters CrashLoopBackOff. The logs show the following panic:
panic: Error running controller manager

goroutine 148 [running]:
go.uber.org/zap/zapcore.CheckWriteAction.OnWrite(0x1?, 0x0?, {0x0?, 0x0?, 0xc001432020?})
    /go/pkg/mod/go.uber.org/zap@v1.26.0/zapcore/entry.go:196 +0x54
go.uber.org/zap/zapcore.(*CheckedEntry).Write(0xc00143c000, {0xc0016e2980, 0x1, 0x1})
    /go/pkg/mod/go.uber.org/zap@v1.26.0/zapcore/entry.go:262 +0x3ec
go.uber.org/zap.(*Logger).Panic(0xc000998700?, {0x2b52d44?, 0x0?}, {0xc0016e2980, 0x1, 0x1})
    /go/pkg/mod/go.uber.org/zap@v1.26.0/logger.go:284 +0x51
github.com/microsoft/retina/pkg/managers/controllermanager.(*Controller).Start(0xc000b0f220, {0x2f10ad0?, 0xc000b0f1d0?})
    /go/src/github.com/microsoft/retina/pkg/managers/controllermanager/controllermanager.go:119 +0x28c
created by main.main in goroutine 1
    /go/src/github.com/microsoft/retina/controller/main.go:290 +0x28d0
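
The panic message above does not include the underlying error, so the pod events and the logs of the previous (crashed) container may be needed to narrow this down. A minimal sketch of how they could be collected, assuming `kubectl` access to the cluster; `collect_crash_info` is a hypothetical helper name, and the actual retina-agent pod name has to be substituted:

```shell
# Hypothetical helper: gather details about a crash-looping pod.
# $1 = pod name (for example, the retina-agent pod)
# $2 = namespace (defaults to kube-system, where the chart was installed)
collect_crash_info() {
    pod="$1"
    ns="${2:-kube-system}"
    if [ -z "$pod" ]; then
        echo "usage: collect_crash_info <pod-name> [namespace]" >&2
        return 2
    fi
    # Events and container state (restart count, last termination reason)
    kubectl -n "$ns" describe pod "$pod"
    # Logs from the previous container instance, i.e. the one that crashed
    kubectl -n "$ns" logs "$pod" --previous
}
```

The pod name can be found with `kubectl -n kube-system get pods` and passed as the first argument.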

Platform

  • OS: Amazon Linux 2
  • Kubernetes Version: v1.25.9-eks-0a21954
  • Host: EKS
  • Retina Version: v0.0.5
