Charm doesn't react to pebble_ready so if the workload container restarts the charm stays in maintenance status #129

Open · mthaddon opened this issue Apr 18, 2024 · 7 comments
Labels
bug Something isn't working

Comments

@mthaddon
Contributor

Describe the bug

The charm doesn't react to pebble_ready, so if the workload container restarts, the charm stays in maintenance status.
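
For context, a sidecar charm would normally observe the container's pebble-ready event (or re-evaluate its status on every event) so that a container restart clears the maintenance status. A minimal sketch of the pebble-ready pattern, assuming the ops framework and a container named lego (illustrative only, not this charm's actual code):

    import ops

    class LegoCharm(ops.CharmBase):  # illustrative class name
        def __init__(self, framework):
            super().__init__(framework)
            # Juju emits lego-pebble-ready when the lego container (re)starts,
            # so observing it lets the charm leave maintenance after a restart.
            self.framework.observe(self.on.lego_pebble_ready, self._on_lego_pebble_ready)

        def _on_lego_pebble_ready(self, event: ops.PebbleReadyEvent):
            if event.workload.can_connect():
                self.unit.status = ops.ActiveStatus()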

To Reproduce

  1. Deploy the charm
  2. Restart the workload (lego) container
  3. Observe the status in juju remains as "maintenance"

Expected behavior

The charm should be active/idle.

Screenshots

N/A

Logs

N/A

Environment

  • Charm / library version (if relevant): 40
  • Juju version (output from juju --version): 3.1.8
  • Cloud Environment: OpenStack
  • Kubernetes version (output from kubectl version --short):
    Client Version: v1.29.4
    Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3
    Server Version: v1.26.15

Additional context

N/A

mthaddon added the bug label on Apr 18, 2024
@gruyaume
Contributor

Thank you for opening this issue @mthaddon; we've added it to our next pulse.

@gruyaume
Contributor

gruyaume commented Apr 23, 2024

I'm pretty sure this was fixed when we moved to using collect_status instead of observing individual events to set the charm status.

I'll promote the charm from edge to candidate. @mthaddon, can you confirm whether you are still experiencing the issue with the candidate release?

juju deploy httprequest-lego-k8s --channel=candidate
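
For reference, a minimal sketch of the collect-status pattern described above, assuming a container named lego (names are illustrative, not the charm's actual code):

    import ops

    class LegoCharm(ops.CharmBase):  # illustrative class name
        def __init__(self, framework):
            super().__init__(framework)
            # collect-unit-status fires at the end of every event, so the status
            # is re-evaluated after a container restart without observing
            # pebble-ready explicitly.
            self.framework.observe(self.on.collect_unit_status, self._on_collect_status)

        def _on_collect_status(self, event: ops.CollectStatusEvent):
            if not self.unit.get_container("lego").can_connect():
                event.add_status(ops.MaintenanceStatus("waiting for the lego container"))
                return
            event.add_status(ops.ActiveStatus())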

@mthaddon
Contributor Author

I can't see a way to manually kill the workload container for this charm: there's no shell I can connect to in the workload container, and the pebble plan seems to be empty (I've just deployed the charm and set httpreq_endpoint and email, but haven't related it to anything). Do you have any ideas on how to manually kill the lego container so that I can test this?

@gruyaume
Contributor

Indeed, there is no pebble plan; we only make LEGO CLI calls when needed. Here I simply deleted the k8s pod:

kubectl delete pod <pod name> -n <model name>

@mthaddon
Contributor Author

mthaddon commented Apr 24, 2024

@gruyaume that deletes the entire pod. What I need to do here to confirm if the bug is fixed is to kill the lego container (which will then be restarted) but leave the charm container running.
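
One possible way to restart only the lego container without deleting the pod (untested here, and assuming the container runtime supports ephemeral debug containers with --target) would be to attach a debug container sharing the lego container's process namespace and signal its entrypoint:

    kubectl debug -it <pod name> -n <model name> --image=busybox --target=lego -- sh
    # inside the debug shell: signal the lego container's PID 1 so the kubelet restarts it
    kill 1

Whether pebble, running as PID 1 in the container, handles the signal and exits is an assumption worth verifying; if it ignores SIGTERM, this won't trigger a restart.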

@mthaddon
Contributor Author

Just for a bit more context, here's the output of kubectl describe pod and the previous container logs for the lego container on an instance manifesting this bug: https://pastebin.ubuntu.com/p/nZdqgtKkwt/

As you can see, the lego container has been restarted twice, and the last restart was on Wed, 17 Apr 2024 19:11:04 +0000.
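
For anyone gathering the same information, that output corresponds to commands along these lines (pod and model names are placeholders):

    kubectl describe pod <pod name> -n <model name>
    kubectl logs <pod name> -c lego --previous -n <model name>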

@gruyaume
Contributor

gruyaume commented Apr 25, 2024

I'm quite confident this would have been fixed by switching to collect_status, as the status is now evaluated on every event. I'm not quite sure how to confirm this 100%, though.
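
One hedged way to check this without poking at a live pod might be a unit test with ops.testing.Harness, simulating the container coming back up and then triggering the collect-status evaluation. The charm class import and container name below are assumptions, not this charm's actual code:

    from ops.model import MaintenanceStatus
    from ops.testing import Harness

    from charm import LegoCharm  # hypothetical import; use the charm's real class name

    def test_status_recovers_after_container_restart():
        # Harness loads metadata from the charm source, which must declare the lego container.
        harness = Harness(LegoCharm)
        harness.begin()
        # Simulate the lego container becoming reachable again after a restart.
        harness.container_pebble_ready("lego")
        # Trigger collect-status so the charm re-evaluates its status (ops >= 2.7).
        harness.evaluate_status()
        assert not isinstance(harness.model.unit.status, MaintenanceStatus)
        harness.cleanup()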
