
Kubernetes Flows / Workers do not gracefully handle evictions #12988

Open · meggers opened this issue Mar 7, 2024 · 1 comment
Assignees: gabcoyne
Labels: enhancement (An improvement of an existing feature)

Comments

@meggers

meggers commented Mar 7, 2024

We see that when a pod for a flow started by a Kubernetes worker is evicted, the flow run is left stuck in a Running state. Furthermore, cancelling a flow in this state leaves it stuck in Cancelling. This is a very troubling scenario because evictions are a routine, expected event in Kubernetes.

Additionally, we have tried setting timeout and retry annotations on our flows, but they are not respected when the pod is evicted.

Expectation / Proposal

I expect the flow run to enter a Crashed state (or be retried automatically): either the flow itself should handle the eviction gracefully and report back Crashed, or the worker should detect the evicted pod and mark the run Crashed or retry it. The worker should also handle evictions gracefully to cover the scenario where the worker itself is restarted while monitoring flows.
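
For illustration, one way the worker side could notice an evicted flow pod (a hypothetical sketch, not Prefect's actual worker code; the pod and namespace names are placeholders):

# Hypothetical sketch: check whether the flow pod was evicted.
# Not the actual worker code; pod/namespace names are placeholders.
from kubernetes import client, config

config.load_kube_config()
v1 = client.CoreV1Api()

pod = v1.read_namespaced_pod(name="flow-pod-name", namespace="your-namespace")
# Node-pressure evictions leave the pod in phase Failed with reason "Evicted";
# API-initiated evictions delete the pod, which shows up as a 404 on read.
if pod.status.phase == "Failed" and (pod.status.reason or "") == "Evicted":
    print("Pod was evicted; this is where the worker could report the run as Crashed.")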

We also expect that timeouts/retry annotations apply in the case of evictions.
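
For reference, a minimal sketch of the kind of flow-level retry/timeout settings in question (illustrative values; assuming Prefect's flow-level parameters rather than Kubernetes job annotations):

from time import sleep
from prefect import flow

# Illustrative values only: these are the settings we would expect to be
# respected (retry or time out the run) when the flow pod is evicted.
@flow(log_prints=True, retries=2, retry_delay_seconds=60, timeout_seconds=900)
def test_long_running_job():
    sleep(630)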

Traceback / Example

Flow:
prefect-client==2.16.2

K8s Worker:
prefect-client==2.16.2
prefect-kubernetes==0.3.5

Steps to reproduce:

  1. Create a deployment:
from time import sleep
from prefect import flow

@flow(log_prints=True)
def test_long_running_job():
    sleep(630)

if __name__ == "__main__":
    test_long_running_job.deploy(
        name="long-running-job", 
        work_pool_name="my-workpool", 
        image="my/image"
    )
  2. Run the deployment:
    prefect deployment run 'test_long_running_job/long-running-job'

  3. Wait a few minutes (9 minutes, in my tests), then evict the flow pod:

from kubernetes import client, config

k8s_config_file = "your/config/file"
cluster = "your-cluster-context"
namespace = "your-namespace"
pod_name = "flow-pod-name"

config.load_kube_config(config_file=k8s_config_file, context=cluster)
v1 = client.CoreV1Api()
# Evict the flow pod via the Kubernetes Eviction API (the same API used by kubectl drain).
v1.create_namespaced_pod_eviction(
    name=pod_name,
    namespace=namespace,
    body=client.V1Eviction(metadata=client.V1ObjectMeta(name=pod_name))
)
  4. Observe that the flow remains in Running. Worker logs are shown below (note there are no entries indicating the worker observed the eviction); a sketch for checking the flow run state programmatically follows the screenshot.

[screenshot: worker logs]
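
To confirm the stuck state programmatically, here is a minimal sketch using the Prefect client (the flow run ID is a placeholder):

import asyncio
from prefect import get_client

async def check_flow_run(flow_run_id: str):
    # Read the flow run and print its current state; after the eviction it
    # still reports RUNNING rather than CRASHED.
    async with get_client() as client:
        flow_run = await client.read_flow_run(flow_run_id)
        print(flow_run.state.type, flow_run.state.name)

asyncio.run(check_flow_run("your-flow-run-id"))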

@urimandujano urimandujano added the enhancement An improvement of an existing feature label Mar 7, 2024
@gabcoyne gabcoyne self-assigned this Apr 2, 2024
@meggers
Author

meggers commented Apr 25, 2024

A couple of notes:

This continues to happen almost daily for us. It is becoming a larger issue.

I notice that pod_watch_timeout_seconds is passed to KubernetesEventsReplicator, where it ends up being consumed by the kubernetes watch.stream call. I think this is incorrect: job_watch_timeout_seconds should be passed instead, and only if it is set; if it is not set, there should be no timeout at all. See https://github.com/kubernetes-client/python/blob/master/examples/watch/timeout-settings.md
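
For context, a minimal sketch of the server-side watch timeout described in the linked example (the namespace is a placeholder):

from kubernetes import client, config, watch

config.load_kube_config()
v1 = client.CoreV1Api()
w = watch.Watch()

# With timeout_seconds set, the server closes the watch after ~60s and the
# loop exits. Omitting the argument keeps the watch open until the server
# ends it, which is the behavior I'd expect when no job timeout is configured.
for event in w.stream(v1.list_namespaced_pod, namespace="your-namespace", timeout_seconds=60):
    print(event["type"], event["object"].metadata.name)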

I also notice that there are two watches: one from KubernetesEventsReplicator and one from _watch_job. Are both necessary?

In a recent case, we see that _replicate_pod_events failed after initially submitting the Job.
[screenshot: _replicate_pod_events failure]

@desertaxle desertaxle transferred this issue from PrefectHQ/prefect-kubernetes Apr 26, 2024