Multi-select flow runs and set state in bulk (to clear late runs, delete scheduled runs in bulk, and more) #7006

anna-geller · 2022-09-28T17:07:29Z

First check

I added a descriptive title to this issue.
I used the GitHub search to find a similar request and didn't find it.
I searched the Prefect documentation for this feature.

Prefect Version

2.x

Describe the current behavior

Currently, there is no easy way to:

clear late runs
cancel multiple scheduled runs
delete many runs with arbitrary state or tag when needed

Describe the proposed behavior

Create a feature allowing to:

Filter for runs based on state e.g. Late/Scheduled/Failed or tag or work queue name
Select all (e.g., from the UI, CLI, or Python client)
Change the state for all those runs in bulk or delete those runs

Example Use

Feature parity with 1.0 having a button "Clear late runs"
Ability to clear late runs scheduled for the past so that they are not picked up when the agent gets restarted -- especially useful when an agent goes down, and the user doesn't want that those late runs to get executed (e.g. you may prefer to delete those late runs and start scheduling only future runs from the point when an agent got restarted)

Additional context

Imagine the scenario: you have an hourly scheduled deployment. Your agent went down yesterday at 9 AM. You realized that agent issue today and restarted the agent process at 9 AM the following day.

Current behavior: there are 24 late runs that get immediately picked up once the agent process gets restarted. Some users, justifiably, don't like it, as they would prefer to just start scheduling future runs and clear those 24 late runs scheduled for the past.

efranksrecroom · 2023-01-20T20:01:20Z

Would LOVE to see this implemented. I have spent MANY hours over the last couple of months clearing "stuck" flow runs when my k8s agent has silently crashed. I must clear the pending jobs before deleting the agent pod otherwise the new agent will try to pickup the, at times, hundreds of pending flows which will then cause the agent to get into a continuous crash cycle. Having to select them 1 by 1 from the UI, when there are 500+ flow runs, is incredibly time consuming and tedious.

zanieb · 2023-01-20T20:03:10Z

cc @zhen0 / @billpalombi this looks like it was triaged but I want to be sure it's actually on a roadmap.

zhen0 · 2023-01-20T21:18:51Z

Thanks @madkinsz! I'm adding to the UI backlog and we can set a priority and see who can pick it up at our team huddle on Monday.

EmilRex · 2023-01-20T21:43:25Z

@zhen0 Just wanted to add that pages for flows, deployments, and work-queues already support the desired behavior (i.e. anywhere we use lists instead of cards). Having a check box next to the "{n} Flow runs" text above the cards that acted as "select all" would probably work well.

zhen0 · 2023-01-24T19:43:34Z

Small update here that we want to think through how we clear these runs - we can easily have thousands (or more) of flow runs in the ui at one time so it's not as simple as the run multi-select or flow/deployment table view objects. We're actively looking into it.

EmilRex · 2023-01-24T22:31:12Z

@zhen0 definitely! In case it makes sense, I'll just add that even being able to select one page-worth at a time would be helpful.

zhen0 · 2023-01-25T03:59:12Z

@billpalombi - shall we add this to product orchestration's backlog?

cgoodric · 2023-02-01T18:21:28Z

Can we have a flow deployment flag that just says "don't catch up on missed runs?" I have jobs that really cannot have two instances running at the same time (trying to insert the same data set into the same table.)

krasoffski · 2023-02-08T14:15:00Z

Is there is any API example related to Prefect 2.0 which can workaround this problem right now?
Some ideas were provided in the original question: #5005 but mostly don't work for prefect 2.0 API.
Also it would be great as suggested @cgoodric to have ability handle this on API level automatically. Every time doing clean-up via UI is very annoying.

krasoffski · 2023-02-09T13:02:03Z

Hello again,

@anna-geller, can I use this approach for skipping unnecessary flows (within flow function)?

flow_run = context.get_run_context().flow_run
max_overdue = 300

if (
    flow_run.auto_scheduled  # allowing manual runs
    and flow_run.estimated_start_time_delta > datetime.timedelta(seconds=max_overdue)
):
    run_logger.warning("Cancelling task as outdated")
    return Cancelled()

bobpeers · 2023-04-08T15:19:45Z

Just experienced this issues today and found out it's not possible to delete late runs. In our case the vast majority of times I just want to delete late runs and get fresh data on the next scheduled run.
Seems the agent lost connection and being Easter I didn't realise so today I spent 20 minutes clicking 486 checkboxes to delete the late runs 🥲

zanieb · 2023-04-09T17:51:09Z

@bobpeers sounds like this could be resolved by #9054

bobpeers · 2023-04-09T17:57:52Z

@madkinsz Yes that would solve it for me 🫡

klayhb · 2023-08-29T12:14:44Z

clicking checkboxes isn't fun. we would love to see this impl.

th0ger · 2023-11-14T09:10:10Z

Slow cleanup workaround (1-5s per deletion), but it does the job.

for _ in {1..10}; do time prefect flow-run ls --limit 100 --state-type=SCHEDULED --state=Late --flow-name=FLOW_NAME | tail -n +5 | head -n -1 | awk '{print $2}' | while read guid; do prefect flow-run delete $guid; done; done

jackharrhy · 2024-01-23T18:07:38Z

let buttons = Array.from(document.querySelectorAll('input[class=p-checkbox__input]'));

(async () => {
  for (let b of buttons) {
    console.log(b)
    b.click();
    await new Promise(r => setTimeout(r, 100));
  }
})();

you can also put this into the browser console to script clicking every button on the page (note: this will select 50 since the list is virtualized, so you must do this each time per 50 items.

khgouldy · 2024-02-25T22:32:59Z

Can we have a flow deployment flag that just says "don't catch up on missed runs?" I have jobs that really cannot have two instances running at the same time (trying to insert the same data set into the same table.)

We need this

collincchoy · 2024-03-25T19:14:28Z

As of v2.16.5, the UI now has a shortcut for selecting/deselecting multiple flow runs at once. The selection takes current in-use filters into account to ultimately provide users with the ability to:

Navigate to a page within the UI
Filter by criteria like state
Select all
Delete

Should be available on all pages where flow runs are currently listed in filterable and selectable views - e.g. the flow runs, flow, and deployment pages.

Please note that these changes only affected the UI.

anna-geller added enhancement An improvement of an existing feature status:triage status:accepted We may work on this; we will accept work from external contributors and removed status:triage labels Sep 28, 2022

zanieb added ui Related to the Prefect web interface and removed status:accepted We may work on this; we will accept work from external contributors labels Jan 20, 2023

billpalombi added the priority:medium label Jan 24, 2023

taylor-curran added the from:sales Submitted by a sales engineer label Mar 10, 2023

cicdw removed priority:medium labels Aug 15, 2023

zzstoatzz mentioned this issue Dec 11, 2023

UI: Add checkbox to Block overview to allow multi-selection/-deletion #11372

Closed

3 tasks

zhen0 assigned collincchoy Feb 28, 2024

collincchoy removed their assignment Mar 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-select flow runs and set state in bulk (to clear late runs, delete scheduled runs in bulk, and more) #7006

Multi-select flow runs and set state in bulk (to clear late runs, delete scheduled runs in bulk, and more) #7006

anna-geller commented Sep 28, 2022

efranksrecroom commented Jan 20, 2023

zanieb commented Jan 20, 2023

zhen0 commented Jan 20, 2023

EmilRex commented Jan 20, 2023

zhen0 commented Jan 24, 2023

EmilRex commented Jan 24, 2023

zhen0 commented Jan 25, 2023

cgoodric commented Feb 1, 2023 •

edited

krasoffski commented Feb 8, 2023 •

edited

krasoffski commented Feb 9, 2023 •

edited

bobpeers commented Apr 8, 2023

zanieb commented Apr 9, 2023

bobpeers commented Apr 9, 2023

klayhb commented Aug 29, 2023

th0ger commented Nov 14, 2023

jackharrhy commented Jan 23, 2024

khgouldy commented Feb 25, 2024

collincchoy commented Mar 25, 2024

Multi-select flow runs and set state in bulk (to clear late runs, delete scheduled runs in bulk, and more) #7006

Multi-select flow runs and set state in bulk (to clear late runs, delete scheduled runs in bulk, and more) #7006

Comments

anna-geller commented Sep 28, 2022

First check

Prefect Version

Describe the current behavior

Describe the proposed behavior

Example Use

Additional context

efranksrecroom commented Jan 20, 2023

zanieb commented Jan 20, 2023

zhen0 commented Jan 20, 2023

EmilRex commented Jan 20, 2023

zhen0 commented Jan 24, 2023

EmilRex commented Jan 24, 2023

zhen0 commented Jan 25, 2023

cgoodric commented Feb 1, 2023 • edited

krasoffski commented Feb 8, 2023 • edited

krasoffski commented Feb 9, 2023 • edited

bobpeers commented Apr 8, 2023

zanieb commented Apr 9, 2023

bobpeers commented Apr 9, 2023

klayhb commented Aug 29, 2023

th0ger commented Nov 14, 2023

jackharrhy commented Jan 23, 2024

khgouldy commented Feb 25, 2024

collincchoy commented Mar 25, 2024

cgoodric commented Feb 1, 2023 •

edited

krasoffski commented Feb 8, 2023 •

edited

krasoffski commented Feb 9, 2023 •

edited