Adjusted the EMRServerlessStartJobOperator to cancel failed jobs #51883

dominikhei · 2025-06-18T12:12:49Z

I have added a cancel_job method to the EmrServerlessHook, which wraps the cancel_job_run method from boto3.

For a non-deferrable job run, cancel_job is called when the exception signalling that waiter_max_attempts has been reached is raised. If deferrable is set to True, the cancellation logic lives in execute_complete, because that is the method that evaluates the job state in that mode.
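For readers following along, here is a minimal sketch of the two paths described above. It is not the code from this PR: `WaiterMaxAttemptsReached`, `wait_or_cancel`, the standalone `cancel_job` and `execute_complete` functions, and the event shape (`status` and `job_id` keys) are all hypothetical names chosen for illustration. The only real API assumed is the boto3 EMR Serverless client's `cancel_job_run(applicationId=..., jobRunId=...)`.

```python
from typing import Any, Callable, Dict


class WaiterMaxAttemptsReached(Exception):
    """Hypothetical stand-in for the error raised once waiter_max_attempts is exhausted."""


def cancel_job(client: Any, application_id: str, job_run_id: str) -> None:
    """Thin wrapper around the boto3 EMR Serverless client's cancel_job_run call."""
    client.cancel_job_run(applicationId=application_id, jobRunId=job_run_id)


def wait_or_cancel(
    client: Any,
    application_id: str,
    job_run_id: str,
    wait_for_job: Callable[[], None],
) -> None:
    """Non-deferrable path: block on the waiter and cancel the run if it gives up."""
    try:
        wait_for_job()
    except WaiterMaxAttemptsReached:
        # The job may still be running or pending in EMR, so stop it before
        # a retry submits a second run alongside it.
        cancel_job(client, application_id, job_run_id)
        raise


def execute_complete(client: Any, application_id: str, event: Dict[str, str]) -> str:
    """Deferrable path: inspect the trigger event and cancel on a non-success status.

    The event keys used here are an assumption, not the trigger's documented payload.
    """
    if event.get("status") != "success":
        cancel_job(client, application_id, event["job_id"])
        raise RuntimeError(f"Job run did not succeed: {event}")
    return event["job_id"]
```

With real AWS credentials, `client` would typically be `boto3.client("emr-serverless")`. Any object exposing `cancel_job_run` works for trying the sketch locally.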

vincbeck

I feel like this is a very opinionated decision. I am wondering if this is not something the user should set by using on_failure_callback and not us to take this decision.

I'd like to hear more thoughts on that from others.

providers/amazon/src/airflow/providers/amazon/aws/hooks/emr.py

providers/amazon/src/airflow/providers/amazon/aws/operators/emr.py

dominikhei · 2025-06-18T17:17:22Z

> I feel like this is a very opinionated decision. I am wondering if this is not something the user should set by using on_failure_callback and not us to take this decision.
>
> I'd like to hear more thoughts on that from others.

Apologies if this is an obvious question, but is there a use case where you would want the job to not be cancelled in EMR if a new one is created due to retries, now running / pending concurrently?

vincbeck · 2025-06-18T17:21:51Z

> > I feel like this is a very opinionated decision. I am wondering if this is not something the user should set by using on_failure_callback and not us to take this decision.
> >
> > I'd like to hear more thoughts on that from others.
>
> Apologies if this is an obvious question, but is there a use case where you would want the job to not be cancelled in EMR if a new one is created due to retries, now running / pending concurrently?

Hard to know all the different user use cases but I think you're correct, I do not see any, so I am probably wrong in my perception :)

dominikhei · 2025-06-18T17:54:11Z

> > > I feel like this is a very opinionated decision. I am wondering if this is not something the user should set by using on_failure_callback and not us to take this decision.
> > >
> > > I'd like to hear more thoughts on that from others.
> >
> > Apologies if this is an obvious question, but is there a use case where you would want the job to not be cancelled in EMR if a new one is created due to retries, now running / pending concurrently?
>
> Hard to know all the different user use cases but I think you're correct, I do not see any, so I am probably wrong in my perception :)

That’s true, there’s definitely a point in letting the user decide, albeit at the cost of more complexity. As you said, let’s wait for other opinions :)

Adjusted the EMRServerlessStartJobOperator to cancel submited jobs on…

4a70c70

… failure

boring-cyborg bot added area:providers provider:amazon AWS/Amazon - related issues labels Jun 18, 2025

dominikhei marked this pull request as ready for review June 18, 2025 12:55

dominikhei requested review from eladkal and o-nikolas as code owners June 18, 2025 12:55

vincbeck requested changes Jun 18, 2025

Removed hook.cancel_job_run and adjusted the return value of EmrServe…

7703c44

…rlessStartJobTrigger
