Amazon NeptuneStopDbClusterOperator bug #38120

ferruzzi · 2024-03-13T16:52:50Z

Apache Airflow Provider(s)

amazon

Versions of Apache Airflow Providers

latest; AWS System test stack pulls from main

Apache Airflow version

latest; AWS System test stack pulls from main

Operating System

linux

Deployment

Docker-Compose

Deployment details

No response

What happened

On occasion the stop_db_cluster() call will return a "backing-up" state which is not handled, causing an exception. Example log message:

ERROR [airflow.task] Task failed with exception
Traceback (most recent call last):
  File "/opt/airflow/airflow/models/taskinstance.py", line 447, in _execute_task
    result = _execute_callable(context=context, **execute_callable_kwargs)
  File "/opt/airflow/airflow/models/taskinstance.py", line 417, in _execute_callable
    return execute_callable(context=context, **execute_callable_kwargs)
  File "/opt/airflow/airflow/providers/amazon/aws/operators/neptune.py", line 186, in execute
    resp = self.hook.conn.stop_db_cluster(DBClusterIdentifier=self.cluster_id)
  File "/usr/local/lib/python3.8/site-packages/botocore/client.py", line 553, in _api_call
    return self._make_api_call(operation_name, kwargs)
  File "/usr/local/lib/python3.8/site-packages/botocore/client.py", line 1009, in _make_api_call
    raise error_class(parsed_response, operation_name)
botocore.errorfactory.InvalidDBClusterStateFault: An error occurred (InvalidDBClusterStateFault) when calling the StopDBCluster operation: DbCluster env989356b0-cluster is in backing-up state but expected it to be one of available.
INFO  [airflow.models.taskinstance] Marking task as UP_FOR_RETRY. dag_id=example_neptune, task_id=stop_task, execution_date=20210101T000000, start_date=20240313T100059, end_date=20240313T100100

What you think should happen instead

If that is a possible state, it needs to be handled gracefully

How to reproduce

The issue is intermittent so I'm not 100% sure. Force a cluster to start a backup then immediately try to delete it while it is in progress.

Anything else

No response

Are you willing to submit PR?

Yes I am willing to submit a PR!

Code of Conduct

I agree to follow this project's Code of Conduct

The text was updated successfully, but these errors were encountered:

ferruzzi · 2024-03-13T16:53:41Z

@ellisms - This was your contribution, do you want to have a look at it?

ellisms · 2024-03-13T17:17:11Z

Yep, I'll take a look at it

ferruzzi · 2024-03-13T17:28:53Z

Cool, thanks. Assigned to you. Let me know if you need anything more from me, and feel free to ping me in the PR for a review.

ferruzzi added kind:bug This is a clearly a bug area:providers needs-triage label for new issues that we didn't triage yet labels Mar 13, 2024

ferruzzi assigned ellisms Mar 13, 2024

ferruzzi removed the needs-triage label for new issues that we didn't triage yet label Mar 13, 2024

Taragolis added the provider:amazon-aws AWS/Amazon - related issues label Mar 13, 2024

ellisms mentioned this issue Mar 19, 2024

NeptuneStopDbClusterOperator - Handle invalid cluster states #38287

Merged

eladkal added the good first issue label May 21, 2024

eladkal closed this as completed in #38287 May 22, 2024

eladkal mentioned this issue May 26, 2024

Status of testing Providers that were prepared on May 26, 2024 #39842

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Amazon NeptuneStopDbClusterOperator bug #38120

Amazon NeptuneStopDbClusterOperator bug #38120

ferruzzi commented Mar 13, 2024

ferruzzi commented Mar 13, 2024

ellisms commented Mar 13, 2024

ferruzzi commented Mar 13, 2024

Amazon NeptuneStopDbClusterOperator bug #38120

Amazon NeptuneStopDbClusterOperator bug #38120

Comments

ferruzzi commented Mar 13, 2024

Apache Airflow Provider(s)

Versions of Apache Airflow Providers

Apache Airflow version

Operating System

Deployment

Deployment details

What happened

What you think should happen instead

How to reproduce

Anything else

Are you willing to submit PR?

Code of Conduct

ferruzzi commented Mar 13, 2024

ellisms commented Mar 13, 2024

ferruzzi commented Mar 13, 2024