SFTPOperator does not finish downloading #28603

@ReneTC

Apache Airflow version

Other Airflow 2 version (please specify below)

What happened

I have an issue with the SFTPOperator from airflow.providers.sftp.operators.sftp. Most of the time it works perfectly.
However, it sometimes fails to download the complete file and hangs in the "downloading" state. One job has been running for 7 days to download a 38 MB file; the output file holds 37.5 MB of the 38 MB. The last line in the log is: Starting to transfer from ftp_server_file.csv to local_gcs_file.csv

But most of the time it works perfectly. It has downloaded a 39 MB file before without issues.

I'm downloading from an SFTP server to Google Cloud Storage in Airflow 2.3.3.

What you think should happen instead

The operator apparently still thinks it is downloading, which makes sense because the file is not 100% downloaded. But the download speed must be close to 0 bytes per second. The job should terminate instead of running indefinitely.
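
To illustrate the behavior I would expect, here is a minimal sketch of a stall watchdog. It is not the operator's actual implementation. The names `run_with_stall_timeout` and `TransferStalled` are hypothetical, and the only assumption about the transfer is that it accepts a progress callback of the form `callback(bytes_so_far, total_bytes)`, which is the shape paramiko's `SFTPClient.get` documents for its `callback` argument.

```python
import threading
import time


class TransferStalled(RuntimeError):
    """Raised when a transfer reports no new bytes for too long (hypothetical name)."""


def run_with_stall_timeout(transfer, stall_timeout, poll_interval=0.05):
    """Run transfer(callback) in a worker thread and fail if progress stops.

    `transfer` is any callable that takes a progress callback of the form
    callback(bytes_so_far, total_bytes) and returns once the copy is done.
    Returns the last byte count reported by the transfer.
    """
    state = {"bytes": 0, "last_progress": time.monotonic(), "error": None}
    lock = threading.Lock()

    def on_progress(bytes_so_far, total_bytes):
        # Only an increase in the byte count resets the stall timer.
        with lock:
            if bytes_so_far > state["bytes"]:
                state["bytes"] = bytes_so_far
                state["last_progress"] = time.monotonic()

    def worker():
        try:
            transfer(on_progress)
        except Exception as exc:  # re-raised in the calling thread below
            with lock:
                state["error"] = exc

    # Daemon thread: a hung transfer must not keep the process alive.
    thread = threading.Thread(target=worker, daemon=True)
    thread.start()

    while thread.is_alive():
        thread.join(poll_interval)
        with lock:
            idle = time.monotonic() - state["last_progress"]
            done = state["bytes"]
        if thread.is_alive() and idle > stall_timeout:
            # The worker itself is not cancelled here; raising only lets the
            # caller fail fast instead of waiting forever.
            raise TransferStalled(f"no progress for {idle:.1f}s after {done} bytes")

    if state["error"] is not None:
        raise state["error"]
    return state["bytes"]
```

With paramiko this could presumably be wired up as `run_with_stall_timeout(lambda cb: sftp.get(remote_path, local_path, callback=cb), stall_timeout=300)`, where `sftp`, `remote_path`, and `local_path` are placeholders. As a coarser workaround, setting the standard `execution_timeout` argument on the task should at least stop it from running for days.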

How to reproduce

No response

Operating System

Google Cloud Composer

Versions of Apache Airflow Providers

No response

Deployment

Composer

Deployment details

No response

Anything else

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct
