Fix for cloud fetch #362

andrefurlan-db · 2024-02-21T22:09:38Z

Throw when failed to download file
Retry properly while downloading file
Add a bunch of debug logs
Prevent thread issues

TODO: http connection pools for cloud storage, proxies, etc.

Also backported to version 2

* fixes for cloud fetch Signed-off-by: Andre Furlan <andre.furlan@databricks.com> --------- Signed-off-by: Andre Furlan <andre.furlan@databricks.com> Co-authored-by: Raymond Cypher <raymond.cypher@databricks.com>

benc-db · 2024-02-21T22:11:21Z

src/databricks/sql/cloudfetch/download_manager.py

        self._shutdown_manager()
-        return None
+        raise ResultSetDownloadError(


Per the change in the comment above, there is no retry attempted?

Or is it just handled by raising the exception?

the retry is done outside this function, closer to the actual http request

Signed-off-by: Andre Furlan <andre.furlan@databricks.com>

benc-db · 2024-02-21T22:13:46Z

src/databricks/sql/cloudfetch/downloader.py

@@ -171,3 +202,40 @@ def decompress_data(compressed_data: bytes) -> bytes:
                uncompressed_data += data
                start += num_bytes
        return uncompressed_data
+
+
+def http_get_with_retry(url, max_retries=5, backoff_factor=2, download_timeout=60):


why are we implementing retry behavior here rather than using a Retry passed to the session?

agreed. It is in the TODO to also have connection pools

benc-db

Approve, but consider if we can implement with urllib3 Retry.

fixes for cloud fetch - part un (databricks#356)

7bc38d6

* fixes for cloud fetch Signed-off-by: Andre Furlan <andre.furlan@databricks.com> --------- Signed-off-by: Andre Furlan <andre.furlan@databricks.com> Co-authored-by: Raymond Cypher <raymond.cypher@databricks.com>

andrefurlan-db requested review from rcypher-databricks, arikfr, yunbodeng-db, jackyhu-db and benc-db as code owners February 21, 2024 22:09

benc-db reviewed Feb 21, 2024

View reviewed changes

bump version to 3.1.1

cef6cf3

Signed-off-by: Andre Furlan <andre.furlan@databricks.com>

andrefurlan-db force-pushed the 3.1.1 branch from 61dda59 to cef6cf3 Compare February 21, 2024 22:12

benc-db reviewed Feb 21, 2024

View reviewed changes

benc-db approved these changes Feb 21, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for cloud fetch #362

Fix for cloud fetch #362

andrefurlan-db commented Feb 21, 2024 •

edited

benc-db Feb 21, 2024

benc-db Feb 21, 2024

andrefurlan-db Feb 21, 2024

benc-db Feb 21, 2024

andrefurlan-db Feb 21, 2024

benc-db left a comment

Fix for cloud fetch #362

Are you sure you want to change the base?

Fix for cloud fetch #362

Conversation

andrefurlan-db commented Feb 21, 2024 • edited

benc-db Feb 21, 2024

Choose a reason for hiding this comment

benc-db Feb 21, 2024

Choose a reason for hiding this comment

andrefurlan-db Feb 21, 2024

Choose a reason for hiding this comment

benc-db Feb 21, 2024

Choose a reason for hiding this comment

andrefurlan-db Feb 21, 2024

Choose a reason for hiding this comment

benc-db left a comment

Choose a reason for hiding this comment

andrefurlan-db commented Feb 21, 2024 •

edited