
[v2] Bug fix and optimization in s3transfer's download #9345

Status: Draft. Wants to merge 48 commits into base: v2.
Conversation

@aemous (Contributor) commented Mar 5, 2025

Issue #, if available:

Context

This PR aims to address a bug that occurs when an IncompleteReadError is encountered while downloading a multipart AWS S3 object to a non-seekable stream (e.g. stdout). In this case, all bytes of the object with offset larger than the offset of the incomplete read get queued in a buffer, and the CLI terminates before the buffer is flushed.

The results are that (1) the object is not fully written to the stream, and (2) the CLI's memory usage grows linearly with the number of object bytes whose offset is greater than that of the incomplete read. In practice this means most of the object may be held in memory, no matter how large the object is.

The scope of the changes in this PR to s3transfer will only impact users downloading multi-part AWS S3 objects to a non-seekable stream (e.g. stdout).

Description of changes

s3transfer

  • Fix bug that occurs when an IncompleteReadError is encountered while downloading a multipart AWS S3 object to a non-seekable stream (e.g. stdout).
  • Changed DeferQueue so that it overwrites a pending write at a given offset with whichever write request carries more data. When the request that hit the incomplete read is retried, the retry overwrites the incomplete data in the queue, assuming the retry contains more data (e.g. the full part).
  • Changed DeferQueue so that if it has already dequeued an incomplete part, it will queue the subset of the bytes in the retry that were not previously dequeued.
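The two behaviors above can be sketched as follows. This is a minimal illustration, not the actual s3transfer `DeferQueue`: the `request_writes(offset, data)` method name mirrors s3transfer's API, but the internals and the simplified return value (a list of byte strings rather than write requests) are assumptions, and it assumes writes at the same offset are alternate versions of the same part rather than partially overlapping ranges.

```python
import heapq


class DeferQueue:
    """Sketch of a defer queue that releases out-of-order writes to a
    non-seekable stream in offset order. Illustrates the two PR changes:
    a retried write with more data replaces a shorter pending write at
    the same offset, and a retry overlapping already-released bytes
    queues only the not-yet-released tail."""

    def __init__(self):
        self._writes = {}           # offset -> pending data
        self._pending_offsets = []  # min-heap of offsets not yet released
        self._next_offset = 0       # next offset the stream expects

    def request_writes(self, offset, data):
        """Queue data; return the writes now releasable, in offset order."""
        if offset + len(data) <= self._next_offset:
            return []  # every byte here was already released
        if offset < self._next_offset:
            # An incomplete version of this part was already partially
            # released; keep only the bytes not previously dequeued.
            data = data[self._next_offset - offset:]
            offset = self._next_offset
        if offset in self._writes:
            # Same offset queued twice (e.g. a retried part): keep
            # whichever write carries the most data.
            if len(data) <= len(self._writes[offset]):
                return []
            self._writes[offset] = data
        else:
            self._writes[offset] = data
            heapq.heappush(self._pending_offsets, offset)
        released = []
        while (self._pending_offsets
               and self._pending_offsets[0] == self._next_offset):
            chunk = self._writes.pop(heapq.heappop(self._pending_offsets))
            released.append(chunk)
            self._next_offset += len(chunk)
        return released
```

For example, if part 1 is released incomplete (say 2 of 5 bytes), a retry of the full 5-byte part queues only the trailing 3 bytes, which then unblocks any later parts already waiting in the queue.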

Performance Benchmark Scripts

  • Added two new benchmark definitions to benchmarks.json: one simulates a standard multi-part download of a 10GB file (1,192 parts), and the other simulates the same download except an IncompleteReadError occurs (with retry) on the 32nd part.
  • Updated the benchmark harness to support loading mocked HTTP response bodies from files. Updated the performance scripts README with the updated definition schema.
  • Added support for a --debug-dir <path> argument to make benchmark-definition debugging easier. When this path is specified, all output of the child (benchmarked) processes is written to files in this directory.
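For illustration only, a benchmark definition that loads a mocked HTTP response body from a file might look roughly like the fragment below. Every field name here is an assumption; the real schema is documented in the performance scripts README referenced above, not here.

```json
{
  "name": "s3-multipart-download-incomplete-read",
  "command": ["s3", "cp", "s3://example-bucket/example-10gb-object", "-"],
  "responses": [
    {"status_code": 200, "body_file": "responses/part-body.bin"}
  ]
}
```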

Description of tests

Correctness

  • Added 2 new unit tests to cover the 2 main modifications of DeferQueue. Also updated names/documentation of existing related tests.
  • Verified that the new benchmark that simulates an incomplete read successfully writes the entire object to stdout by piping the output to a file and checking its size.
    • Before the changes to DeferQueue contained in this PR, I verified that the full object does NOT successfully get written.
  • Ran and passed all existing tests (unit, functional, integration, etc.)
  • Manually used this CLI build to successfully download a 10GB object from S3 to stdout using the default concurrency options.

Performance Testing

  • Ran the performance tests before and after the changes to DeferQueue contained in this PR and observed significant improvements in memory usage and execution time. The results are summarized below.

Summary

[Charts omitted: max_mem_issue (peak memory comparison), execution_time_mem_issue (execution-time comparison)]

Raw Data

Each cell in the table below is the average value of the metric taken over 5 iterations.

| Metric | 2.24.14 (Happy Path) | 2.24.14 (IncompleteReadError) | 2.24.14 w/ DeferQueue improvements (Happy Path) | 2.24.14 w/ DeferQueue improvements (IncompleteReadError) |
| --- | --- | --- | --- | --- |
| Max Memory (GB) | 0.7246 | 5.242 | 0.207 | 0.20836 |
| p95 Memory (GB) | 0.7101 | 5.025 | 0.207 | 0.19397 |
| Max CPU (%) | 19.72 | 11.82 | 18.42 | 18.42 |
| p95 CPU (%) | 1.5 | 1.5 | 2.5 | 2.5 |
| Total Time (s) | 179.4 | 9.848 | 111.7 | 110.18 |

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

aemous and others added 30 commits February 20, 2025 15:13
* Bump PyInstaller to 6.11.1
This includes pinning `macholib` because of this change in `PyInstaller`
that makes `macholib` a build dependency.

pyinstaller/pyinstaller#5883

---------

Co-authored-by: Github Actions <>
Co-authored-by: Steve Yoo <hssyoo@amazon.com>
Co-authored-by: Alex Shovlin <shovlia@amazon.com>
@aemous aemous requested review from aws-sdk-python-automation and a team March 5, 2025 16:24
@aemous aemous marked this pull request as ready for review March 5, 2025 16:24
@kdaily kdaily removed the request for review from aws-sdk-python-automation March 5, 2025 16:26
@aemous aemous marked this pull request as draft March 5, 2025 16:42
@ashovlin (Member) left a comment

Is it worth running this new benchmark continuously?

Mostly asking for ergonomics, since GitHub won't even show the file diff now. If we don't want to run it as part of the new suite, wondering if it'd be better to have a separate benchmarks-s3-advanced.json (or whatever naming) separately.

@aemous (Contributor, Author) commented Mar 10, 2025

> Is it worth running this new benchmark continuously?
>
> Mostly asking for ergonomics, since GitHub won't even show the file diff now. If we don't want to run it as part of the new suite, wondering if it'd be better to have a separate benchmarks-s3-advanced.json (or whatever naming) separately.

@ashovlin Agreed, that would be a good idea; I iterated on this feedback in the latest revision.

6 participants