Retry intermittent S3 download failures #594

Merged
merged 1 commit

3 participants

@jamesls
Owner

This builds on the recent work in botocore
(boto/botocore#210) to address two things:

  • Ensure that any .read() call on the streaming response of GetObject will never hang. We do this by applying a timeout to the underlying socket.
  • Catch and retry IncompleteReadError. There may be times when we don't receive all of the contents from a GetObject request, such that amount_read != the Content-Length header. Botocore now checks this and raises an exception, which the CLI now catches and retries (a sketch of this retry pattern follows below).
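A minimal sketch of the pattern those two bullets describe, assuming hypothetical helpers get_object_range() (issues the ranged GetObject and returns the streaming body) and write_chunk() (writes bytes to the local file); the real implementation is DownloadPartTask._download_part in the diff below:

import socket
from botocore.exceptions import IncompleteReadError

TOTAL_ATTEMPTS = 5   # one initial request plus up to four retries
READ_TIMEOUT = 60    # seconds an idle .read() may block before timing out

def download_with_retries(get_object_range, write_chunk, chunk_size=1024 * 1024):
    for attempt in range(TOTAL_ATTEMPTS):
        try:
            body = get_object_range()               # streaming GetObject body
            body.set_socket_timeout(READ_TIMEOUT)   # so .read() can never hang
            chunk = body.read(chunk_size)
            while chunk:
                write_chunk(chunk)
                chunk = body.read(chunk_size)
            return                                  # success, stop retrying
        except (socket.timeout, socket.error):
            continue                                # transient socket failure, retry
        except IncompleteReadError:
            continue   # amount_read != Content-Length; botocore raised, we retry
    raise RuntimeError("Maximum number of attempts exceeded: %s" % TOTAL_ATTEMPTS)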
@jamesls jamesls Retry intermittent S3 download failures
ed5999f
@toastdriven toastdriven commented on the diff
awscli/customizations/s3/tasks.py
@@ -292,17 +298,30 @@ class DownloadPartTask(object):
# Amount to read from response body at a time.
ITERATE_CHUNK_SIZE = 1024 * 1024
+ READ_TIMEOUT = 60
+ TOTAL_ATTEMPTS = 5

Where does this come from? Given our typical number of retries is 6, I'm just curious.

@jamesls Owner
jamesls added a note

I believe botocore uses 5 by default (except for special cases like ddb).
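For the counting: the loop added in this diff is written as for i in range(self.TOTAL_ATTEMPTS), so a value of 5 covers the initial request plus up to four retries, e.g.:

TOTAL_ATTEMPTS = 5
for i in range(TOTAL_ATTEMPTS):
    # attempt 0 is the initial request; attempts 1 through 4 are the retries
    print("attempt %s of %s" % (i + 1, TOTAL_ATTEMPTS))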

@toastdriven

LGTM otherwise. :shipit:

@garnaat

LGTM

@jamesls jamesls merged commit ed5999f
@jamesls jamesls deleted the branch
Commits on Jan 15, 2014
  1. @jamesls

    Retry intermittent S3 download failures

    jamesls authored
awscli/customizations/s3/tasks.py
@@ -2,9 +2,11 @@
import math
import os
import time
+import socket
import threading
from botocore.vendored import requests
+from botocore.exceptions import IncompleteReadError
from awscli.customizations.s3.utils import find_bucket_key, MD5Error, \
operate, ReadFileChunk, relative_path
@@ -21,6 +23,10 @@ class DownloadCancelledError(Exception):
pass
+class RetriesExeededError(Exception):
+ pass
+
+
def print_operation(filename, failed, dryrun=False):
"""
Helper function used to print out what an operation did and whether
@@ -292,17 +298,30 @@ class DownloadPartTask(object):
# Amount to read from response body at a time.
ITERATE_CHUNK_SIZE = 1024 * 1024
+ READ_TIMEOUT = 60
+ TOTAL_ATTEMPTS = 5

def __init__(self, part_number, chunk_size, result_queue, service,
- filename, context):
+ filename, context, open=open):
self._part_number = part_number
self._chunk_size = chunk_size
self._result_queue = result_queue
self._filename = filename
self._service = filename.service
self._context = context
+ self._open = open
def __call__(self):
+ try:
+ self._download_part()
+ except Exception as e:
+ LOGGER.debug(
+ 'Exception caught downloading byte range: %s',
+ e, exc_info=True)
+ self._context.cancel()
+ raise e
+
+ def _download_part(self):
total_file_size = self._filename.size
start_range = self._part_number * self._chunk_size
if self._part_number == int(total_file_size / self._chunk_size) - 1:
@@ -315,34 +334,42 @@ def __call__(self):
bucket, key = find_bucket_key(self._filename.src)
params = {'endpoint': self._filename.endpoint, 'bucket': bucket,
'key': key, 'range': range_param}
- try:
- LOGGER.debug("Making GetObject requests with byte range: %s",
- range_param)
- response_data, http = operate(self._service, 'GetObject',
- params)
- LOGGER.debug("Response received from GetObject")
- body = response_data['Body']
- self._write_to_file(body)
- self._context.announce_completed_part(self._part_number)
-
- message = print_operation(self._filename, 0)
- total_parts = int(self._filename.size / self._chunk_size)
- result = {'message': message, 'error': False,
- 'total_parts': total_parts}
- self._result_queue.put(result)
- except Exception as e:
- LOGGER.debug(
- 'Exception caught downloading byte range: %s',
- e, exc_info=True)
- self._context.cancel()
- raise e
+ for i in range(self.TOTAL_ATTEMPTS):
+ try:
+ LOGGER.debug("Making GetObject requests with byte range: %s",
+ range_param)
+ response_data, http = operate(self._service, 'GetObject',
+ params)
+ LOGGER.debug("Response received from GetObject")
+ body = response_data['Body']
+ self._write_to_file(body)
+ self._context.announce_completed_part(self._part_number)
+
+ message = print_operation(self._filename, 0)
+ total_parts = int(self._filename.size / self._chunk_size)
+ result = {'message': message, 'error': False,
+ 'total_parts': total_parts}
+ self._result_queue.put(result)
+ return
+ except (socket.timeout, socket.error) as e:
+ LOGGER.debug("Socket timeout caught, retrying request, "
+ "(attempt %s / %s)", i, self.TOTAL_ATTEMPTS,
+ exc_info=True)
+ continue
+ except IncompleteReadError as e:
+ LOGGER.debug("Incomplete read detected: %s, (attempt %s / %s)",
+ e, i, self.TOTAL_ATTEMPTS)
+ continue
+ raise RetriesExeededError("Maximum number of attempts exceeded: %s" %
+ self.TOTAL_ATTEMPTS)
def _write_to_file(self, body):
self._context.wait_for_file_created()
LOGGER.debug("Writing part number %s to file: %s",
self._part_number, self._filename.dest)
iterate_chunk_size = self.ITERATE_CHUNK_SIZE
- with open(self._filename.dest, 'rb+') as f:
+ body.set_socket_timeout(self.READ_TIMEOUT)
+ with self._open(self._filename.dest, 'rb+') as f:
f.seek(self._part_number * self._chunk_size)
current = body.read(iterate_chunk_size)
while current:
@@ -352,7 +379,6 @@ def _write_to_file(self, body):
self._part_number, self._filename.dest)
-
class CreateMultipartUploadTask(BasicTask):
def __init__(self, session, filename, parameters, result_queue,
upload_context):
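One small testability change above: open is now an injectable constructor argument (open=open), so unit tests can substitute a mock for the real built-in and avoid touching the filesystem. A sketch of how a test might construct the task with a fake open (the mocks here are illustrative; the actual tests are in test_tasks.py below):

import mock
from awscli.customizations.s3.tasks import DownloadPartTask

fake_open = mock.MagicMock()   # records open()/seek()/write() calls instead of doing I/O
task = DownloadPartTask(part_number=1, chunk_size=1024 * 1024,
                        result_queue=mock.Mock(), service=mock.Mock(),
                        filename=mock.Mock(), context=mock.Mock(),
                        open=fake_open)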
tests/unit/customizations/s3/fake_session.py
@@ -262,6 +262,7 @@ def get_object(self, kwargs):
else:
body = body[int(beginning):(int(end) + 1)]
mock_response = BytesIO(body)
+ mock_response.set_socket_timeout = Mock()
response_data['Body'] = mock_response
etag = self.session.s3[bucket][key]['ETag']
response_data['ETag'] = etag + '--'
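The fake response body is a plain BytesIO, which has no set_socket_timeout method, so the fake session stubs one onto the instance (the line added above). An equivalent approach, purely illustrative and not what the PR does, is a tiny subclass:

from io import BytesIO

class FakeStreamingBody(BytesIO):
    # Stand-in for botocore's streaming body in unit tests.
    def set_socket_timeout(self, timeout):
        # The real body pushes this down to the underlying socket; the
        # tests only need the call to be accepted, so record it and move on.
        self.timeout = timeout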
tests/unit/customizations/s3/test_tasks.py
@@ -14,10 +14,15 @@
import random
import threading
import mock
+import socket
+from botocore.exceptions import IncompleteReadError
+
+from awscli.customizations.s3.tasks import DownloadPartTask
from awscli.customizations.s3.tasks import MultipartUploadContext
from awscli.customizations.s3.tasks import UploadCancelledError
from awscli.customizations.s3.tasks import print_operation
+from awscli.customizations.s3.tasks import RetriesExeededError
class TestMultipartUploadContext(unittest.TestCase):
@@ -239,3 +244,56 @@ def test_print_operation(self):
filename.dest_type = 's3'
message = print_operation(filename, failed=False)
self.assertIn(r'e:\foo', message)
+
+
+class TestDownloadPartTask(unittest.TestCase):
+ def setUp(self):
+ self.result_queue = mock.Mock()
+ self.service = mock.Mock()
+ self.filename = mock.Mock()
+ self.filename.size = 10 * 1024 * 1024
+ self.filename.src = 'bucket/key'
+ self.filename.dest = 'local/file'
+ self.filename.service = self.service
+ self.filename.operation_name = 'download'
+ self.context = mock.Mock()
+ self.open = mock.MagicMock()
+
+ def test_socket_timeout_is_retried(self):
+ self.service.get_operation.return_value.call.side_effect = socket.error
+ task = DownloadPartTask(1, 1024 * 1024, self.result_queue,
+ self.service, self.filename, self.context)
+ # The mock is configured to keep raising a socket.error
+ # so we should cancel the download.
+ with self.assertRaises(RetriesExeededError):
+ task()
+ self.context.cancel.assert_called_with()
+ # And we retried the request multiple times.
+ self.assertEqual(DownloadPartTask.TOTAL_ATTEMPTS,
+ self.service.get_operation.call_count)
+
+ def test_download_succeeds(self):
+ body = mock.Mock()
+ body.read.return_value = b''
+ self.service.get_operation.return_value.call.side_effect = [
+ socket.error, (mock.Mock(), {'Body': body})]
+ context = mock.Mock()
+ task = DownloadPartTask(1, 1024 * 1024, self.result_queue,
+ self.service, self.filename, self.context,
+ self.open)
+ task()
+ self.assertEqual(self.result_queue.put.call_count, 1)
+ # And we tried twice, the first one failed, the second one
+ # succeeded.
+ self.assertEqual(self.service.get_operation.call_count, 2)
+
+ def test_incomplete_read_is_retried(self):
+ self.service.get_operation.return_value.call.side_effect = \
+ IncompleteReadError(actual_bytes=1, expected_bytes=2)
+ task = DownloadPartTask(1, 1024 * 1024, self.result_queue,
+ self.service, self.filename, self.context)
+ with self.assertRaises(RetriesExeededError):
+ task()
+ self.context.cancel.assert_called_with()
+ self.assertEqual(DownloadPartTask.TOTAL_ATTEMPTS,
+ self.service.get_operation.call_count)
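test_download_succeeds leans on mock's list-valued side_effect, which returns (or raises, for exception members) one item per call; a standalone illustration of that behavior, separate from the actual test:

import socket
import mock

op = mock.Mock()
# First call raises, second call returns a value -- the exact shape the
# retry loop is expected to absorb.
op.call.side_effect = [socket.error, 'response']

try:
    op.call()
except socket.error:
    pass
assert op.call() == 'response'
assert op.call.call_count == 2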