"Your socket connection to the server was not read from or written to within the timeout period." because of non replayable input stream #4910

psalagnac · 2024-02-09T10:32:14Z

Describe the bug

When pushing an object to S3 with PutObjectRequest failed for any reason (network, latency...), the S3 client retries the push operation. If we provided an InputStream that does not implement markSupported(), mark() and reset(), the retry will fail.

By default when creating a new S3 client, retry is enabled with one attempt.

Expected Behavior

If the implementation of markSupported() the provided InputStream returns false, we should not retry the push operation and fail fast. The retry attempt makes sense only when we have a guarantee we can read input data a second time.

We should either fail fast and raise an exception after the first failure to push, or just reject the input stream very early and always require an implementation that supports mark()/reset().

Current Behavior

Here is the stack trace raised:

software.amazon.awssdk.services.s3.model.S3Exception: Your socket connection to the server was not read from or written to within the timeout period. Idle connections will be closed. (Service: S3, Status Code: 400, Request ID: DF9X2DJMH6FN46B6, Extended Request ID: 9ch9A9wKThV//Q1tKuk8R5+0//gIeiQajbaLA/N1zWRH2tyDmEAy1a771XHhs1dqllgzP515Q1E=)

	at software.amazon.awssdk.protocols.xml.internal.unmarshall.AwsXmlPredicatedResponseHandler.handleErrorResponse(AwsXmlPredicatedResponseHandler.java:156)
	at software.amazon.awssdk.protocols.xml.internal.unmarshall.AwsXmlPredicatedResponseHandler.handleResponse(AwsXmlPredicatedResponseHandler.java:108)
	at software.amazon.awssdk.protocols.xml.internal.unmarshall.AwsXmlPredicatedResponseHandler.handle(AwsXmlPredicatedResponseHandler.java:85)
	at software.amazon.awssdk.protocols.xml.internal.unmarshall.AwsXmlPredicatedResponseHandler.handle(AwsXmlPredicatedResponseHandler.java:43)
	at software.amazon.awssdk.awscore.client.handler.AwsSyncClientHandler$Crc32ValidationResponseHandler.handle(AwsSyncClientHandler.java:95)
	at software.amazon.awssdk.core.internal.handler.BaseClientHandler.lambda$successTransformationResponseHandler$6(BaseClientHandler.java:234)
	........

Reproduction Steps

See below broken input stream that will inject a failure at the first push. It has a call to sleep() to cause a timeout error in the middle of the push operation.

private static class S3BreakerInputStream extends InputStream {
  private final byte[] array;
  private int bytesRead;

  S3BreakerInputStream(byte[] array) {
    this.array = array;
  }

  @Override
  public int read() {
    if (bytesRead == 500) {
      try {
        // Inject an error to make the push fail
        Thread.sleep(30000L);
      } catch (InterruptedException e) {
        e.printStackTrace();
      }
    }

    if (bytesRead >= array.length) {
      return -1;
    } else {
      return array[bytesRead++];
    }
  }
}

We can reproduce the bug by pushing any object using this input stream. Because of the sleep, the first push attempt will fail as expected after ~30 seconds.
Then, the retry will fail too after a ~60 seconds timeout, which is actually the socket read timeout of the S3 client.

  @Test
  public void testTimeout() throws Exception {

    String bucket = "...";
    String accessKey = "...";
    String secretKey = "...";

    S3Client s3 = S3Client.builder()
            .credentialsProvider(() -> AwsBasicCredentials.create(accessKey, secretKey))
            .serviceConfiguration(builder -> builder.pathStyleAccessEnabled(true))
            .httpClient(ApacheHttpClient.create())
            .build();

    Random random = new Random();
    int length = 1024;
    byte[] array = new byte[length];
    random.nextBytes(array);

    PutObjectRequest put = PutObjectRequest.builder()
        .bucket(bucket)
        .key("/push-test")
        .contentLength((long)length)
        .build();

    long startTime = System.currentTimeMillis();
    try {
      s3.putObject(put, RequestBody.fromInputStream(new S3BreakerInputStream(array), length));
    } finally {
      System.out.println("RUNTIME: " + (System.currentTimeMillis() - startTime));
    }
  }

The runtime of test is always a little more than 90s. My understanding is this is the sum of the 30s timeout explicitly added in the test, and the 60s of the default socket read timeout.

As a workaround, if I disable the S3 client retry attempt with .overrideConfiguration(a -> a.retryPolicy(RetryPolicy.none())), now the push fails as expected after 30 seconds.

Possible Solution

No response

Additional Information/Context

The error message has already been reported in many issue. But I found no issue where we actually point a non replayable InputStream.

AWS Java SDK version used

2.17.121

JDK version used

openjdk_11.0.19.0.103_11.65.54_x64

Operating System and version

Darwin 21.6.0 (Max OS)

The text was updated successfully, but these errors were encountered:

debora-ito · 2024-02-27T02:26:28Z

@psalagnac thank you to bringing it up. We added a task in our backlog to make this change.

sonali-2024 · 2024-08-28T16:05:00Z

What is the proposed solution for this issue, and what is the ETA for this fix?
Is there any plan to enable retry on non replayable input streams like it was supported in aws sdk v1 using the setReadLimit option?

psalagnac added bug This issue is a bug. needs-triage This issue or PR still needs to be triaged. labels Feb 9, 2024

debora-ito added needs-review This issue or PR needs review from the team. and removed needs-triage This issue or PR still needs to be triaged. labels Feb 15, 2024

debora-ito added p2 This is a standard priority issue and removed needs-review This issue or PR needs review from the team. labels Feb 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"Your socket connection to the server was not read from or written to within the timeout period." because of non replayable input stream #4910

"Your socket connection to the server was not read from or written to within the timeout period." because of non replayable input stream #4910

psalagnac commented Feb 9, 2024

debora-ito commented Feb 27, 2024

sonali-2024 commented Aug 28, 2024 •

edited

Loading

"Your socket connection to the server was not read from or written to within the timeout period." because of non replayable input stream #4910

"Your socket connection to the server was not read from or written to within the timeout period." because of non replayable input stream #4910

Comments

psalagnac commented Feb 9, 2024

Describe the bug

Expected Behavior

Current Behavior

Reproduction Steps

Possible Solution

Additional Information/Context

AWS Java SDK version used

JDK version used

Operating System and version

debora-ito commented Feb 27, 2024

sonali-2024 commented Aug 28, 2024 • edited Loading

sonali-2024 commented Aug 28, 2024 •

edited

Loading