Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HADOOP-16767 Handle non-IO exceptions in reopen() #2685

Merged
merged 1 commit into from
Feb 8, 2021

Conversation

yzhangal
Copy link
Contributor

@yzhangal yzhangal commented Feb 8, 2021

Contributed by Sergei Poganshev.

Catches Exception instead of IOException in closeStream()
and so handle exceptions such as SdkClientException by
aborting the wrapped stream. This will increase resilience
to failures, as any which occuring during stream closure
will be caught. Furthermore, because the
underlying HTTP connection is aborted, rather than closed,
it will not be recycled to cause problems on subsequent
operations.

NOTICE

Please create an issue in ASF JIRA before opening a pull request,
and you need to set the title of the pull request which starts with
the corresponding JIRA issue number. (e.g. HADOOP-XXXXX. Fix a typo in YYY.)
For more details, please see https://cwiki.apache.org/confluence/display/HADOOP/How+To+Contribute

Contributed by Sergei Poganshev.

Catches Exception instead of IOException in closeStream() 
and so handle exceptions such as SdkClientException by 
aborting the wrapped stream. This will increase resilience
to failures, as any which occuring during stream closure
will be caught. Furthermore, because the
underlying HTTP connection is aborted, rather than closed,
it will not be recycled to cause problems on subsequent
operations.
@yzhangal
Copy link
Contributor Author

yzhangal commented Feb 8, 2021

Hi @steveloughran , sorry I'm behind getting HADOOP-17338 to 2.10 branch. I have a version ready, but before that, I'd like to get HADOOP-16767 to 2.10. Here are the test results I have with both together (largely some timeout failures). Would you please help getting this one in and I will put HADOOP-17338 on top of it.

Thanks a lot.

[WARNING] Tests run: 11, Failures: 0, Errors: 0, Skipped: 2, Time elapsed: 64.07 s - in org.apache.hadoop.fs.contract.s3a.ITestS3AContractCreate
[INFO] Tests run: 18, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 171.593 s - in org.apache.hadoop.fs.contract.s3a.ITestS3AContractGetFileStatus
[INFO] Tests run: 48, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 168.231 s - in org.apache.hadoop.fs.contract.s3a.ITestS3AContractSeek
[ERROR] Tests run: 9, Failures: 0, Errors: 3, Skipped: 0, Time elapsed: 558.659 s <<< FAILURE! - in org.apache.hadoop.fs.contract.s3a.ITestS3AContractRootDir
[INFO] Tests run: 8, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 76.078 s - in org.apache.hadoop.fs.contract.s3a.ITestS3AContractRename
[INFO] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 457.171 s - in org.apache.hadoop.fs.contract.s3a.ITestS3AContractDistCp
[INFO] Tests run: 8, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 110.398 s - in org.apache.hadoop.fs.contract.s3a.ITestS3AContractMkdir
[INFO] Tests run: 8, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 65.483 s - in org.apache.hadoop.fs.contract.s3a.ITestS3AContractDelete
[INFO] Tests run: 6, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 27.233 s - in org.apache.hadoop.fs.contract.s3a.ITestS3AContractOpen
[WARNING] Tests run: 2, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 6.871 s - in org.apache.hadoop.fs.s3a.ITestS3ATemporaryCredentials
[INFO] Tests run: 12, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.107 s - in org.apache.hadoop.fs.s3a.impl.TestDirectoryMarkerPolicy
[INFO] Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 20.985 s - in org.apache.hadoop.fs.s3a.fileContext.ITestS3AFileContextStatistics
[WARNING] Tests run: 17, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 242.879 s - in org.apache.hadoop.fs.s3a.fileContext.ITestS3AFileContextURI
[INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.15 s - in org.apache.hadoop.fs.s3a.fileContext.ITestS3AFileContext
[INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 25.962 s - in org.apache.hadoop.fs.s3a.fileContext.ITestS3AFileContextUtil
[INFO] Tests run: 11, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 87.202 s - in org.apache.hadoop.fs.s3a.fileContext.ITestS3AFileContextCreateMkdir
[WARNING] Tests run: 63, Failures: 0, Errors: 0, Skipped: 3, Time elapsed: 643.64 s - in org.apache.hadoop.fs.s3a.fileContext.ITestS3AFileContextMainOperations
[INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.531 s - in org.apache.hadoop.fs.s3a.TestListing
[INFO] Tests run: 12, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.121 s - in org.apache.hadoop.fs.s3a.TestS3AExceptionTranslation
[WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 4.151 s - in org.apache.hadoop.fs.s3a.ITestS3GuardCreate
[INFO] Tests run: 10, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.917 s - in org.apache.hadoop.fs.s3a.TestSSEConfiguration
[INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 39.734 s - in org.apache.hadoop.fs.s3a.ITestS3AEncryptionSSES3
[INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 8.955 s - in org.apache.hadoop.fs.s3a.ITestS3ABlocksize
[INFO] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 18.49 s - in org.apache.hadoop.fs.s3a.ITestS3AFailureHandling
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 5, Time elapsed: 15.067 s - in org.apache.hadoop.fs.s3a.scale.ITestS3AHugeFilesByteBufferBlocks
[WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 4.267 s - in org.apache.hadoop.fs.s3a.scale.ITestS3ADeleteFilesOneByOne
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 5, Time elapsed: 14.164 s - in org.apache.hadoop.fs.s3a.scale.ITestS3AHugeFilesArrayBlocks
[WARNING] Tests run: 2, Failures: 0, Errors: 0, Skipped: 2, Time elapsed: 7.357 s - in org.apache.hadoop.fs.s3a.scale.ITestDynamoDBMetadataStoreScale
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 5, Time elapsed: 14.207 s - in org.apache.hadoop.fs.s3a.scale.ITestS3ADirectoryPerformance
[WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 4.388 s - in org.apache.hadoop.fs.s3a.scale.ITestS3ACreatePerformance
[WARNING] Tests run: 8, Failures: 0, Errors: 0, Skipped: 8, Time elapsed: 22.418 s - in org.apache.hadoop.fs.s3a.scale.ITestS3AInputStreamPerformance
[WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 4.524 s - in org.apache.hadoop.fs.s3a.scale.ITestS3AConcurrentOps
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 5, Time elapsed: 14.804 s - in org.apache.hadoop.fs.s3a.scale.ITestS3AHugeFilesSSECDiskBlocks
[WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 4.647 s - in org.apache.hadoop.fs.s3a.scale.ITestS3ADeleteManyFiles
[WARNING] Tests run: 2, Failures: 0, Errors: 0, Skipped: 2, Time elapsed: 6.706 s - in org.apache.hadoop.fs.s3a.scale.ITestLocalMetadataStoreScale
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 5, Time elapsed: 14.39 s - in org.apache.hadoop.fs.s3a.scale.ITestS3AHugeFilesDiskBlocks
[INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 7.855 s - in org.apache.hadoop.fs.s3a.ITestS3ADelayedFNF
[INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 42.084 s - in org.apache.hadoop.fs.s3a.ITestS3AEncryptionSSEKMSDefaultKey
[INFO] Tests run: 30, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 11.345 s - in org.apache.hadoop.fs.s3a.ITestS3AConfiguration
[INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 6.855 s - in org.apache.hadoop.fs.s3a.ITestS3AInconsistency
[ERROR] Tests run: 10, Failures: 3, Errors: 0, Skipped: 0, Time elapsed: 103.302 s <<< FAILURE! - in org.apache.hadoop.fs.s3a.ITestS3AEncryptionSSEC
[INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 31.618 s - in org.apache.hadoop.fs.s3a.yarn.ITestS3AMiniYarnCluster
[INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 7.258 s - in org.apache.hadoop.fs.s3a.yarn.ITestS3A
[INFO] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.81 s - in org.apache.hadoop.fs.s3a.ITestBlockingThreadPoolExecutorService
[WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 4.178 s - in org.apache.hadoop.fs.s3a.ITestS3GuardWriteBack
[INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 23.037 s - in org.apache.hadoop.fs.s3a.ITestS3ABlockOutputByteBuffer
[INFO] Tests run: 7, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 43.654 s - in org.apache.hadoop.fs.s3a.ITestS3AFileOperationCost
[WARNING] Tests run: 9, Failures: 0, Errors: 0, Skipped: 8, Time elapsed: 49.644 s - in org.apache.hadoop.fs.s3a.ITestS3GuardListConsistency
[WARNING] Tests run: 2, Failures: 0, Errors: 0, Skipped: 2, Time elapsed: 0.189 s - in org.apache.hadoop.fs.s3a.ITestS3AEncryptionSSEKMSUserDefinedKey
[WARNING] Tests run: 5, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 21.4 s - in org.apache.hadoop.fs.s3a.ITestS3ABlockOutputDisk
[INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 21.999 s - in org.apache.hadoop.fs.s3a.ITestS3ABlockOutputArray
[INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.596 s - in org.apache.hadoop.fs.s3a.TestS3AGetFileStatus
[INFO] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 15.316 s - in org.apache.hadoop.fs.s3a.ITestS3AMiscOperations
[INFO] Tests run: 31, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 221.542 s - in org.apache.hadoop.fs.s3a.ITestS3AFileSystemContract
[INFO] Tests run: 8, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.298 s - in org.apache.hadoop.fs.s3a.s3guard.TestPathMetadataDynamoDBTranslation
[WARNING] Tests run: 21, Failures: 0, Errors: 0, Skipped: 21, Time elapsed: 53.878 s - in org.apache.hadoop.fs.s3a.s3guard.ITestS3GuardToolLocal
[WARNING] Tests run: 10, Failures: 0, Errors: 0, Skipped: 10, Time elapsed: 28.464 s - in org.apache.hadoop.fs.s3a.s3guard.ITestS3GuardToolDynamoDB
[WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 4.58 s - in org.apache.hadoop.fs.s3a.s3guard.ITestS3GuardConcurrentOps
[INFO] Tests run: 10, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.127 s - in org.apache.hadoop.fs.s3a.s3guard.TestS3GuardCLI
[INFO] Tests run: 20, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.1 s - in org.apache.hadoop.fs.s3a.s3guard.TestDirListingMetadata
[INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.084 s - in org.apache.hadoop.fs.s3a.s3guard.TestS3Guard
[INFO] Tests run: 23, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 1.595 s - in org.apache.hadoop.fs.s3a.s3guard.TestNullMetadataStore
[INFO] Tests run: 24, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.544 s - in org.apache.hadoop.fs.s3a.s3guard.TestLocalMetadataStore
[INFO] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.103 s - in org.apache.hadoop.fs.s3a.ITestS3ATestUtils
[INFO] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.084 s - in org.apache.hadoop.fs.s3a.TestDataBlocks
[WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 4.197 s - in org.apache.hadoop.fs.s3a.ITestS3GuardEmptyDirs
[INFO] Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 3.287 s - in org.apache.hadoop.fs.s3a.ITestS3AAWSCredentialsProvider
[INFO] Tests run: 9, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.229 s - in org.apache.hadoop.fs.s3a.TestS3AAWSCredentialsProvider
[INFO] Tests run: 14, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 0.091 s - in org.apache.hadoop.fs.s3a.TestS3AInputPolicies
[INFO] Tests run: 18, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 150.313 s - in org.apache.hadoop.fs.s3a.performance.ITestDirectoryMarkerListing
[INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 16.538 s - in org.apache.hadoop.fs.s3a.ITestS3AEmptyDirectory
[WARNING] Tests run: 1, Failures: 0, Errors: 0, Skipped: 1, Time elapsed: 0.017 s - in org.apache.hadoop.fs.s3a.ITestS3AEncryptionAlgorithmValidation
[ERROR] Tests run: 603, Failures: 3, Errors: 3, Skipped: 95

@hadoop-yetus
Copy link

💔 -1 overall

Vote Subsystem Runtime Comment
+0 🆗 reexec 11m 38s Docker mode activated.
_ Prechecks _
+1 💚 dupname 0m 0s No case conflicting files found.
+1 💚 @author 0m 0s The patch does not contain any @author tags.
-1 ❌ test4tests 0m 0s The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
_ branch-2.10 Compile Tests _
+1 💚 mvninstall 16m 14s branch-2.10 passed
+1 💚 compile 0m 33s branch-2.10 passed
+1 💚 checkstyle 0m 24s branch-2.10 passed
+1 💚 mvnsite 0m 45s branch-2.10 passed
+1 💚 javadoc 0m 29s branch-2.10 passed
+0 🆗 spotbugs 1m 8s Used deprecated FindBugs config; considering switching to SpotBugs.
+1 💚 findbugs 1m 4s branch-2.10 passed
_ Patch Compile Tests _
+1 💚 mvninstall 0m 34s the patch passed
+1 💚 compile 0m 28s the patch passed
+1 💚 javac 0m 28s the patch passed
+1 💚 checkstyle 0m 16s the patch passed
+1 💚 mvnsite 0m 35s the patch passed
+1 💚 whitespace 0m 0s The patch has no whitespace issues.
+1 💚 javadoc 0m 20s the patch passed
+1 💚 findbugs 1m 7s the patch passed
_ Other Tests _
+1 💚 unit 0m 39s hadoop-aws in the patch passed.
+1 💚 asflicense 0m 31s The patch does not generate ASF License warnings.
38m 40s
Subsystem Report/Notes
Docker ClientAPI=1.41 ServerAPI=1.41 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2685/1/artifact/out/Dockerfile
GITHUB PR #2685
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle
uname Linux 59949fa7ed62 4.15.0-60-generic #67-Ubuntu SMP Thu Aug 22 16:55:30 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality dev-support/bin/hadoop.sh
git revision branch-2.10 / ef14e16
Default Java Oracle Corporation-1.7.0_95-b00
Test Results https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2685/1/testReport/
Max. process+thread count 195 (vs. ulimit of 5500)
modules C: hadoop-tools/hadoop-aws U: hadoop-tools/hadoop-aws
Console output https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2685/1/console
versions git=2.7.4 maven=3.3.9 findbugs=3.0.1
Powered by Apache Yetus 0.12.0 https://yetus.apache.org

This message was automatically generated.

@steveloughran steveloughran merged commit ceea431 into apache:branch-2.10 Feb 8, 2021
@yzhangal
Copy link
Contributor Author

yzhangal commented Feb 8, 2021

Thanks a lot @steveloughran !
I just uploaded HADOOP-17338 PR for 2.10 branch, but somehow it's not visible in the jira yet. Wonder if you were able to see. Thanks.

@steveloughran
Copy link
Contributor

Has yetus started on the PR? as that's what really matters. You can always just link the JIRA to the PR by hand

@yzhangal
Copy link
Contributor Author

yzhangal commented Feb 8, 2021

#2688 Seems the build failed, maybe because I uploaded the diff before your merge thus the base is different. As of now, I don't see your merge of HADOOP-16767. I will rebase to the latest 2.10 branch once your merge shows up and recreate the PR. Thanks Steve.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants