Skip to content

[SPARK-47008][CORE] Added Hadoops fileSystems hasPathCapability check to avoid FileNotFoundException(s) when using S3 Express One Zone Storage#48497

Closed
leovegas wants to merge 22 commits intoapache:masterfrom
leovegas:feature/SPARK-47008-add-hasPathCapability-check
Closed

[SPARK-47008][CORE] Added Hadoops fileSystems hasPathCapability check to avoid FileNotFoundException(s) when using S3 Express One Zone Storage#48497
leovegas wants to merge 22 commits intoapache:masterfrom
leovegas:feature/SPARK-47008-add-hasPathCapability-check

Conversation

@leovegas
Copy link

@leovegas leovegas commented Oct 16, 2024

What changes were proposed in this pull request?

Jira SPARK-47008

Added check for inconsistent directory listings through Hadoop fs.hasPathCapability(path, "fs.capability.directory.listing.inconsistent") in the following method:
org.apache.spark.util.Utils#fetchHcfsFile

In org.apache.spark.deploy.SparkHadoopUtil#listLeafStatuses
treewalk logic replaced by Hadoop's fs.listFiles method.

Why are the changes needed?

Spark to support S3 Express One Zone Storage
Details in this Jira

Does this PR introduce any user-facing change?

No

How was this patch tested?

Unit tests added to

  1. test("SPARK-47008: ...) core/src/test/scala/org/apache/spark/util/UtilsSuite.scala
  2. test("SPARK-47008: ...) core/src/test/scala/org/apache/spark/deploy/SparkHadoopUtilSuite.scala

Was this patch authored or co-authored using generative AI tooling?

No

Leonid Timofeev and others added 22 commits May 17, 2024 08:56
…d FileNotFoundException(s) when using S3 Express One Zone Storage.
…apability-check' into feature/SPARK-47008-add-hasPathCapability-check

# Conflicts:
#	core/src/main/scala/org/apache/spark/deploy/SparkHadoopUtil.scala
…apability-check' into feature/SPARK-47008-add-hasPathCapability-check
@github-actions github-actions bot added the CORE label Oct 16, 2024
@leovegas leovegas changed the title Feature/spark 47008 add has path capability check [SPARK-47008][CORE] Added Hadoops fileSystems hasPathCapability check to avoid FileNotFoundException(s) when using S3 Express One Zone Storage Oct 16, 2024
@HyukjinKwon
Copy link
Member

can you fill the PR description please?

@github-actions
Copy link

We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!

@github-actions github-actions bot added the Stale label Jan 26, 2025
@github-actions github-actions bot closed this Jan 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants