[DOP-8157] Add SparkS3 troubleshooting guide #124
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Change Summary
spark-hadoop-cloud
library which should be used for Spark on S3. It already includeshadoop-aws
, but its versioning is the same as Spark version. In this case we don't need to pass Hadoop libraries version explicitly, because it is already built with the same version the Spark is compiled with. So I replacedhadoop-aws
withspark-hadoop-cloud
, and updatedSparkS3.get_options
signature and tests.Related issue number
Checklist
docs/changelog/next_release/<pull request or issue id>.<change type>.rst
file added describing change(see CONTRIBUTING.rst for details.)