Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-26766][CORE] Remove the list of filesystems from HadoopDelegationTokenProvider.obtainDelegationTokens #23698

Closed
wants to merge 6 commits into from

Conversation

gaborgsomogyi
Copy link
Contributor

@gaborgsomogyi gaborgsomogyi commented Jan 30, 2019

What changes were proposed in this pull request?

Delegation token providers interface now has a parameter fileSystems but this is needed only for HadoopFSDelegationTokenProvider.

In this PR I've addressed this issue in the following way:

  • Removed fileSystems parameter from HadoopDelegationTokenProvider
  • Moved YarnSparkHadoopUtil.hadoopFSsToAccess into HadoopFSDelegationTokenProvider
  • Moved spark.yarn.stagingDir into core
  • Moved spark.yarn.access.namenodes into core and renamed to spark.kerberos.access.namenodes
  • Moved spark.yarn.access.hadoopFileSystems into core and renamed to spark.kerberos.access.hadoopFileSystems

How was this patch tested?

Existing unit tests.

@SparkQA
Copy link

SparkQA commented Jan 30, 2019

Test build #101891 has finished for PR 23698 at commit 69646eb.

  • This patch fails to build.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Jan 30, 2019

Test build #101895 has finished for PR 23698 at commit 6eb9ab1.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Copy link
Contributor

@vanzin vanzin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You'll need to update the docs that reference the settings you're renaming.

 * Config parameter deprecation
 * Return defaultFS all the time
 * get("spark.master", null)
@SparkQA
Copy link

SparkQA commented Jan 31, 2019

Test build #101959 has finished for PR 23698 at commit 07ff492.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Jan 31, 2019

Test build #101960 has finished for PR 23698 at commit e73250d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin
Copy link
Contributor

vanzin commented Jan 31, 2019

Still missing the doc update. running-on-yarn.md still mentions the old config, and that information is not restricted to YARN anymore.

* Doc update
* Param deprecation
@gaborgsomogyi
Copy link
Contributor Author

gaborgsomogyi commented Feb 1, 2019

Still missing the doc update.

Yeah, left from the last commit :/ Now updated.

@SparkQA
Copy link

SparkQA commented Feb 1, 2019

Test build #101994 has finished for PR 23698 at commit 32c5d5d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

docs/running-on-yarn.md Outdated Show resolved Hide resolved
* Simplified hadoopFSsToAccess
* Moved doc to generic area
@SparkQA
Copy link

SparkQA commented Feb 7, 2019

Test build #102077 has finished for PR 23698 at commit 1c87238.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin
Copy link
Contributor

vanzin commented Feb 7, 2019

retest this please

@SparkQA
Copy link

SparkQA commented Feb 8, 2019

Test build #102082 has finished for PR 23698 at commit 1c87238.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@vanzin
Copy link
Contributor

vanzin commented Feb 8, 2019

Merging to master.

@asfgit asfgit closed this in d0443a7 Feb 8, 2019
jackylee-ch pushed a commit to jackylee-ch/spark that referenced this pull request Feb 18, 2019
…ionTokenProvider.obtainDelegationTokens

## What changes were proposed in this pull request?

Delegation token providers interface now has a parameter `fileSystems` but this is needed only for `HadoopFSDelegationTokenProvider`.

In this PR I've addressed this issue in the following way:
* Removed `fileSystems` parameter from `HadoopDelegationTokenProvider`
* Moved `YarnSparkHadoopUtil.hadoopFSsToAccess` into `HadoopFSDelegationTokenProvider`
* Moved `spark.yarn.stagingDir` into core
* Moved `spark.yarn.access.namenodes` into core and renamed to `spark.kerberos.access.namenodes`
* Moved `spark.yarn.access.hadoopFileSystems` into core and renamed to `spark.kerberos.access.hadoopFileSystems`

## How was this patch tested?

Existing unit tests.

Closes apache#23698 from gaborgsomogyi/SPARK-26766.

Authored-by: Gabor Somogyi <gabor.g.somogyi@gmail.com>
Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
3 participants