Skip to content

[SPARK-33824][PYTHON][DOCS][FOLLOW-UP] Clarify about PYSPARK_DRIVER_PYTHON and spark.yarn.appMasterEnv.PYSPARK_PYTHON#30938

Closed
HyukjinKwon wants to merge 1 commit intoapache:masterfrom
HyukjinKwon:SPARK-33824-followup
Closed

[SPARK-33824][PYTHON][DOCS][FOLLOW-UP] Clarify about PYSPARK_DRIVER_PYTHON and spark.yarn.appMasterEnv.PYSPARK_PYTHON#30938
HyukjinKwon wants to merge 1 commit intoapache:masterfrom
HyukjinKwon:SPARK-33824-followup

Conversation

@HyukjinKwon
Copy link
Member

@HyukjinKwon HyukjinKwon commented Dec 27, 2020

What changes were proposed in this pull request?

This PR proposes to clarify:

  • PYSPARK_DRIVER_PYTHON should not be set for cluster modes in YARN and Kubernates.
  • spark.yarn.appMasterEnv.PYSPARK_PYTHON is not required in YARN. This is just another way to set PYSPARK_PYTHON that is specific for a Spark application.

Why are the changes needed?

To clarify what's required and not.

Does this PR introduce any user-facing change?

Yes, this is a user-facing doc change.

How was this patch tested?

Manually tested.

Note that this credits to @gaborgsomogyi who actually tested and raised a doubt about this offline to me.
I also manually tested all again to double check.

@HyukjinKwon
Copy link
Member Author

cc @srowen, @zero323 and @viirya would you mind taking a quick look please?

@SparkQA
Copy link

SparkQA commented Dec 27, 2020

Test build #133408 has finished for PR 30938 at commit 4d08638.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA
Copy link

SparkQA commented Dec 27, 2020

Kubernetes integration test starting
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37999/

@SparkQA
Copy link

SparkQA commented Dec 27, 2020

Kubernetes integration test status success
URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/37999/

@HyukjinKwon
Copy link
Member Author

Thanks guys, merged to master and branch-3.1.

HyukjinKwon pushed a commit that referenced this pull request Dec 28, 2020
…YTHON and spark.yarn.appMasterEnv.PYSPARK_PYTHON

### What changes were proposed in this pull request?

This PR proposes to clarify:
- `PYSPARK_DRIVER_PYTHON` should not be set for cluster modes in YARN and Kubernates.
- `spark.yarn.appMasterEnv.PYSPARK_PYTHON` is not required in YARN. This is just another way to set `PYSPARK_PYTHON` that is specific for a Spark application.

### Why are the changes needed?

To clarify what's required and not.

### Does this PR introduce _any_ user-facing change?

Yes, this is a user-facing doc change.

### How was this patch tested?

Manually tested.

Note that this credits to gaborgsomogyi who actually tested and raised a doubt about this offline to me.
I also manually tested all again to double check.

Closes #30938 from HyukjinKwon/SPARK-33824-followup.

Authored-by: Gabor Somogyi <gabor.g.somogyi@gmail.com>
Signed-off-by: HyukjinKwon <gurwls223@apache.org>
(cherry picked from commit 678294d)
Signed-off-by: HyukjinKwon <gurwls223@apache.org>
@HyukjinKwon HyukjinKwon deleted the SPARK-33824-followup branch January 4, 2022 00:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants