[SPARK-47795][K8S][DOCS] Supplement the doc of job schedule for K8S by beliefer · Pull Request #45982 · apache/spark

beliefer · 2024-04-10T09:39:26Z

What changes were proposed in this pull request?

This PR propose to supplement the doc of job schedule for K8S.

Why are the changes needed?

Spark document missing the description of job schedule for K8S.

Does this PR introduce any user-facing change?

'No'.
Just update document.

How was this patch tested?

Manual tests.

Was this patch authored or co-authored using generative AI tooling?

'No'.

dongjoon-hyun · 2024-04-10T09:43:11Z

docs/job-scheduling.md

This line change is irrelevant to K8s.

Yes.
Because the link is not accurate even if it was not related to K8S.
Do you want me revert it?

Yes, please handle this separately, @beliefer .
As you know, I'm the release manager of Apache Spark 3.4.3 (for next Monday).
I can backport this fix to branch-3.4 as a part of Apache Spark 3.4.3.

dongjoon-hyun · 2024-04-10T09:47:10Z

docs/job-scheduling.md

This looks wrong to me because there is actually nothing to follow in the future work section. May I ask what do you want to say here?

How about In K8S mode, K8S doesn't support external shuffle service yet. ?

dongjoon-hyun

Is this a required change, @beliefer?

beliefer · 2024-04-10T10:12:59Z

Is this a required change, @beliefer?

I think we should add these document so as follows the other cluster mangers.

beliefer · 2024-04-10T10:13:19Z

cc @yaooqinn @LuciferYang

yaooqinn · 2024-04-10T12:28:11Z

I didn't even notice this page or section. The dropdown from the Navi Bar is enough for me.

Do we support scheduling jobs across applications? It's odd to me.

yaooqinn · 2024-04-10T12:33:36Z

Nit: Use K8s instead of K8S, the former is the official abbreviation

beliefer · 2024-04-11T02:28:25Z

Do we support scheduling jobs across applications? It's odd to me.

This section is about scheduling across applications.
Scheduling Within an Application section is related to jobs.

dongjoon-hyun · 2024-04-11T04:56:04Z

docs/job-scheduling.md

Given the content, ### Caveats section would be better than here.

docs/job-scheduling.md

Co-authored-by: Kent Yao <yao@apache.org>

dongjoon-hyun

+1, LGTM. Thank you, @beliefer .

dongjoon-hyun · 2024-04-11T15:26:21Z

Let's wait for @yaooqinn 's final sign-off, @beliefer .

bjornjorgensen · 2024-04-11T16:11:03Z

docs/job-scheduling.md

+### Caveats
+
+- In [standalone mode](spark-standalone.html), without explicitly setting `spark.executor.cores`, each executor will get all the available cores of a worker. In this case, when dynamic allocation enabled, spark will possibly acquire much more executors than expected. When you want to use dynamic allocation in [standalone mode](spark-standalone.html), you are recommended to explicitly set cores for each executor before the issue [SPARK-30299](https://issues.apache.org/jira/browse/SPARK-30299) got fixed.
+- In [K8s mode](running-on-kubernetes.html), we can't using this feature by set `spark.shuffle.service.enabled` to `true` due to Spark on K8s doesn't support external shuffle service yet.


In K8s mode, we cannot use this feature by setting spark.shuffle.service.enabled to true because Spark on K8s does not yet support the external shuffle service.

bjornjorgensen · 2024-04-11T16:17:46Z

docs/job-scheduling.md

+* **K8s:** The same as the situation with Yarn, please refer to the description of Yarn above. Furthermore,
+  Spark on K8s offers higher priority versions of spark.kubernetes.executor.limit.cores and
+  spark.kubernetes.executor.request.cores than spark.executor.cores. For more information, see the
+  [K8s Spark Properties](running-on-kubernetes.html#spark-properties).


spark.kubernetes.executor.limit.cores and
spark.kubernetes.executor.request.cores than spark.executor.cores -> spark.kubernetes.executor.limit.cores and
spark.kubernetes.executor.request.cores than spark.executor.cores

yaooqinn · 2024-04-12T01:55:28Z

Merged to master

Thank you @beliefer @dongjoon-hyun @bjornjorgensen

beliefer · 2024-04-12T04:56:36Z

@yaooqinn @dongjoon-hyun @bjornjorgensen Thank you!

github-actions bot added the DOCS label Apr 10, 2024

dongjoon-hyun reviewed Apr 10, 2024

View reviewed changes

beliefer changed the title ~~[SPARK-47795][DOC] Supplement the doc of job schedule for K8S~~ [SPARK-47795][DOCS] Supplement the doc of job schedule for K8S Apr 10, 2024

beliefer force-pushed the SPARK-47795 branch 2 times, most recently from d6486d5 to 09ecc74 Compare April 11, 2024 02:24

dongjoon-hyun reviewed Apr 11, 2024

View reviewed changes

beliefer added 3 commits April 11, 2024 14:39

[SPARK-47795][DOC] Supplement the doc of job schedule for K8S

5808dd4

u

e5fb9c2

u

ab080ba

beliefer force-pushed the SPARK-47795 branch from 09ecc74 to ab080ba Compare April 11, 2024 06:55

beliefer changed the title ~~[SPARK-47795][DOCS] Supplement the doc of job schedule for K8S~~ [SPARK-47795][KUBERNETES][DOCS] Supplement the doc of job schedule for K8S Apr 11, 2024

yaooqinn reviewed Apr 11, 2024

View reviewed changes

docs/job-scheduling.md Outdated Show resolved Hide resolved

yaooqinn reviewed Apr 11, 2024

View reviewed changes

docs/job-scheduling.md Outdated Show resolved Hide resolved

beliefer and others added 2 commits April 11, 2024 20:04

Update docs/job-scheduling.md

ef72667

Co-authored-by: Kent Yao <yao@apache.org>

Update docs/job-scheduling.md

d1e31b0

Co-authored-by: Kent Yao <yao@apache.org>

dongjoon-hyun approved these changes Apr 11, 2024

View reviewed changes

dongjoon-hyun changed the title ~~[SPARK-47795][KUBERNETES][DOCS] Supplement the doc of job schedule for K8S~~ [SPARK-47795][K8S][DOCS] Supplement the doc of job schedule for K8S Apr 11, 2024

bjornjorgensen reviewed Apr 11, 2024

View reviewed changes

u

4d4dc77

yaooqinn approved these changes Apr 12, 2024

View reviewed changes

yaooqinn closed this in de20791 Apr 12, 2024

Comments

Conversation

beliefer commented Apr 10, 2024

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

beliefer commented Apr 10, 2024

Uh oh!

beliefer commented Apr 10, 2024

Uh oh!

yaooqinn commented Apr 10, 2024

Uh oh!

yaooqinn commented Apr 10, 2024

Uh oh!

beliefer commented Apr 11, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun commented Apr 11, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

yaooqinn commented Apr 12, 2024

Uh oh!

beliefer commented Apr 12, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants