[SPARK-48210][DOC] Modify the description of whether dynamic partitioning is enabled in the "Stage Level Scheduling Overview" #46496
base: master
Conversation
@cloud-fan Hi, can you help me review this PR?
cc @mridulm and @tgravescs
Sorry, this description is very hard to read. I think you are trying to say that stage level scheduling isn't supported on K8s and YARN when dynamic allocation is disabled? When you say dynamic partitions, do you mean dynamic allocation of executors? That is not true; we added support: https://issues.apache.org/jira/browse/SPARK-45495
The description as-is is correct. When dynamic allocation is off, you can specify a different resource profile that changes the task requirements but uses the existing executors.
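To make that behavior concrete, here is a minimal, self-contained sketch in plain Scala (not the Spark API; `Executor` and `tasksPerExecutor` are hypothetical names for illustration) of the idea that, with dynamic allocation off, a stage-level task resource profile only changes how many tasks fit on each of the executors requested at startup:

```scala
// Hypothetical model: an executor with a fixed number of cores,
// as requested at application startup (dynamic allocation disabled).
case class Executor(cores: Int)

// How many concurrent task slots the executor offers for a given
// per-task CPU requirement.
def tasksPerExecutor(executor: Executor, taskCpus: Int): Int =
  executor.cores / taskCpus

val startupExecutor = Executor(cores = 8)

// Default profile: 1 CPU per task -> 8 concurrent tasks per executor.
val defaultSlots = tasksPerExecutor(startupExecutor, taskCpus = 1)

// A stage-level profile asking for 4 CPUs per task -> 2 concurrent
// tasks, still on the very same executor; no new executors are requested.
val stageSlots = tasksPerExecutor(startupExecutor, taskCpus = 4)
```

The executors themselves are unchanged; only the per-stage task packing differs.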
@tgravescs Thank you for your reply. Can you help me check my configuration, code, and the error messages returned? My settings are: My code is: val rdd = sc.range(0, 9) Return info: Is there a problem with my understanding?
What version of Spark are you using? The issue I pointed to was fixed in 4.0.0 and 3.5.1. The check and error that is on the main branch should read:
I'm guessing you are using a version of Spark before that change.
@tgravescs
What changes were proposed in this pull request?
The "Stage Level Scheduling Overview" sections in running-on-yarn and running-on-kubernetes describe this behavior in a way that is inconsistent with the code implementation.
In running-on-yarn:
'When dynamic allocation is disabled: It allows users to specify different task resource requirements at the stage level and will use the same executors requested at startup.'
But the implementation is:
Class: ResourceProfileManager
Func: isSupported

```scala
private[spark] def isSupported(rp: ResourceProfile): Boolean = {
  assert(master != null)
  if (rp.isInstanceOf[TaskResourceProfile] && !dynamicEnabled) {
    if ((notRunningUnitTests || testExceptionThrown) &&
        !(isStandaloneOrLocalCluster || isYarn || isK8s)) {
      throw new SparkException("TaskResourceProfiles are only supported for Standalone, " +
        "Yarn and Kubernetes cluster for now when dynamic allocation is disabled.")
    }
  }
  // ...
}
```
The judgment in this code is that it does not support TaskResourceProfile on YARN and K8s when dynamic partitioning is disabled. The description in the document does not match, so the document needs to be modified.
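The boolean condition in `isSupported` can be traced with a standalone sketch. The function below is hypothetical (it mirrors the quoted check, but omits the test-only `notRunningUnitTests || testExceptionThrown` guard and returns `false` where the real code throws `SparkException`):

```scala
// Hypothetical standalone model of the quoted check; not Spark's real class.
def isTaskProfileSupported(isTaskResourceProfile: Boolean,
                           dynamicEnabled: Boolean,
                           isStandaloneOrLocalCluster: Boolean,
                           isYarn: Boolean,
                           isK8s: Boolean): Boolean = {
  if (isTaskResourceProfile && !dynamicEnabled &&
      !(isStandaloneOrLocalCluster || isYarn || isK8s)) {
    false // the real code throws SparkException here
  } else {
    true
  }
}
```

Traced this way, a TaskResourceProfile with dynamic allocation disabled passes the check on Standalone, YARN, and K8s, and is rejected only for other cluster managers, which matches the wording of the exception message.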
Why are the changes needed?
This description is a bit misleading for users.
Does this PR introduce any user-facing change?
How was this patch tested?
Was this patch authored or co-authored using generative AI tooling?