
[SPARK-33446][CORE] Add config spark.executor.coresOverhead #30370

Closed

Conversation

warrenzhu25
Contributor

What changes were proposed in this pull request?

Add config spark.executor.coresOverhead

Why are the changes needed?

This handles a mismatch in the memory-per-core ratio between the executor and the underlying physical machines or VMs.
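
For illustration, a minimal sketch of how the proposed setting would sit alongside the existing executor configs. The key spark.executor.coresOverhead is only what this PR proposes (not part of released Spark), and the values are the ones discussed in this thread, not recommendations:

```scala
import org.apache.spark.SparkConf

// Sketch only: spark.executor.coresOverhead is the config proposed in this PR
// and is not part of released Spark; the values shown are illustrative.
val conf = new SparkConf()
  .setAppName("coresOverheadSketch")
  .set("spark.executor.cores", "1")          // cores counted for task scheduling
  .set("spark.executor.memory", "6g")        // executor heap size
  .set("spark.executor.coresOverhead", "1")  // proposed: extra cores requested from the cluster, not scheduled
```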

Does this PR introduce any user-facing change?

Yes, a new config spark.executor.coresOverhead is added.

How was this patch tested?

Covered by unit tests.

@AmplabJenkins

Can one of the admins verify this patch?

@mridulm
Contributor

mridulm commented Nov 13, 2020

I am not sure I understand what the use case here is.

To answer the JIRA example: currently we can specify 1 core and 6 GB, even if the underlying system allocates in 1 core / 3 GB units.

If the underlying resource allocation is in blocks of 1 core / 3 GB, then requesting a 6 GB container will result in wasting 1 core. Do we have cases where the cluster only considers cores and not memory? (The reverse used to be the case in YARN about a decade back, where allocation was based only on memory, with cores inferred.)

To give a slightly different example: if memory is allocated in multiples of 1 GB, asking for a 1.5 GB executor gives you a 2 GB container, with Xmx set to 1.5 GB and the additional 0.5 GB 'ignored' [1].
Similarly, asking for a 6 GB / 1 core executor should give you a 6 GB / 2 core container, where we don't use 1 core.

[1] Purposefully ignoring memory overhead for simplicity of explanation.
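
A toy sketch of the rounding described above, assuming the hypothetical 1 core / 3 GB allocation block from this example (not a real cluster setting):

```scala
// Round a Spark request up to the cluster's allocation block, as in the
// example above (block = 1 core / 3 GB). Numbers are illustrative only.
case class Request(cores: Int, memoryGb: Double)

def roundToBlock(req: Request, blockCores: Int = 1, blockMemGb: Double = 3.0): Request = {
  // The container must satisfy both dimensions, so take the larger multiple.
  val blocks = math.max(
    math.ceil(req.cores.toDouble / blockCores),
    math.ceil(req.memoryGb / blockMemGb)
  ).toInt
  Request(blocks * blockCores, blocks * blockMemGb)
}

// Asking for 1 core / 6 GB yields a 2 core / 6 GB container; Spark schedules
// tasks against the requested 1 core, so the extra core goes unused.
println(roundToBlock(Request(cores = 1, memoryGb = 6.0)))  // Request(2,6.0)
```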

@warrenzhu25
Contributor Author

Your understanding is correct. For your example, if we request 1 core + 1 coreOverhead and 6 GB, then this executor can only run 1 task. The coreOverhead won't be considered for task scheduling, so we still have the full 6 GB for that 1 task and its partition.
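
Put as a sketch (this is one reading of the proposal, not merged Spark code), the scheduler would keep counting only spark.executor.cores for task slots, while the container request would add the overhead:

```scala
// Hypothetical sketch of the proposed semantics; the arithmetic below is an
// assumption based on this comment, not actual Spark code.
val executorCores = 1  // spark.executor.cores: used for task scheduling
val coresOverhead = 1  // proposed spark.executor.coresOverhead: not scheduled
val taskCpus      = 1  // spark.task.cpus

val containerCores = executorCores + coresOverhead  // requested from the cluster
val taskSlots      = executorCores / taskCpus       // what the scheduler sees

println(s"request $containerCores cores, run at most $taskSlots task(s)")
```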

@mridulm
Contributor

mridulm commented Nov 13, 2020

Given this, why do we want to add a cores overhead? What is the use case for it?

@warrenzhu25
Contributor Author

We want 2 cores and 6 GB, but with only 1 task allocated. Then we can guarantee that the 1 task can use the full 6 GB, plus the extra CPU time.

@mridulm
Contributor

mridulm commented Nov 14, 2020

I am not sure I follow.

  • If you want an executor with 2 cores and 6 GB, you can allocate it with the existing configs.
  • If you want an executor with 1 core and 6 GB, you can do the same; if the underlying cluster allocates in 1 core / 3 GB blocks, 1 core will be wasted, which follows from the requirement here.
  • If you want 1 core and 6 GB, then setting a cores overhead will not help, since it is an underlying cluster issue that you can't get exactly this, if I understood it properly (and extrapolating based on what YARN used to do).

In other words, the Spark application requests what it needs, and the cluster provides the next higher multiple that satisfies the requirements. If this means extra memory or additional cores, Spark won't use them.

I am trying to understand what the actual use case is, and why the existing configs won't help. Thanks!

@warrenzhu25
Contributor Author

I want an executor with 2 cores and 6 GB, but with only 1 core used for task allocation, which means at most 1 task can run on this executor. If I use the existing configs, there would be at most 2 tasks. I want to give each task 6 GB of memory, but also use the extra CPU time for GC or other things.

@mridulm
Contributor

mridulm commented Nov 14, 2020

GC is handled by the JVM; we don't need an additional core for it.
Without a good use case, I am not in favor of adding additional configurations.

Having said that, if the requirement is strictly what you mentioned, allocate 2 cores and 6 GB, run only 1 task, and leave 1 core around for "other things" (let me assume some background processing? Not sure what is happening there), then you can use spark.task.cpus=2.
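
For reference, a minimal sketch of that suggestion using only existing configs, with the values from this thread:

```scala
import org.apache.spark.SparkConf

// Existing configs only: 2 cores and 6 GB per executor, with each task
// requiring 2 CPUs, so at most one task runs per executor and the second
// core is left for the JVM and background work.
val conf = new SparkConf()
  .setAppName("taskCpusSketch")
  .set("spark.executor.cores", "2")
  .set("spark.executor.memory", "6g")
  .set("spark.task.cpus", "2")
```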

@warrenzhu25
Contributor Author

I'll try to use spark.task.cpus.
