Skip to content

[SPIP][POC] Driver scaling: parallel schedulers#40077

Closed
ozhembr wants to merge 1 commit intoapache:branch-3.1from
ozhembr:spip_driver_scaling_parallel_schedulers
Closed

[SPIP][POC] Driver scaling: parallel schedulers#40077
ozhembr wants to merge 1 commit intoapache:branch-3.1from
ozhembr:spip_driver_scaling_parallel_schedulers

Conversation

@ozhembr
Copy link

@ozhembr ozhembr commented Feb 18, 2023

What changes were proposed in this pull request?

SPIP: https://docs.google.com/document/d/1_MVEpGxz6U_CNqKArR1M1l2oP-3I7O67grfwPtniLaA/edit?usp=sharing

POC of scaling Spark Driver via parallel schedulers.
Uses multiple groups of CoarseGrainedSchedulerBackend, TaskSchedulerImpl

Why are the changes needed?

Low performance of Spark Driver with multiple large jobs.

Does this PR introduce any user-facing change?

Configs for enabling parallel schedulers:
spark.driver.schedulers.parallelism - number of parallel schedulers, no value or <= 1 will disable parallelism

How was this patch tested?

Comparison tests with spark-sql processes of same parallelism level

@hvanhovell
Copy link
Contributor

hvanhovell commented Feb 19, 2023

Before we try to use multiple task schedulers, why not fix the parallelism issues in the existing one. Moving from a locking everywhere approach, to an event loop should yield quite a throughput improvement.

@ozhembr
Copy link
Author

ozhembr commented Feb 21, 2023

@hvanhovell it won't be enough, we still need to scale communication with executors

@github-actions
Copy link

github-actions bot commented Jun 2, 2023

We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!

@github-actions github-actions bot added the Stale label Jun 2, 2023
@github-actions github-actions bot closed this Jun 3, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants