-
Notifications
You must be signed in to change notification settings - Fork 1.5k
[multistage] On Multistage Engine Query Parallelism #15842
Copy link
Copy link
Open
Labels
multi-stageRelated to the multi-stage query engineRelated to the multi-stage query enginestaleNo activity for an extended periodNo activity for an extended period
Description
MSE Query Parallelism as it stands is quite complicated for users to understand. There are some other issues with it too:
- Options like
stageParallelismcan or cannot apply based on Runtime conditions which are complicated to understand for users. - There are overlapping responsibilities. e.g. partition_parallelism is largely the same as stageParallelism.
- Names are confusing. e.g. "partition_size" controls the number of workers in the leaf stage.
- ... etc.
This is covered in detail in the following doc which also proposes a new behavior that the new optimizer will follow: https://docs.google.com/document/d/1h_IgCiUU4u0xQQ6lz4ZhzH0htd6NaJV8lFBE59MeQZA/edit?tab=t.0#heading=h.aq4kaj522uyn
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
multi-stageRelated to the multi-stage query engineRelated to the multi-stage query enginestaleNo activity for an extended periodNo activity for an extended period
Type
Fields
Give feedbackNo fields configured for issues without a type.