
[FLINK-25226][doc] Add documentation about the AdaptiveBatchScheduler #18757

Closed
wants to merge 1 commit

Conversation

wanglijie95
Contributor

What is the purpose of the change

Add documentation about the AdaptiveBatchScheduler

Verifying this change

Document change without any test coverage.

Does this pull request potentially affect one of the following parts:

  • Dependencies (does it add or upgrade a dependency): (no)
  • The public API, i.e., is any changed class annotated with @Public(Evolving): (no)
  • The serializers: (no)
  • The runtime per-record code paths (performance sensitive): (no)
  • Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn, ZooKeeper: (no)
  • The S3 file system connector: (no)

Documentation

  • Does this pull request introduce a new feature? (no)
  • If yes, how is the feature documented? (not applicable)

@flinkbot
Collaborator

Thanks a lot for your contribution to the Apache Flink project. I'm the @flinkbot. I help the community
to review your pull request. We will use this comment to track the progress of the review.

Automated Checks

Last check on commit 8f1fbaa (Mon Feb 14 13:57:17 UTC 2022)

Warnings:

  • This pull request references an unassigned Jira ticket. According to the code contribution guide, tickets need to be assigned before starting with the implementation work.

Mention the bot in a comment to re-run the automated checks.

Review Progress

  • ❓ 1. The [description] looks good.
  • ❓ 2. There is [consensus] that the contribution should go into Flink.
  • ❓ 3. Needs [attention] from.
  • ❓ 4. The change fits into the overall [architecture].
  • ❓ 5. Overall code [quality] is good.

Please see the Pull Request Review Guide for a full explanation of the review process.


The Bot is tracking the review progress through labels. Labels are applied according to the order of the review items. For consensus, approval by a Flink committer or PMC member is required.

Bot commands

The @flinkbot bot supports the following commands:

  • @flinkbot approve description to approve one or more aspects (aspects: description, consensus, architecture and quality)
  • @flinkbot approve all to approve all aspects
  • @flinkbot approve-until architecture to approve everything until architecture
  • @flinkbot attention @username1 [@username2 ..] to require somebody's attention
  • @flinkbot disapprove architecture to remove an approval you gave earlier

@flinkbot
Collaborator

flinkbot commented Feb 14, 2022

CI report:

Bot commands

The @flinkbot bot supports the following commands:
  • @flinkbot run azure re-run the last Azure build

Contributor

@zhuzhurk left a comment

Thanks for adding docs for adaptive batch scheduler! @wanglijie95
I have a few comments. Please take a look.

docs/content/docs/deployment/adaptive_batch_scheduler.md (4 resolved review threads)

### Limitations

- **ALL-EDGES-BLOCKING batch jobs only**: The first version of Adaptive Batch Scheduler only supports ALL-EDGES-BLOCKING batch jobs only.
Contributor

ALL-EDGES-BLOCKING -> ALL-EXCHANGES-BLOCKING

And maybe add a link to the config option "execution.batch-shuffle-mode" for reference?

Contributor

The first version of -> At the moment,

Contributor

there are 2 only and either should be removed

Member

What does this mean? (from the user perspective)

Contributor Author

+1 for @zhuzhurk 's comment. Just tell users that the adaptive batch scheduler only supports the case where execution.batch-shuffle-mode is ALL-EXCHANGES-BLOCKING, and link to the config pages.
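In flink-conf.yaml terms, the prerequisite discussed here would look like the following sketch (the option name is the one quoted in this thread; treat the exact enum value as something to verify against the linked config page):

```yaml
# The adaptive batch scheduler requires all data exchanges to be blocking.
# ALL_EXCHANGES_BLOCKING is the mode this thread refers to.
execution.batch-shuffle-mode: ALL_EXCHANGES_BLOCKING
```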

@zhuzhurk
Contributor

@tillrohrmann @dmvk would you help take a look at the English version of the document, if convenient?

@dmvk
Member

dmvk commented Feb 15, 2022

I'm not sure whether ABS should have its own "top level" section under the deployment menu. Would it make sense to incorporate this into the elastic scaling page?

Member

@dmvk left a comment

Thanks for the PR @wanglijie95 👍 It's great to see that this feature will get proper documentation 😍. I'm mostly concerned about what audience we are targeting with these docs; I think we should take less advanced users into consideration here, because this is a really cool feature that many people will want to try out.

Also it would be nice to add a section about how this could be integrated with the external shuffle service (without it, this effort lacks the benefit of being resource efficient).

I left some comments in-line, please take a look.

For the grammar, once you're finished, you can ping @infoverload and she can help to correct it.

Are you also planning a blog post for this? It would be a good opportunity to enhance this with some high level pictures that could be then reused.

👍

Comment on lines 28 to 31
The Adaptive Batch Scheduler can automatically decide parallelisms of job vertices for batch jobs. If a job vertex is not set with a parallelism, the scheduler will decide parallelism for the job vertex according to the size of its consumed datasets. This can bring many benefits:
- Batch job users can be relieved from parallelism tuning
- Automatically tuned parallelisms can be vertex level and can better fit consumed datasets which have a varying volume size every day
- Vertices from SQL batch jobs can be assigned with different parallelisms which are automatically tuned
Member

What's the target audience? Is a regular Flink user supposed to know what a job vertex is? Overall this page feels a bit too low level 🤔.

On the other hand, I don't think the other pages within this section are much better in this regard 🤔

Contributor Author

What's the target audience? Is a regular Flink user supposed to know what a job vertex is? Overall this page feels a bit too low level 🤔.

Thanks for pointing that out. Maybe stage is more appropriate?

On the other hand, I don't think the other pages within this section are much better in this regard 🤔

I'll check the rest of the content.

Contributor Author

Or use operator, although it is not exactly the same as the job vertex.

Contributor

@zhuzhurk Feb 18, 2022

+1 for operator. It is the concept that users can/must understand. I think adaptively deciding parallelisms does mean adaptively deciding parallelisms for operators. We just do not want to break beneficial operator chaining, so parallelisms are decided per OperatorChain/JobVertex.


#### Set the parallelism of job vertices to `-1`
Adaptive Batch Scheduler will only decide parallelism for job vertices whose parallelism is not specified by users (parallelism is `-1`). So if you want the parallelism of vertices can be decided automatically, you should configure as follows:
- Set `paralleims.default` to `-1`
Member

typo

- Set the parallelism of job vertices to `-1`.

#### Configure to use Adaptive Batch Scheduler
To use Adaptive Batch Scheduler, you need to set the [`jobmanager.scheduler`]({{< ref "docs/deployment/config" >}}#jobmanager-scheduler) to `AdpaptiveBatch`. In addition, there are several optional config options that might need adjustment when using Adaptive Batch Scheduler:
Member

typo AdpaptiveBatch

- [`jobmanager.scheduler.adaptive-batch.data-volume-per-task`]({{< ref "docs/deployment/config" >}}#jobmanager-scheduler-adaptive-batch-data-volume-per-task): The size of data volume to expect each task instance to process
- [`jobmanager.scheduler.adaptive-batch.source-parallelism.default`]({{< ref "docs/deployment/config" >}}#jobmanager-scheduler-adaptive-batch-source-parallelism-default): The default parallelism of source vertices
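For orientation, the options quoted above could be combined into a minimal flink-conf.yaml sketch. Note that the correct scheduler value is AdaptiveBatch (the snippet under review contains a typo, as pointed out above), and the numeric values below are purely illustrative, not recommendations:

```yaml
# Enable the adaptive batch scheduler.
jobmanager.scheduler: AdaptiveBatch
# -1 leaves the default parallelism unset so the scheduler can decide it.
parallelism.default: -1
# Expected data volume per task instance (illustrative value).
jobmanager.scheduler.adaptive-batch.data-volume-per-task: 1g
# Default parallelism of source vertices (illustrative value).
jobmanager.scheduler.adaptive-batch.source-parallelism.default: 4
```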

#### Set the parallelism of job vertices to `-1`
Member

Why don't we set the defaults automatically when the ABS is enabled? Are there cases where we can't assume that this is what the user wants?

Contributor Author

If users explicitly configure parallelism.default (with a value > 0) in flink-conf but we override this value with -1, I think this may give users the feeling that the configuration does not take effect. Maybe we can check the value of parallelism.default and print an ERROR or WARNING log if the value is > 0?


### Performance tuning

1. It's recommended to use `Sort Shuffle` and set [`taskmanager.network.memory.buffers-per-channel`]({{< ref "docs/deployment/config" >}}#taskmanager-network-memory-buffers-per-channel) to `0`. This can decouple the network memory consumption from parallelism, so for large scale jobs, the possibility of "Insufficient number of network buffers" error can be decreased.
Member

Would it make sense to link this with a blog post?

Contributor

@zhuzhurk Feb 16, 2022

+1 to add a link to "https://flink.apache.org/2021/10/26/sort-shuffle-part1.html" (or maybe "https://flink.apache.org/2021/10/26/sort-shuffle-part1.html#motivation-behind-the-sort-based-implementation" which explains the benefits of Sort Shuffle including saving network buffers).

Contributor Author

+1 for this. I will add it.

### Performance tuning

1. It's recommended to use `Sort Shuffle` and set [`taskmanager.network.memory.buffers-per-channel`]({{< ref "docs/deployment/config" >}}#taskmanager-network-memory-buffers-per-channel) to `0`. This can decouple the network memory consumption from parallelism, so for large scale jobs, the possibility of "Insufficient number of network buffers" error can be decreased.
2. It's not recommended to configure an excessive value for [`jobmanager.scheduler.adaptive-batch.max-parallelism`]({{< ref "docs/deployment/config" >}}#jobmanager-scheduler-adaptive-batch-max-parallelism), otherwise it will affect the performance. Because this option can affect the number of subpartitions produced by upstream tasks, excessive number of subpartitions may degrade the performance of hash shuffle and the performance of network transmission due to small packets.
Member

What is an excessive value in this context?

Contributor Author

I think the maximum parallelism should be set to the parallelism you expect to need to process the data in the worst case; a value larger than that can be considered an "excessive value". I will revise the description in this part.
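Taken together, the tuning advice in this thread could be sketched as a flink-conf.yaml fragment. The sort-shuffle option name is an assumption based on the Flink configuration reference, and the max-parallelism value is purely illustrative (per the comment above, set it to the worst-case parallelism you expect to need):

```yaml
# Enable sort shuffle for all result partitions and decouple
# network memory consumption from parallelism.
taskmanager.network.sort-shuffle.min-parallelism: 1
taskmanager.network.memory.buffers-per-channel: 0
# Upper bound for automatically decided parallelisms; avoid excessive
# values, as they increase the number of subpartitions (illustrative).
jobmanager.scheduler.adaptive-batch.max-parallelism: 1024
```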


### Limitations

- **ALL-EDGES-BLOCKING batch jobs only**: The first version of Adaptive Batch Scheduler only supports ALL-EDGES-BLOCKING batch jobs only.
Member

What does this mean? (from the user perspective)

Contributor

@tillrohrmann left a comment

Thanks for creating this PR @wanglijie95. I think it is already really good. One thing that I am missing is what David said: an explanation for less advanced users would be really cool. I think it could go along the lines of how the batch scheduler works and what benefits it brings.

docs/content/docs/deployment/adaptive_batch_scheduler.md (resolved)
@zhuzhurk
Contributor

I'm not sure whether ABS should have its own "top level" section under the deployment menu. Would it make sense to incorporate this into elastic scaling page?

Good idea! +1 to add this doc as an Adaptive Batch Scheduler section in the Elastic Scaling page.

@wanglijie95
Contributor Author

Are you also planning a blog post for this? It would be a good opportunity to enhance this with some high level pictures that could be then reused.

Yes, a blog post is in our plan, maybe shortly after the 1.15 release.

@wanglijie95
Contributor Author

wanglijie95 commented Feb 17, 2022

Thanks for your comments @zhuzhurk @dmvk @tillrohrmann; this goes a long way toward perfecting this document. I've updated the document, looking forward to your further feedback.

@dmvk Currently RSS does not support ABS (mainly, it does not support one input gate consuming a subpartition range), so I think the part about integrating with external shuffle services can be added after RSS is adapted to ABS. When posting the blog post in the future, if the adaptation of RSS has finished, we can also recommend users to use it.

docs/content.zh/docs/deployment/elastic_scaling.md (resolved)
@zhuzhurk
Contributor

Thanks for addressing the comments! The change looks good to me.
@dmvk do you want to take another look?

docs/content/docs/deployment/elastic_scaling.md (6 resolved review threads)
docs/content.zh/docs/deployment/elastic_scaling.md (4 resolved review threads)
Contributor

@zhuzhurk left a comment

Thanks for addressing all the comments. @wanglijie95
The doc now looks good to me.

@wanglijie95
Contributor Author

@flinkbot run azure

@zhuzhurk zhuzhurk closed this in 192351e Mar 20, 2022
@wanglijie95 wanglijie95 deleted the FLINK-25226 branch March 21, 2022 01:52
JasonLeeCoding pushed a commit to JasonLeeCoding/flink that referenced this pull request May 27, 2022
zstraw pushed a commit to zstraw/flink that referenced this pull request Jul 4, 2022