[FLINK-13688][hive] Limit the parallelism/memory of HiveCatalogUseBlinkITCase #9417

JingsongLi · 2019-08-12T08:27:07Z

What is the purpose of the change

limit the parallelism of HiveCatalogUseBlinkITCase to avoid too many slot requests by default parallelism (use the core size of machine).

Verifying this change

This change is already covered by existing tests.

Does this pull request potentially affect one of the following parts:

Dependencies (does it add or upgrade a dependency): no
The public API, i.e., is any changed class annotated with @Public(Evolving): no
The serializers: no
The runtime per-record code paths (performance sensitive): no
Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: no
The S3 file system connector: no

Documentation

Does this pull request introduce a new feature? no

flinkbot · 2019-08-12T08:29:19Z

Thanks a lot for your contribution to the Apache Flink project. I'm the @flinkbot. I help the community
to review your pull request. We will use this comment to track the progress of the review.

Automated Checks

Last check on commit 4df01ef (Fri Aug 23 10:18:58 UTC 2019)

Warnings:

1 pom.xml files were touched: Check for build and licensing issues.
No documentation files were touched! Remember to keep the Flink docs up to date!

_{Mention the bot in a comment to re-run the automated checks.}

Review Progress

❓ 1. The [description] looks good.
❓ 2. There is [consensus] that the contribution should go into to Flink.
❓ 3. Needs [attention] from.
❓ 4. The change fits into the overall [architecture].
❓ 5. Overall code [quality] is good.

Please see the Pull Request Review Guide for a full explanation of the review process.

Details

The Bot is tracking the review progress through labels. Labels are applied according to the order of the review items. For consensus, approval by a Flink committer of PMC member is required

Bot commands

The @flinkbot bot supports the following commands:

@flinkbot approve description to approve one or more aspects (aspects: description, consensus, architecture and quality)
@flinkbot approve all to approve all aspects
@flinkbot approve-until architecture to approve everything until architecture
@flinkbot attention @username1 [@username2 ..] to require somebody's attention
@flinkbot disapprove architecture to remove an approval you gave earlier

flinkbot · 2019-08-12T08:37:06Z

CI report:

2d6b138 : CANCELED Build
8dacb85 : SUCCESS Build
92f59a1 : CANCELED Build
4df01ef : SUCCESS Build

KurtYoung · 2019-08-12T09:16:47Z

Another way is to extend AbstractTestBase, which will configure mini cluster in a more predictable way.

KurtYoung · 2019-08-12T09:18:26Z

And I'm not sure the root cause is the parallelism is too high for this case. We should support running very high parallelism batch job with only one slot by design.

JingsongLi · 2019-08-12T11:01:20Z

@KurtYoung Yeah, the main problem is memory request is too high(128MB). I think 8dacb85 will solve it.

KurtYoung · 2019-08-13T05:39:41Z

How about we use a pre configured mini cluster instead of lower the memory request? Because it's not 100% safe to just rely on lowering the sort memory, because we might choose hash aggregate.

JingsongLi · 2019-08-13T07:47:24Z

How about we use a pre configured mini cluster instead of lower the memory request? Because it's not 100% safe to just rely on lowering the sort memory, because we might choose hash aggregate.

Yeah, that is a way too, but in this case, I think use lower memory is OK. Because the data is too few.
1.It must rely on sort now, because UDAF only can be support by sort agg.
2.I think I can extract the configuration in BatchTestBase, to config all memory things.

KurtYoung · 2019-08-13T08:09:43Z

I would like to repeat my first question: why not just extending AbstractTestBase

JingsongLi · 2019-08-13T08:32:09Z

I would like to repeat my first question: why not just extending AbstractTestBase

OK.

JingsongLi · 2019-08-13T08:41:01Z

This case makes me feel that the first time a user uses blink batch sql, he has to think about resources.....

bowenli86 · 2019-08-14T21:32:45Z

+1 for fixing it. @KurtYoung @JingsongLi

KurtYoung · 2019-08-16T01:53:42Z

Verified locally, +1.

…nkITCase This closes #9417 (cherry picked from commit a194b37)

[FLINK-13688][hive] Limit the parallelism of HiveCatalogUseBlinkITCase

2d6b138

rmetzger added the review=description? label Aug 12, 2019

limit sort memory

8dacb85

rmetzger added component=Connectors/Hive component=Tests labels Aug 12, 2019

JingsongLi changed the title ~~[FLINK-13688][hive] Limit the parallelism of HiveCatalogUseBlinkITCase~~ [FLINK-13688][hive] Limit the parallelism/memory of HiveCatalogUseBlinkITCase Aug 12, 2019

Use BatchTestBase config

92f59a1

extends AbstractTestBase

4df01ef

KurtYoung pushed a commit that referenced this pull request Aug 16, 2019

[FLINK-13688][hive] Limit the parallelism/memory of HiveCatalogUseBli…

03b3430

…nkITCase This closes #9417 (cherry picked from commit a194b37)

KurtYoung closed this in a194b37 Aug 16, 2019

JingsongLi deleted the hivetest branch August 16, 2019 07:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FLINK-13688][hive] Limit the parallelism/memory of HiveCatalogUseBlinkITCase #9417

[FLINK-13688][hive] Limit the parallelism/memory of HiveCatalogUseBlinkITCase #9417

Uh oh!

JingsongLi commented Aug 12, 2019

Uh oh!

flinkbot commented Aug 12, 2019 •

edited

Loading

Uh oh!

flinkbot commented Aug 12, 2019 •

edited

Loading

Uh oh!

KurtYoung commented Aug 12, 2019

Uh oh!

KurtYoung commented Aug 12, 2019

Uh oh!

JingsongLi commented Aug 12, 2019

Uh oh!

KurtYoung commented Aug 13, 2019

Uh oh!

JingsongLi commented Aug 13, 2019

Uh oh!

KurtYoung commented Aug 13, 2019

Uh oh!

JingsongLi commented Aug 13, 2019

Uh oh!

JingsongLi commented Aug 13, 2019

Uh oh!

bowenli86 commented Aug 14, 2019

Uh oh!

KurtYoung commented Aug 16, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[FLINK-13688][hive] Limit the parallelism/memory of HiveCatalogUseBlinkITCase #9417

[FLINK-13688][hive] Limit the parallelism/memory of HiveCatalogUseBlinkITCase #9417

Uh oh!

Conversation

JingsongLi commented Aug 12, 2019

What is the purpose of the change

Verifying this change

Does this pull request potentially affect one of the following parts:

Documentation

Uh oh!

flinkbot commented Aug 12, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Automated Checks

Review Progress

Uh oh!

flinkbot commented Aug 12, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CI report:

Uh oh!

KurtYoung commented Aug 12, 2019

Uh oh!

KurtYoung commented Aug 12, 2019

Uh oh!

JingsongLi commented Aug 12, 2019

Uh oh!

KurtYoung commented Aug 13, 2019

Uh oh!

JingsongLi commented Aug 13, 2019

Uh oh!

KurtYoung commented Aug 13, 2019

Uh oh!

JingsongLi commented Aug 13, 2019

Uh oh!

JingsongLi commented Aug 13, 2019

Uh oh!

bowenli86 commented Aug 14, 2019

Uh oh!

KurtYoung commented Aug 16, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

flinkbot commented Aug 12, 2019 •

edited

Loading

flinkbot commented Aug 12, 2019 •

edited

Loading