
Conversation


@jaslou jaslou commented Aug 2, 2019

What is the purpose of the change

(For example: This pull request makes task deployment go through the blob server, rather than through RPC. That way we avoid re-transferring the task information on each deployment (during recovery).)

Brief change log

(for example:)

  • The TaskInfo is stored in the blob store on job creation time as a persistent artifact
  • Deployments RPC transmits only the blob storage reference
  • TaskManagers retrieve the TaskInfo from the blob cache (a minimal sketch of this pattern follows below)
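
Purely to illustrate the pattern this example describes, here is a minimal Java sketch with hypothetical BlobStore, TaskInfo, and TaskManagerBlobCache types (the real Flink classes differ): the deployment RPC carries only a blob key, and the TaskManager resolves and caches the payload locally.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical stand-ins for the real Flink classes.
record TaskInfo(String jobId, byte[] payload) {}

class BlobStore {
    private final Map<String, TaskInfo> blobs = new ConcurrentHashMap<>();

    // Stored once at job creation time; only the key travels over RPC.
    String put(TaskInfo info) {
        String key = info.jobId() + "/task-info";
        blobs.put(key, info);
        return key;
    }

    TaskInfo get(String key) {
        return blobs.get(key);
    }
}

class TaskManagerBlobCache {
    private final BlobStore store;
    private final Map<String, TaskInfo> cache = new ConcurrentHashMap<>();

    TaskManagerBlobCache(BlobStore store) {
        this.store = store;
    }

    // The deployment RPC transmits only 'key'; repeated deployments
    // (e.g. during recovery) hit the local cache instead of re-transferring
    // the payload.
    TaskInfo resolve(String key) {
        return cache.computeIfAbsent(key, store::get);
    }
}
```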

Verifying this change

(Please pick one of the following options)

This change is a trivial rework / code cleanup without any test coverage.

(or)

This change is already covered by existing tests, such as (please describe tests).

(or)

This change added tests and can be verified as follows:

(example:)

  • Added integration tests for end-to-end deployment with large payloads (100MB)
  • Extended integration test for recovery after master (JobManager) failure
  • Added test that validates that TaskInfo is transferred only once across recoveries
  • Manually verified the change by running a 4-node cluster with 2 JobManagers and 4 TaskManagers, a stateful streaming program, and killing one JobManager and two TaskManagers during the execution, verifying that recovery happens correctly.

Does this pull request potentially affect one of the following parts:

  • Dependencies (does it add or upgrade a dependency): (yes / no)
  • The public API, i.e., is any changed class annotated with @Public(Evolving): (yes / no)
  • The serializers: (yes / no / don't know)
  • The runtime per-record code paths (performance sensitive): (yes / no / don't know)
  • Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: (yes / no / don't know)
  • The S3 file system connector: (yes / no / don't know)

Documentation

  • Does this pull request introduce a new feature? (yes / no)
  • If yes, how is the feature documented? (not applicable / docs / JavaDocs / not documented)

zentol and others added 30 commits July 12, 2019 11:48
…ng to PubSubSink serializer and emulator settings
  - Some JavaDoc comments
  - Make the class final, because several methods are not designed to handle inheritance well.
  - Avoid repeated string concatenation/building
…resource profile

This change is covered by various existing integration tests that failed prior to this fix.
Before, exceptions that occurred after cancelling a source (as the
KafkaConsumer did, for example) would make a job fail when attempting a
"stop-with-savepoint". Now we ignore those exceptions.
…ial revert of FLINK-11458): use single threaded Task's dispatcher thread pool
…on in the blocking method in case of spurious wakeups
This commit reworks the JSON format to use a runtime converter created based
on the given TypeInformation. Before this commit, the conversion logic was
based on reference comparison of TypeInformation, which did not work after
the format had been serialized.

This also introduces a builder pattern to ensure future immutability
of schemas.

This closes #7932.
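
As a hedged illustration of that builder pattern (hypothetical JsonSchema name; the actual format classes differ), the idea is that all mutation happens on the builder and the built schema is an immutable snapshot:

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical sketch of an immutable schema with a builder; the real
// Flink JSON format classes and names differ.
final class JsonSchema {
    private final List<String> fieldNames;

    private JsonSchema(List<String> fieldNames) {
        this.fieldNames = Collections.unmodifiableList(new ArrayList<>(fieldNames));
    }

    List<String> fieldNames() {
        return fieldNames;
    }

    static Builder builder() {
        return new Builder();
    }

    static final class Builder {
        private final List<String> fieldNames = new ArrayList<>();

        Builder field(String name) {
            fieldNames.add(name);
            return this;
        }

        // All mutation happens on the builder; build() hands out an
        // immutable snapshot that later builder calls cannot affect.
        JsonSchema build() {
            return new JsonSchema(fieldNames);
        }
    }
}
```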
This PR makes HiveTableSink implement OverwritableTableSink.

This closes #9067.
…talog when creating sink for CatalogTable

The planner should first try to get the table factory from the catalog when creating table sinks for a CatalogTable.

This closes #9039.
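
A minimal sketch of that lookup order, with hypothetical interfaces standing in for the planner and catalog APIs: the catalog's own factory wins, and generic factory discovery is only the fallback.

```java
import java.util.Optional;

// Hypothetical interfaces; the real planner and catalog APIs differ.
interface TableSink {}
interface TableFactory { TableSink createSink(String tablePath); }
interface Catalog { Optional<TableFactory> getTableFactory(); }

class SinkResolver {
    // First ask the catalog for its table factory; fall back to a
    // discovery-based factory only if the catalog does not provide one.
    static TableSink createSink(Catalog catalog, String tablePath) {
        return catalog.getTableFactory()
                .map(factory -> factory.createSink(tablePath))
                .orElseGet(() -> discoverFactory().createSink(tablePath));
    }

    private static TableFactory discoverFactory() {
        // Placeholder for a ServiceLoader-style factory lookup.
        return tablePath -> new TableSink() {};
    }
}
```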
This PR adds comprehensive documentation for unified catalog APIs and catalogs.

The ticket for corresponding Chinese documentation is FLINK-13086.

This closes #8976.
This PR integrates FunctionCatalog with Catalog APIs.

This closes #8920.
…LI SessionContext

This PR supports remembering the current catalog and database that users set in the SQL CLI SessionContext.

This closes #9049.
wuchong and others added 23 commits August 2, 2019 10:09
Add an areTypesCompatible() method to LogicalTypeChecks. It compares two LogicalTypes while ignoring field names and other logical attributes (e.g. description, isFinal).
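
A toy illustration of that comparison (hypothetical RowType/Field model; the real LogicalType hierarchy is much richer): compatibility compares the logical structure positionally and deliberately ignores field names and the description.

```java
import java.util.List;

// Toy type model; the real LogicalType hierarchy is far richer.
record Field(String name, String typeRoot) {}
record RowType(List<Field> fields, String description) {}

class LogicalTypeChecksSketch {
    // Two row types are "compatible" if their field types match
    // positionally; field names and the description are ignored.
    static boolean areTypesCompatible(RowType a, RowType b) {
        if (a.fields().size() != b.fields().size()) {
            return false;
        }
        for (int i = 0; i < a.fields().size(); i++) {
            String leftRoot = a.fields().get(i).typeRoot();
            String rightRoot = b.fields().get(i).typeRoot();
            if (!leftRoot.equals(rightRoot)) {
                return false;
            }
        }
        return true;
    }
}
```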
This commit combines HBaseTableSourceITCase, HBaseLookupFunctionITCase, and HBaseConnectorITCase into one class.
This saves us considerable cluster initialization time.

This closes #9275
…emantics fixed per partition type

In the long term we do not need auto-release semantics for blocking (persistent) partitions. We expect them always to be released externally by the JM and assume they can be consumed multiple times.

Pipelined partitions always have only one consumer and one consumption attempt. Afterwards they can always be released automatically.

ShuffleDescriptor.ReleaseType was introduced to make release semantics more flexible, but it is not needed in the long term.

FORCE_PARTITION_RELEASE_ON_CONSUMPTION was introduced as a safety net to be able to fall back to the 1.8 behaviour, without the partition tracker and the JM taking care of blocking partition release. We can make this option specific to NettyShuffleEnvironment, which was the only existing shuffle service before. If it is activated, the blocking partition is also auto-released on a consumption attempt, as it was before. In this case, fine-grained recovery will simply not find the partition after the job restart and will restart the producer.
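
A simplified sketch of these semantics (hypothetical names; the real NettyShuffleEnvironment logic is more involved): pipelined partitions are always released after their single consumption, while blocking partitions are left to the JM unless the fallback option forces release on consumption.

```java
// Hypothetical sketch of the release decision; the actual Flink code differs.
enum PartitionType { PIPELINED, BLOCKING }

class ReleasePolicy {
    // Models FORCE_PARTITION_RELEASE_ON_CONSUMPTION, the safety-net option
    // restoring the pre-partition-tracker (1.8) behaviour.
    private final boolean forceReleaseOnConsumption;

    ReleasePolicy(boolean forceReleaseOnConsumption) {
        this.forceReleaseOnConsumption = forceReleaseOnConsumption;
    }

    // Pipelined partitions have exactly one consumer and one consumption
    // attempt, so they are always released once consumed. Blocking
    // partitions are released externally by the JM, unless the fallback
    // option forces release on consumption.
    boolean releaseOnConsumption(PartitionType type) {
        return type == PartitionType.PIPELINED || forceReleaseOnConsumption;
    }
}
```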
Make MultiTaskSlot unavailable for allocation while it is releasing its children,
to avoid a ConcurrentModificationException.

This closes #9288.
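
A minimal sketch of that guard, with hypothetical names: while the children are being released, the slot reports itself as unavailable, so the collection being iterated is never mutated by a concurrent allocation.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; the real MultiTaskSlot is considerably more involved.
class MultiTaskSlotSketch {
    private final List<Object> children = new ArrayList<>();
    private boolean releasingChildren;

    boolean isAvailableForAllocation() {
        // Refuse new allocations while children are being released, so the
        // loop in release() never iterates a list that is mutated underneath
        // it (the ConcurrentModificationException the fix avoids).
        return !releasingChildren;
    }

    void allocateChild(Object child) {
        if (!isAvailableForAllocation()) {
            throw new IllegalStateException("Slot is releasing its children");
        }
        children.add(child);
    }

    void release() {
        releasingChildren = true;
        try {
            for (Object child : children) {
                // Release each child here.
            }
            children.clear();
        } finally {
            releasingChildren = false;
        }
    }
}
```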

flinkbot commented Aug 2, 2019

Thanks a lot for your contribution to the Apache Flink project. I'm the @flinkbot. I help the community
to review your pull request. We will use this comment to track the progress of the review.

Automated Checks

Last check on commit bf99b26 (Tue Aug 06 15:59:02 UTC 2019)

Warnings:

  • 14 pom.xml files were touched: Check for build and licensing issues.

Mention the bot in a comment to re-run the automated checks.

Review Progress

  • ❓ 1. The [description] looks good.
  • ❓ 2. There is [consensus] that the contribution should go into Flink.
  • ❓ 3. Needs [attention] from.
  • ❓ 4. The change fits into the overall [architecture].
  • ❓ 5. Overall code [quality] is good.

Please see the Pull Request Review Guide for a full explanation of the review process.

Details
The Bot is tracking the review progress through labels. Labels are applied according to the order of the review items. For consensus, approval by a Flink committer or PMC member is required.

Bot commands
The @flinkbot bot supports the following commands:

  • @flinkbot approve description to approve one or more aspects (aspects: description, consensus, architecture and quality)
  • @flinkbot approve all to approve all aspects
  • @flinkbot approve-until architecture to approve everything until architecture
  • @flinkbot attention @username1 [@username2 ..] to require somebody's attention
  • @flinkbot disapprove architecture to remove an approval you gave earlier


flinkbot commented Aug 2, 2019

CI report:


jaslou commented Aug 4, 2019

I'm so sorry about that; I didn't mean to do it. It was an accidental operation on my part.
