chore: partition-by primitive key support #4098

big-andy-coates · 2019-12-10T14:30:44Z

Description

NOTE: This was stacked on top of #4096

WIP: This commit gets PARTITION BY clauses working with primitive key types. However, it does disable a couple of join until #4094 has been completed.

BREAKING CHANGE: A PARTITION BY now changes the SQL type of ROWKEY in the output schema of a query.

For example, consider:

CREATE STREAM INPUT (ROWKEY STRING KEY, ID INT) WITH (...);
CREATE STREAM OUTPUT AS SELECT ROWKEY AS NAME FROM INPUT PARTITION BY ID;

Previously, the above would have resulted in an output schema of ROWKEY STRING KEY, NAME STRING, where ROWKEY would have stored the string representation of the integer from the ID column. With this commit the output schema will be ROWKEY INT KEY, NAME STRING.

Testing done

Suitable QTT tests added / updated.

Reviewer checklist

Ensure docs are updated if necessary. (eg. if a user visible feature is being added or changed).
Ensure relevant issues are linked (description should include text like "Fixes #")

First of a few commits to start introducing support for primitive keys in different query types. This commit opens the door for CT/CS statements with primitive keys, (`STRING`, `INT`, `BIGINT`, `BOOLEAN` and `DOUBLE`), and for using those sources in non-join, non-aggregate and non-partition-by queries.

Fixes: confluentinc#4092 WIP: This commit gets `PARTITION BY` clauses working with primitive key types. However, it does disable a couple of join until confluentinc#4094 has been completed. BREAKING CHANGE: A `PARTITION BY` now changes the SQL type of `ROWKEY` in the output schema of a query. For example, consider: ```sql CREATE STREAM INPUT (ROWKEY STRING KEY, ID INT) WITH (...); CREATE STREAM OUTPUT AS SELECT ROWKEY AS NAME FROM INPUT PARTITION BY ID; ``` Previously, the above would have resulted in an output schema of `ROWKEY STRING KEY, NAME STRING`, where `ROWKEY` would have stored the string representation of the integer from the `ID` column. With this commit the output schema will be `ROWKEY INT KEY, NAME STRING`.

purplefox

LGTM

rodesai

LGTM

rodesai · 2019-12-11T01:33:26Z

ksql-streams/src/main/java/io/confluent/ksql/execution/streams/StepSchemaResolver.java

+      final LogicalSchema sourceSchema,
+      final StreamSelectKey step
+  ) {
+    final ExpressionTypeManager expressionTypeManager =


Can we move this out of this class? Ideally this class should just be routing to something else that owns the schema transformation. We can move it to the step builder for now.

big-andy-coates added 3 commits December 10, 2019 11:25

chore: fix up error message

256f92a

big-andy-coates requested a review from a team as a code owner December 10, 2019 14:30

purplefox approved these changes Dec 10, 2019

View reviewed changes

big-andy-coates and others added 2 commits December 10, 2019 18:33

Merge branch 'master' into partition_by_primitives

29d67da

chore: fix checkstyle

07cd724

big-andy-coates merged commit 7addf88 into confluentinc:master Dec 10, 2019

big-andy-coates deleted the partition_by_primitives branch December 10, 2019 22:48

rodesai reviewed Dec 11, 2019

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: partition-by primitive key support #4098

chore: partition-by primitive key support #4098

big-andy-coates commented Dec 10, 2019 •

edited

Loading

purplefox left a comment

rodesai left a comment

rodesai Dec 11, 2019

chore: partition-by primitive key support #4098

chore: partition-by primitive key support #4098

Conversation

big-andy-coates commented Dec 10, 2019 • edited Loading

Description

Testing done

Reviewer checklist

purplefox left a comment

Choose a reason for hiding this comment

rodesai left a comment

Choose a reason for hiding this comment

rodesai Dec 11, 2019

Choose a reason for hiding this comment

big-andy-coates commented Dec 10, 2019 •

edited

Loading