Primitive keys: Support `INT`, `BIGINT`, `DOUBLE` and `STRING` in `PARTITION BY` #4092

big-andy-coates · 2019-12-10T11:21:06Z

No description provided.

Fixes: confluentinc#4092 WIP: This commit gets `PARTITION BY` clauses working with primitive key types. However, it does disable a couple of join until confluentinc#4094 has been completed. BREAKING CHANGE: A `PARTITION BY` now changes the SQL type of `ROWKEY` in the output schema of a query. For example, consider: ```sql CREATE STREAM INPUT (ROWKEY STRING KEY, ID INT) WITH (...); CREATE STREAM OUTPUT AS SELECT ROWKEY AS NAME FROM INPUT PARTITION BY ID; ``` Previously, the above would have resulted in an output schema of `ROWKEY STRING KEY, NAME STRING`, where `ROWKEY` would have stored the string representation of the integer from the `ID` column. With this commit the output schema will be `ROWKEY INT KEY, NAME STRING`.

* chore: partition-by primitive key support Fixes: #4092 WIP: This commit gets `PARTITION BY` clauses working with primitive key types. However, it does disable a couple of join until #4094 has been completed. BREAKING CHANGE: A `PARTITION BY` now changes the SQL type of `ROWKEY` in the output schema of a query. For example, consider: ```sql CREATE STREAM INPUT (ROWKEY STRING KEY, ID INT) WITH (...); CREATE STREAM OUTPUT AS SELECT ROWKEY AS NAME FROM INPUT PARTITION BY ID; ``` Previously, the above would have resulted in an output schema of `ROWKEY STRING KEY, NAME STRING`, where `ROWKEY` would have stored the string representation of the integer from the `ID` column. With this commit the output schema will be `ROWKEY INT KEY, NAME STRING`.

Fixes: confluentinc#4092 This commit gets `GROUP BY` clauses working with primitive key types. BREAKING CHANGE: A `GROUP BY` on single expressions now changes the SQL type of `ROWKEY` in the output schema of the query to match the SQL type of the expression. For example, consider: ```sql CREATE STREAM INPUT (ROWKEY STRING KEY, ID INT) WITH (...); CREATE TABLE OUTPUT AS SELECT COUNT(*) AS COUNT FROM INPUT GROUP BY ID; ``` Previously, the above would have resulted in an output schema of `ROWKEY STRING KEY, COUNT BIGINT`, where `ROWKEY` would have stored the string representation of the integer from the `ID` column. With this commit the output schema will be `ROWKEY INT KEY COUNT BIGINT`. BREAKING CHANGE: Any`GROUP BY` expression that resolves to `NULL`, including because a UDF throws an exception, now results in the row being excluded from the result. Previously, as the key was a `STRING` a value of `"null"` could be used. With other primitive types this is not possible. As key columns must be non-null any exception is logged and the row is excluded.

* chore: group-by primitive key support Fixes: #4092 This commit gets `GROUP BY` clauses working with primitive key types. BREAKING CHANGE: A `GROUP BY` on single expressions now changes the SQL type of `ROWKEY` in the output schema of the query to match the SQL type of the expression. For example, consider: ```sql CREATE STREAM INPUT (ROWKEY STRING KEY, ID INT) WITH (...); CREATE TABLE OUTPUT AS SELECT COUNT(*) AS COUNT FROM INPUT GROUP BY ID; ``` Previously, the above would have resulted in an output schema of `ROWKEY STRING KEY, COUNT BIGINT`, where `ROWKEY` would have stored the string representation of the integer from the `ID` column. With this commit the output schema will be `ROWKEY INT KEY COUNT BIGINT`. BREAKING CHANGE: Any`GROUP BY` expression that resolves to `NULL`, including because a UDF throws an exception, now results in the row being excluded from the result. Previously, as the key was a `STRING` a value of `"null"` could be used. With other primitive types this is not possible. As key columns must be non-null any exception is logged and the row is excluded.

big-andy-coates mentioned this issue Dec 10, 2019

chore: partition-by primitive key support #4098

Merged

2 tasks

big-andy-coates added this to the 0.7.0 milestone Dec 10, 2019

big-andy-coates closed this as completed in #4098 Dec 10, 2019

big-andy-coates mentioned this issue Dec 10, 2019

chore: group-by primitive key support #4108

Merged

2 tasks

big-andy-coates self-assigned this Jan 6, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Primitive keys: Support `INT`, `BIGINT`, `DOUBLE` and `STRING` in `PARTITION BY` #4092

Primitive keys: Support `INT`, `BIGINT`, `DOUBLE` and `STRING` in `PARTITION BY` #4092

big-andy-coates commented Dec 10, 2019

Primitive keys: Support INT, BIGINT, DOUBLE and STRING in PARTITION BY #4092

Primitive keys: Support INT, BIGINT, DOUBLE and STRING in PARTITION BY #4092

Comments

big-andy-coates commented Dec 10, 2019

Primitive keys: Support `INT`, `BIGINT`, `DOUBLE` and `STRING` in `PARTITION BY` #4092

Primitive keys: Support `INT`, `BIGINT`, `DOUBLE` and `STRING` in `PARTITION BY` #4092