sql: relax restrictions on partition name reuse #39332

solongordon · 2019-08-05T18:36:22Z

Previously, we did not allow users to reuse partition names between
different indexes on the same table. The reason for this was that it
caused some ambiguity with the zone specifiers passed to the cockroach zone CLI. However, that CLI has now been removed in favor of more
explicit SQL statements, so this is no longer an issue. I removed this
restriction, so partition names now only need to be unique per index,
not per table.

My original intent was to do away with the CLI zone specifiers (also
known as zone names) entirely, since they are no longer specified by
users in any command. However, there are a few places that rely on
having a string identifier for each zone. One is the event log, which
logs an event whenever a zone configuration is set or removed. Another
is the 'SHOW ZONE CONFIGURATIONS' command, which includes a zone_name
column for each row. (We could remove this but I think readability would
suffer.) The Admin UI uses zone name as well in its proto interface,
though I'm not sure if it's actually displayed anywhere in the UI.

So rather than removing zone names, I decided to keep them around only
as a display identifier. I removed any logic which relied on parsing a
zone name into a ZoneSpecifier. I also added explicit columns to the
crdb_internal.zones table for database_name, table_name, etc. to avoid
reliance on the zone_name for JOINs and such. I also removed any
references to "CLI specifiers."

The zone name for a partition on a secondary index takes the form
DATABASE.TABLE@INDEX.PARTITION. This was previously a disallowed format
for CLI specifiers, which either specified an index or a partition.

Finally, please note that it is no longer possible to alter a secondary index
partition using the ALTER PARTITION ... OF TABLE command. Users must
use ALTER PARTITION ... OF INDEX instead.

Release note (sql change): Partition names can now be reused between
different indexes on the same table.

cockroach-teamcity · 2019-08-05T18:36:28Z

This change is

rohany

Why does alter partition of table ... need to be removed?

Reviewed 11 of 19 files at r1.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @danhhz and @rohany)

solongordon · 2019-08-05T19:49:40Z

It wasn't removed entirely. It's just you can only use that command to alter partitions on the primary index now. Otherwise it's ambiguous: if you have a partition named p on multiple indexes and run ALTER PARTITION p OF TABLE, which one did you mean to alter?

rohany · 2019-08-05T20:22:36Z

That makes sense

rohany

This looks pretty good to me, most of the changes are in deleting stuff from old tests. Restructuring the zones table like this definitely makes it much more usable in other contexts.

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @danhhz)

danhhz

Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @danhhz and @solongordon)

pkg/sql/logictest/testdata/logic_test/crdb_internal, line 206 at r2 (raw file):

SELECT * FROM crdb_internal.zones WHERE false
----
zone_id  zone_name  range_name  database_name  table_name  index_name  partition_name  config_yaml  config_sql  config_protobuf

we implicitly test this table by using it in SHOW PARTITIONS, but I don't see anywhere that you had to update something testing it directly. should we?

solongordon

Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @danhhz)

pkg/sql/logictest/testdata/logic_test/crdb_internal, line 206 at r2 (raw file):

Previously, danhhz (Daniel Harrison) wrote…

we implicitly test this table by using it in SHOW PARTITIONS, but I don't see anywhere that you had to update something testing it directly. should we?

Yes, good point, I'll add a test.

Previously, we did not allow users to reuse partition names between different indexes on the same table. The reason for this was that it caused some ambiguity with the zone specifiers passed to the `cockroach zone` CLI. However, that CLI has now been removed in favor of more explicit SQL statements, so this is no longer an issue. I removed this restriction, so partition names now only need to be unique per index, not per table. My original intent was to do away with the CLI zone specifiers (also known as zone names) entirely, since they are no longer specified by users in any command. However, there are a few places that rely on having a string identifier for each zone. One is the event log, which logs an event whenever a zone configuration is set or removed. Another is the 'SHOW ZONE CONFIGURATIONS' command, which includes a `zone_name` column for each row. (We could remove this but I think readability would suffer.) The Admin UI uses zone name as well in its proto interface, though I'm not sure if it's actually displayed anywhere in the UI. So rather than removing zone names, I decided to keep them around only as a display identifier. I removed any logic which relied on parsing a zone name into a `ZoneSpecifier`. I also added explicit columns to the crdb_internal.zones table for database_name, table_name, etc. to avoid reliance on the zone_name for JOINs and such. I also removed any references to "CLI specifiers." The zone name for a partition on a secondary index takes the form DATABASE.TABLE@INDEX.PARTITION. This was previously a disallowed format for CLI specifiers, which either specified an index or a partition. Finally, please note that it is no longer possible to alter a secondary index partition using the `ALTER PARTITION ... OF TABLE` command. Users must use `ALTER PARTITION ... OF INDEX` instead. Release note (sql change): Partition names can now be reused between different indexes on the same table.

solongordon · 2019-08-07T15:35:27Z

TFTRs!

bors r+

39332: sql: relax restrictions on partition name reuse r=solongordon a=solongordon Previously, we did not allow users to reuse partition names between different indexes on the same table. The reason for this was that it caused some ambiguity with the zone specifiers passed to the `cockroach zone` CLI. However, that CLI has now been removed in favor of more explicit SQL statements, so this is no longer an issue. I removed this restriction, so partition names now only need to be unique per index, not per table. My original intent was to do away with the CLI zone specifiers (also known as zone names) entirely, since they are no longer specified by users in any command. However, there are a few places that rely on having a string identifier for each zone. One is the event log, which logs an event whenever a zone configuration is set or removed. Another is the 'SHOW ZONE CONFIGURATIONS' command, which includes a `zone_name` column for each row. (We could remove this but I think readability would suffer.) The Admin UI uses zone name as well in its proto interface, though I'm not sure if it's actually displayed anywhere in the UI. So rather than removing zone names, I decided to keep them around only as a display identifier. I removed any logic which relied on parsing a zone name into a `ZoneSpecifier`. I also added explicit columns to the crdb_internal.zones table for database_name, table_name, etc. to avoid reliance on the zone_name for JOINs and such. I also removed any references to "CLI specifiers." The zone name for a partition on a secondary index takes the form DATABASE.TABLE@INDEX.PARTITION. This was previously a disallowed format for CLI specifiers, which either specified an index or a partition. Finally, please note that it is no longer possible to alter a secondary index partition using the `ALTER PARTITION ... OF TABLE` command. Users must use `ALTER PARTITION ... OF INDEX` instead. Release note (sql change): Partition names can now be reused between different indexes on the same table. Co-authored-by: Solon Gordon <solon@cockroachlabs.com>

craig · 2019-08-07T16:02:50Z

Build succeeded

GitHub CI (Cockroach)

The commands for partitioning indexes in the TPCC import were erroring out due to a syntax change introduced in cockroachdb#39332. I updated them to use `ALTER PARTITION ... OF INDEX` rather than `ALTER PARTITION ... OF TABLE`. Fixes cockroachdb#39005 Fixes cockroachdb#40360 Fixes cockroachdb#40416 Release note: None

40248: opt: calculate number of rows processed when costing joins r=rytaft a=rytaft This PR updates the costing of joins to take into account the number of rows processed by the operator. This number may be larger than the number of output rows if an additional filter is applied as part of the ON condition that is not used to determine equality columns for the join. For example, consider the query `SELECT * FROM abc JOIN def ON a = e AND b = 3;` Assuming there is no index on b, if a lookup join is used to execute this query, the number of rows processed is actually the same as the query `SELECT * FROM abc JOIN def ON a = e;` The difference is that the filter b=3 must also be applied to every row in the first query. The coster now takes this into account when determining the cost of joins. Fixes #34810 Release note: None 40431: workload: fix partition commands in tpcc import r=solongordon a=solongordon The commands for partitioning indexes in the TPCC import were erroring out due to a syntax change introduced in #39332. I updated them to use `ALTER PARTITION ... OF INDEX` rather than `ALTER PARTITION ... OF TABLE`. Fixes #39005 Fixes #40360 Fixes #40416 Release note: None Co-authored-by: Rebecca Taft <becca@cockroachlabs.com> Co-authored-by: Solon Gordon <solon@cockroachlabs.com>

solongordon requested review from rohany, a team and danhhz August 5, 2019 18:54

rohany reviewed Aug 5, 2019

View reviewed changes

solongordon force-pushed the index-scoped-partition-names branch from 1ed34fe to 867bda0 Compare August 5, 2019 20:11

solongordon force-pushed the index-scoped-partition-names branch from 867bda0 to 7251c53 Compare August 5, 2019 20:39

rohany reviewed Aug 5, 2019

View reviewed changes

danhhz reviewed Aug 6, 2019

View reviewed changes

solongordon commented Aug 6, 2019

View reviewed changes

solongordon force-pushed the index-scoped-partition-names branch from 7251c53 to f17eb1c Compare August 7, 2019 14:32

craig bot merged commit f17eb1c into cockroachdb:master Aug 7, 2019

solongordon deleted the index-scoped-partition-names branch August 8, 2019 12:00

ericharmeling mentioned this pull request Aug 20, 2019

Update partition documentation to show support for partition name reuse cockroachdb/docs#5245

Closed

5 tasks

jseldess mentioned this pull request Aug 26, 2019

sql: relax restrictions on partition name reuse cockroachdb/docs#5289

Closed

solongordon mentioned this pull request Aug 26, 2019

sql: make partition names index-scoped, not table-scoped #20880

Closed

solongordon mentioned this pull request Sep 3, 2019

workload: fix partition commands in tpcc import #40431

Merged

solongordon mentioned this pull request Sep 11, 2019

sql: backward compat for ALTER PARTITION OF TABLE #40650

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sql: relax restrictions on partition name reuse #39332

sql: relax restrictions on partition name reuse #39332

solongordon commented Aug 5, 2019 •

edited

Loading

cockroach-teamcity commented Aug 5, 2019

rohany left a comment

solongordon commented Aug 5, 2019

rohany commented Aug 5, 2019

rohany left a comment

danhhz left a comment

solongordon left a comment

solongordon commented Aug 7, 2019

craig bot commented Aug 7, 2019

sql: relax restrictions on partition name reuse #39332

sql: relax restrictions on partition name reuse #39332

Conversation

solongordon commented Aug 5, 2019 • edited Loading

cockroach-teamcity commented Aug 5, 2019

rohany left a comment

Choose a reason for hiding this comment

solongordon commented Aug 5, 2019

rohany commented Aug 5, 2019

rohany left a comment

Choose a reason for hiding this comment

danhhz left a comment

Choose a reason for hiding this comment

solongordon left a comment

Choose a reason for hiding this comment

solongordon commented Aug 7, 2019

craig bot commented Aug 7, 2019

Build succeeded

solongordon commented Aug 5, 2019 •

edited

Loading