
sql: ensure that Upsert will never update the same row twice #45372

Merged Feb 26, 2020 (4 commits)

Conversation

andy-kimball (Contributor)

Previously, there were cases where an UPSERT or INSERT..ON CONFLICT
statement could operate on the same row more than once. This is a
problem because these statements are designed to read once to get
existing rows, and then to insert or update based on that information.
They don't expect that previous writes for the same statement will
affect the correctness of subsequent writes. This can lead to index
corruption in cases where a row "moves" from one location to another
when one of its index keys changes. See issue #44466 for an example.

This PR fixes the problem by introducing a new variation of the
DistinctOn operator that ensures that the input to the Upsert
operator never has duplicates. It differs from the regular DistinctOn
operator in two ways:

  1. Null behavior: UpsertDistinctOn treats NULL values as not equal
    to one another for purposes of grouping. Two rows having a NULL-
    valued grouping column will be placed in different groups. This
    differs from DistinctOn behavior, where the two rows would be
    grouped together. This behavior difference reflects SQL semantics,
    in which a unique index key still allows multiple NULL values.

  2. Duplicate behavior: UpsertDistinctOn raises an error if any
    distinct grouping contains more than one row. It has "input must
    be distinct" semantics rather than "make the input distinct"
    semantics. This is used to ensure that no row will be updated
    more than once.
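The two behaviors above can be sketched with a minimal Python helper (illustrative only; the real operator is implemented in Go inside the optimizer and execution engine, and the names here are hypothetical):

```python
def upsert_distinct_on(rows, key_cols):
    """Sketch of UpsertDistinctOn semantics (illustrative, not CockroachDB
    code): NULL (None) key values never match one another, and any true
    duplicate grouping raises an error instead of being collapsed."""
    seen = set()
    null_counter = 0
    out = []
    for row in rows:
        key = []
        for col in key_cols:
            v = row[col]
            if v is None:
                # Each NULL gets a unique sentinel, so NULL keys land in
                # different groups (behavior 1 above).
                null_counter += 1
                v = ("__null__", null_counter)
            key.append(v)
        key = tuple(key)
        if key in seen:
            # "Input must be distinct" semantics (behavior 2 above).
            raise ValueError("UPSERT cannot affect row a second time")
        seen.add(key)
        out.append(row)
    return out
```

Two NULL-keyed rows pass through unchanged, while a genuine duplicate key raises an error rather than being de-duplicated.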

The optbuilder now wraps the input to the Upsert operator with this
new UpsertDistinctOn operator. In addition, several commits add
optimization rules that remove these distinct operators when they
can be proven unnecessary.

Fixes #44466


@andy-kimball (Contributor Author)

@jordanlewis, not sure who on your team should review the SQL execution changes (part of the 1st commit). I added @yuzefovich as a guess.

@rytaft (Collaborator) left a comment

:lgtm: very nice!

Reviewed 39 of 39 files at r1, 2 of 2 files at r2, 9 of 9 files at r3, 9 of 9 files at r4.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @andy-kimball, @jordanlewis, @RaduBerinde, and @yuzefovich)


pkg/sql/opt/norm/groupby.go, line 23 at r3 (raw file):

// RemoveGroupingCols returns a new grouping private struct with the given
// columns removed from the window partition column set.

window partition -> grouping


pkg/sql/opt/norm/groupby.go, line 222 at r4 (raw file):

}

// MakeRowsDistinct tries to ensure that the given rows are unique with respect

MakeRowsDistinct -> areRowsDistinct

Also need to change the rest of the text in this comment


pkg/sql/opt/norm/groupby.go, line 247 at r4 (raw file):

		forceDistinct := false
		for iCol, colID := range cols {

[nit] Why not just use i instead of iCol?


pkg/sql/opt/norm/rules/groupby.opt, line 120 at r1 (raw file):

#
# Note that this rule does not apply to UpsertDistinctOn, since that will raise
# error if there are duplicate rows.

raise error -> raise an error


pkg/sql/opt/norm/testdata/rules/groupby, line 1347 at r4 (raw file):

           └── variable: column1 [type=int]

# DistinctOn treats NULL values as distinct, so it can't be eliminated.

Isn't it the opposite?


pkg/sql/opt/norm/testdata/rules/groupby, line 1362 at r4 (raw file):

      └── (NULL,) [type=tuple{unknown}]

# UpsertDistinctOn treats NULL values as not distinct, so it can be eliminated.

Isn't it the opposite?


pkg/sql/opt/optbuilder/insert.go, line 139 at r1 (raw file):

//     CASE WHEN fetch_a IS NULL ins_c ELSE fetch_c END AS ups_c,
//   FROM (
//     SELECT DISTINCT ON (ins_a) *

would help to emphasize somewhere below that this is not a normal distinct on (or somehow make that clear with syntax here)


pkg/sql/rowexec/distinct.go, line 179 at r1 (raw file):

		// row.
		if d.nullsAreDistinct && d.lastGroupKey[colIdx].IsNull() {
			return false, err

return false, nil

@andy-kimball (Contributor Author) left a comment

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @jordanlewis, @RaduBerinde, @rytaft, and @yuzefovich)


pkg/sql/opt/norm/groupby.go, line 23 at r3 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

window partition -> grouping

Done.


pkg/sql/opt/norm/groupby.go, line 222 at r4 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

MakeRowsDistinct -> areRowsDistinct

Also need to change the rest of the text in this comment

Done.


pkg/sql/opt/norm/groupby.go, line 247 at r4 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

[nit] Why not just use i instead of iCol?

At one point, I had an iRow as well. Fixed.


pkg/sql/opt/norm/rules/groupby.opt, line 120 at r1 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

raise error -> raise an error

Done.


pkg/sql/opt/norm/testdata/rules/groupby, line 1347 at r4 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

Isn't it the opposite?

Done.


pkg/sql/opt/norm/testdata/rules/groupby, line 1362 at r4 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

Isn't it the opposite?

Done.


pkg/sql/opt/optbuilder/insert.go, line 139 at r1 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

would help to emphasize somewhere below that this is not a normal distinct on (or somehow make that clear with syntax here)

Done.


pkg/sql/rowexec/distinct.go, line 179 at r1 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

return false, nil

Done.

@jordanlewis (Member) left a comment

Shouldn't this PR also close #37880?

How does it relate to #44434? Do we support IGNORE?

The execution stuff looks good, I just have a few comments.

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @andy-kimball, @RaduBerinde, @rytaft, and @yuzefovich)


pkg/sql/distinct.go, line 50 at r5 (raw file):

	// A distinct operation on column "c" will result in one output row if
	// nullsAreDistinct is false, or two output rows if true.
	nullsAreDistinct bool

Could you include some context in this comment? We tend to grow lots of flags like this one over time that change the behavior of the operators significantly. Code readers tend to want to know what the flags are for. Example:

"For ordinary SQL DISTINCT, this flag is true/false. It's set to false/true when doing XYZ, for example"


pkg/sql/distinct.go, line 55 at r5 (raw file):

	// the distinct operation finds two rows with duplicate grouping column
	// values. This is used to implement the UPSERT and INSERT..ON CONFLICT
	// statements, both of which prohibit the same row from being changed twice.

This is a good example of the context I was looking for in the first comment. Maybe just duplicate it above?


pkg/sql/execinfrapb/processors_sql.proto, line 287 at r5 (raw file):

  //
  // A distinct operation on column "c" will result in one output row if
  // NullsAreDistinct is false, or two output rows if true.

ditto on the context comment from above


pkg/sql/logictest/testdata/logic_test/upsert, line 1077 at r5 (raw file):

# Ensure this test stays synchronized to the EXPLAIN in exec/execbuilder/upsert,
# so that use of streaming group-by is confirmed.
# ------------------------------------------------------------------------------

I know we do this all over, but it's fragile. Can you also add unit tests to rowexec/distinct_test.go? That file probably could use more tests in general, but also it'll let you test the functionality you added to the execution engine more tightly.


pkg/sql/opt/exec/execbuilder/testdata/upsert, line 289 at r5 (raw file):

 WHERE operation != 'dist sender send'
----
join reader                           Scan /Table/57/1/2{-/#}

Why did these change?

@rytaft (Collaborator) left a comment

Reviewed 8 of 21 files at r5, 4 of 9 files at r7, 9 of 9 files at r8.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @andy-kimball, @RaduBerinde, and @yuzefovich)


Release note (sql change): UPSERT and INSERT..ON CONFLICT statements
will sometimes need to do an extra check to ensure that they never
update the same row twice. This may adversely affect performance in
cases where the optimizer cannot statically prove the extra check is
unnecessary.

Add support for UpsertDistinctOn to the existing rule that operates over
various grouping operators. The EliminateGroupByProject rule eliminates
an unnecessary input Project operator.

Release note: None

This commit adds rules which simplify the UpsertDistinctOn grouping
columns. These are similar to rules already in place for DistinctOn,
but need some modifications related to null and error handling.
UpsertDistinctOn treats NULL values as not equal to one another, and
so puts them in different groups. In addition, UpsertDistinctOn
raises an error when there is more than one row in a group.

Release note: None
@andy-kimball (Contributor Author) left a comment

This PR does not handle the DO NOTHING case. That's a separate PR that will follow, but it will use some of the machinery that this PR introduces. #44434 is just a duplicate of #37880, so the follow-on PR will close them both.

Note that the two cases are related but different: the index corruption bug occurs because duplicate rows do not raise an error, while the DO NOTHING bug occurs because duplicate rows do raise an error. Implementation-wise, both will use UpsertDistinctOn, but DO NOTHING will set ErrorOnDup=False, whereas ON CONFLICT DO UPDATE will set ErrorOnDup=True.
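The ErrorOnDup distinction can be sketched like this (a hypothetical Python helper, not the actual Go operator):

```python
def distinct_on(rows, key_cols, error_on_dup):
    """Illustrative sketch of the ErrorOnDup flag described above: when True
    (ON CONFLICT DO UPDATE), a duplicate grouping is an error; when False
    (DO NOTHING), later duplicates are silently dropped."""
    seen = set()
    out = []
    for row in rows:
        key = tuple(row[c] for c in key_cols)
        if key in seen:
            if error_on_dup:
                raise ValueError("duplicate row in upsert input")
            continue  # DO NOTHING: drop the duplicate row
        seen.add(key)
        out.append(row)
    return out
```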

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @andy-kimball, @jordanlewis, @RaduBerinde, and @yuzefovich)


pkg/sql/distinct.go, line 50 at r5 (raw file):

Previously, jordanlewis (Jordan Lewis) wrote…

Could you include some context in this comment? We tend to grow lots of flags like this one over time that change the behavior of the operators significantly. Code readers tend to want to know what the flags are for. Example:

"For ordinary SQL DISTINCT, this flag is true/false. It's set to false/true when doing XYZ, for example"

Done.


pkg/sql/distinct.go, line 55 at r5 (raw file):

Previously, jordanlewis (Jordan Lewis) wrote…

This is a good example of the context I was looking for in the first comment. Maybe just duplicate it above?

Done.


pkg/sql/execinfrapb/processors_sql.proto, line 287 at r5 (raw file):

Previously, jordanlewis (Jordan Lewis) wrote…

ditto on the context comment from above

Done.


pkg/sql/logictest/testdata/logic_test/upsert, line 1077 at r5 (raw file):

Previously, jordanlewis (Jordan Lewis) wrote…

I know we do this all over, but it's fragile. Can you also add unit tests to rowexec/distinct_test.go? That file probably could use more tests in general, but also it'll let you test the functionality you added to the execution engine more tightly.

Ah, I didn't notice those. Added some tests.

Add new rule that eliminates a distinct operator that has a constant input
Values operator that is already distinct with respect to the grouping columns.
The Values operator may be the immediate input, or it may be wrapped by
Select, Project, LeftJoin, and/or other operators. These are common patterns
that are generated by the optbuilder's upsert construction code, which must
ensure the same row cannot be updated twice.

Here is an example of the rule in action:

   SELECT DISTINCT ON (x) * FROM (VALUES (1), (2)) t(x)
   =>
   SELECT * FROM (VALUES (1), (2)) t(x)

Release note: None
@andy-kimball (Contributor Author) left a comment

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @jordanlewis, @RaduBerinde, @rytaft, and @yuzefovich)


pkg/sql/opt/exec/execbuilder/testdata/upsert, line 289 at r5 (raw file):

Previously, jordanlewis (Jordan Lewis) wrote…

Why did these change?

Just intra-commit change. No overall change. There was an extra distinct operator in the mix, until a later commit added an optimization to get rid of it.

@jordanlewis (Member) left a comment

:lgtm:

Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (and 1 stale) (waiting on @RaduBerinde, @rytaft, and @yuzefovich)

@andy-kimball (Contributor Author)

bors r+

@craig (Contributor)

craig bot commented Feb 26, 2020

Build succeeded

@craig craig bot merged commit 7e9b00d into cockroachdb:master Feb 26, 2020
@andy-kimball andy-kimball deleted the upsert branch February 26, 2020 14:56
@RaduBerinde (Member) left a comment

Late to the party - but this is great stuff! Obviously was a lot of work, but I think it's a pretty clean solution. Just have one question below.

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 2 stale) (waiting on @andy-kimball, @RaduBerinde, @rytaft, and @yuzefovich)


pkg/sql/opt/norm/groupby.go, line 216 at r12 (raw file):

	case *memo.LeftJoinExpr:
		if groupingCols.SubsetOf(t.Left.Relational().OutputCols) {
			return c.AreValuesDistinct(t.Left, groupingCols, nullsAreDistinct)

Why is this ok? A left join could duplicate the value from the left side (unless the eq columns are a key, which is probably the case we care about).

@andy-kimball (Contributor Author)


pkg/sql/opt/norm/groupby.go, line 216 at r12 (raw file):

Previously, RaduBerinde wrote…

Why is this ok? A left join could duplicate the value from the left side (unless the eq columns are a key, which is probably the case we care about).

You're absolutely right. My mind was stuck on thinking of this like a semi-join, which it basically is in this particular upsert case (since the right side always has a key that's part of the join). I'll post another PR to fix.
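The concern can be seen with a toy left join (hypothetical row shapes; not optimizer code): even if the left input is distinct on the grouping columns, the join output need not be, unless the equality columns form a key on the right side.

```python
def left_join(left, right, key):
    """Toy nested-loop left join over lists of dicts. A left row is emitted
    once per matching right row, so a non-key match duplicates it."""
    out = []
    for l in left:
        matches = [r for r in right if r[key] == l[key]]
        if matches:
            out.extend({**l, **r} for r in matches)
        else:
            out.append(dict(l))  # unmatched left row passes through once
    return out
```

With two right rows sharing the same key value, a single distinct left row appears twice in the output, which is exactly why distinctness of the left input alone is not sufficient.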

bobvawter added a commit to cockroachdb/replicator that referenced this pull request May 31, 2022
This change works around a CockroachDB implementation quirk wherein an UPSERT
is unable to read its own writes, which prevents the same row from being
upserted twice within the same statement. This also corrects a potential source
of data inconsistency in the applier code, if a single call to Apply contains
both upserts and deletes of the same row.

Both of the above issues can be addressed by sorting the mutations according to
their HLC time and then de-duplicating them by primary key.

The newly-added test in the apply package checks for the specific upsert
behavior, so that this workaround can be reconsidered if a future version of
CockroachDB allows duplicate rows within an UPSERT or CTE.

X-Ref: cockroachdb/cockroach#44466
X-Ref: cockroachdb/cockroach#70731
X-Ref: cockroachdb/cockroach#45372
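The sort-and-de-duplicate workaround described in that commit message can be sketched as follows (hypothetical mutation shape with `pk` and `hlc` fields; not the replicator code):

```python
def dedup_mutations(mutations):
    """Sort mutations by HLC timestamp, then keep only the latest mutation
    per primary key, so no row is upserted twice in one statement."""
    latest = {}
    for m in sorted(mutations, key=lambda m: m["hlc"]):
        latest[m["pk"]] = m  # a later HLC time overwrites an earlier one
    # Emit the survivors in HLC order.
    return sorted(latest.values(), key=lambda m: m["hlc"])
```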
Successfully merging this pull request may close these issues.

Multi-record UPSERT inserts duplicate values in PRIMARY KEY, resulting in inconsistent results