rowexec: joinReader optimizations for index joins #52952

sumeerbhola · 2020-08-18T00:31:34Z

These small optimizations are for index joins with large
numbers of rows, which are common with geospatial queries.

- No longer uses the keyToInputRowIndices map, which
  was consuming about 1.5% in CPU profiles.
- Sorts the spans to use optimizations at the storage layer,
  like "sstable: optimize seeks to use next" (pebble#860).
- Avoids constructing the partial key string, since it is
  not used.

Release note: None
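
As a rough illustration of why sorted spans can help a storage-layer optimization of the "use next instead of seek" kind, here is a minimal sketch. It is not Pebble's iterator and not the joinReader code: toyIter, maxSteps, countFullSeeks and makeKeys are hypothetical names, and the keys are made up. The toy iterator probes a few keys forward when a seek target lies at or after its current position, and only falls back to a full search otherwise.

```go
package main

import (
	"fmt"
	"sort"
)

// toyIter is a hypothetical iterator over a sorted slice of keys. It is not
// Pebble's iterator; it only illustrates why ascending seek targets are cheap.
type toyIter struct {
	keys      []string // sorted ascending
	pos       int      // current position, in [0, len(keys)]
	fullSeeks int      // seeks that fell back to a binary search
}

// maxSteps bounds how many forward probes are tried before a full seek.
const maxSteps = 4

// seekGE positions the iterator at the first key >= target.
func (it *toyIter) seekGE(target string) {
	// Fast path: every key before pos is < target, so the answer is at or
	// after pos. Probe a few keys forward before giving up.
	if it.pos == 0 || it.keys[it.pos-1] < target {
		for i := 0; i < maxSteps; i++ {
			p := it.pos + i
			if p == len(it.keys) || it.keys[p] >= target {
				it.pos = p
				return
			}
		}
	}
	// Slow path: search the whole key space.
	it.fullSeeks++
	it.pos = sort.SearchStrings(it.keys, target)
}

// countFullSeeks seeks to each target in turn and reports how many seeks
// needed the slow path.
func countFullSeeks(keys, targets []string) int {
	it := &toyIter{keys: keys}
	for _, t := range targets {
		it.seekGE(t)
	}
	return it.fullSeeks
}

// makeKeys returns n sorted keys: k00, k01, ...
func makeKeys(n int) []string {
	keys := make([]string, n)
	for i := range keys {
		keys[i] = fmt.Sprintf("k%02d", i)
	}
	return keys
}

func main() {
	keys := makeKeys(20)
	unsorted := []string{"k09", "k03", "k06", "k05"}
	sorted := append([]string(nil), unsorted...)
	sort.Strings(sorted)
	fmt.Println("unsorted targets, full seeks:", countFullSeeks(keys, unsorted))
	fmt.Println("sorted targets, full seeks:", countFullSeeks(keys, sorted))
}
```

With these made-up inputs, the sorted targets need no full seeks while the unsorted ones need three. The real savings depend on how the storage layer implements its seeks.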

cockroach-teamcity · 2020-08-18T00:31:41Z

This change is Reviewable.

sumeerbhola

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto, @helenmhe, and @yuzefovich)

pkg/sql/rowexec/joinreader.go, line 460 at r1 (raw file):

	// - For indexJoinReaderType this allows lower layers to optimize iteration
	//   over the data. It's safe to sort since the looked up rows are output
	//   unchanged.

this sort for index joins is causing test failures -- there must be a gap in my understanding on why we can't sort.

helenmhe-zz

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto, @helenmhe, @sumeerbhola, and @yuzefovich)

pkg/sql/rowexec/joinreader.go, line 460 at r1 (raw file):

Previously, sumeerbhola wrote…

this sort for index joins is causing test failures -- there must be a gap in my understanding on why we can't sort.

When I refactored index joins to use the joinReader, there wasn't any logic in the old indexjoiner.go for sorting spans, so I didn't add any and based the index join strategy on joinReaderNoOrderingStrategy. Sorting isn't safe in the index join case because the order of the output is changed and then not restored during output collection. We might need to do something similar to joinReaderOrderingStrategy to make sorting the spans correct.

sumeerbhola

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto, @helenmhe, and @yuzefovich)

pkg/sql/rowexec/joinreader.go, line 460 at r1 (raw file):

Sorting isn't safe in the index join case because the order of the output is changed and then not restored during output collection.

Do you know why restoring is important given that we simply emit the looked up row? I was imagining we had one of two cases:

- the input would already be fully sorted by the key, so this sort of the spans in a batch would be a noop.
- the input was not sorted, so the next processor would not be expecting any ordering invariant, so sorting within a batch would be harmless.

What am I missing?

helenmhe-zz

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto, @sumeerbhola, and @yuzefovich)

pkg/sql/rowexec/joinreader.go, line 460 at r1 (raw file):

Previously, sumeerbhola wrote…

Sorting isn't safe in the index join case because the order of the output is changed and then not restored during output collection.

Do you know why restoring is important given that we simply emit the looked up row? I was imagining we had one of two cases:

- the input would already be fully sorted by the key, so this sort of the spans in a batch would be a noop.
- the input was not sorted, so the next processor would not be expecting any ordering invariant, so sorting within a batch would be harmless.

What am I missing?

For me, the tests that were failing were ones like TestSchemaChangeAfterCreateInTxn, where the test checks for a certain order but the query doesn't actually have an ORDER BY. When I asked, it was suggested this was more of an issue with the test, so it's possible sorting within the batch is still harmless, I guess?

helenmhe-zz · 2020-08-18T20:36:54Z

pkg/sql/rowexec/joinreader.go, line 460 at r1 (raw file):

Previously, helenmhe (Helen He) wrote…

For me, the tests that were failing were ones like TestSchemaChangeAfterCreateInTxn, where the test checks for a certain order but the query doesn't actually have an ORDER BY. When I asked, it was suggested this was more of an issue with the test, so it's possible sorting within the batch is still harmless, I guess?

Hm, never mind. I took a look at the CI and saw the failing tests with ORDER BYs.

asubiotto

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto, @sumeerbhola, and @yuzefovich)

pkg/sql/rowexec/joinreader.go, line 460 at r1 (raw file):

Previously, helenmhe (Helen He) wrote…

Hm, never mind. I took a look at the CI and saw the failing tests with ORDER BYs.

What's the MaintainOrdering parameter on the spec for these test failures? It's possible that the lookup rows are sorted by secondary index and this is the order the tests expect to be maintained. This call to sort.Sort will probably reorder spans by primary index, which might be a different order.
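
A small sketch of the effect described here, using made-up data rather than the actual joinReader types: indexEntry, outputB and isDescending are hypothetical names. The entries arrive in the order a scan of a descending index on b would return them. Reordering the lookups by primary key, without restoring the original order afterwards, changes the order of the emitted rows.

```go
package main

import (
	"fmt"
	"sort"
)

// indexEntry is a hypothetical secondary index entry: the indexed column b
// and the primary key used to fetch the full row.
type indexEntry struct {
	b  int
	pk string
}

// outputB returns the b values in the order the looked-up rows are emitted.
// If sortByPK is set, lookups are reordered by primary key first and the
// original order is never restored.
func outputB(entries []indexEntry, sortByPK bool) []int {
	lookups := append([]indexEntry(nil), entries...)
	if sortByPK {
		sort.Slice(lookups, func(i, j int) bool { return lookups[i].pk < lookups[j].pk })
	}
	out := make([]int, len(lookups))
	for i, e := range lookups {
		out[i] = e.b
	}
	return out
}

// isDescending reports whether vals is in non-increasing order.
func isDescending(vals []int) bool {
	return sort.SliceIsSorted(vals, func(i, j int) bool { return vals[i] > vals[j] })
}

func main() {
	// Entries in the order a scan of a descending index on b would yield them.
	entries := []indexEntry{{30, "a"}, {20, "c"}, {10, "b"}}
	kept := outputB(entries, false)
	sorted := outputB(entries, true)
	fmt.Println(kept, isDescending(kept))
	fmt.Println(sorted, isDescending(sorted))
}
```

In this example the unsorted lookups keep b in descending order, while sorting by primary key emits the rows as 30, 10, 20, which breaks the ordering a downstream consumer may rely on.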

sumeerbhola

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto, @helenmhe, and @yuzefovich)

pkg/sql/rowexec/joinreader.go, line 460 at r1 (raw file):

Previously, asubiotto (Alfonso Subiotto Marqués) wrote…

What's the MaintainOrdering parameter on the spec for these test failures? It's possible that the lookup rows are sorted by secondary index and this is the order the tests expect to be maintained. This call to sort.Sort will probably reorder spans by primary index, which might be a different order.

Yes, it looks like the failing ones are trying to maintain secondary index order.

I am trying to confirm that the cases I care about are not producing plans that MaintainOrdering, but the info is not printed in the EXPLAIN output, so I added some code here

cockroach/pkg/sql/walk.go

Line 220 in c362707

v.observer.attr(name, "key columns", strings.Join(cols, ", "))

with a new "ordering" field name (this was prior to Radu's cleanup from yesterday of the old EXPLAIN code), but I don't see that information in the EXPLAIN output. Where is the code that I should be changing instead?

sumeerbhola

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @asubiotto, @helenmhe, and @yuzefovich)

pkg/sql/opt/exec/execbuilder/testdata/ddl, line 208 at r2 (raw file):

SELECT message FROM [SHOW KV TRACE FOR SESSION] WITH ORDINALITY
 WHERE message LIKE 'fetched:%' OR message LIKE 'output row%'
 ORDER BY message LIKE 'fetched:%' DESC, ordinality ASC

I want to confirm that changing the expected output is ok here. I don't really understand what ordering this is asking for: query T suggests there is only 1 column, but this mentions ordinality, which doesn't seem to contribute to the output. Does the DESC of message apply to the whole column when it matches fetched:%? If so, the reordering below seems wrong.

All the logictests are passing and the CI test failures seem unrelated.

pkg/sql/rowexec/joinreader.go, line 460 at r1 (raw file):

Previously, sumeerbhola wrote…

Yes, it looks like the failing ones are trying to maintain secondary index order.

I am trying to confirm that the cases I care about are not producing plans that MaintainOrdering, but the info is not printed in the EXPLAIN output, so I added some code here

cockroach/pkg/sql/walk.go

Line 220 in c362707

v.observer.attr(name, "key columns", strings.Join(cols, ", "))

with a new "ordering" field name (this was prior to Radu's cleanup from yesterday of the old EXPLAIN code), but I don't see that information in the EXPLAIN output. Where is the code that I should be changing instead?

I've fixed this -- it needed a change to pass on the maintainOrdering for the index join case.
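
The shape of such a guard can be sketched as follows. This is only an illustration under assumptions, not the code in this PR: span, prepareSpans and spanKeys are hypothetical names, and the real joinReader uses its own span types and strategy structs. The idea is simply to sort the spans by key only when the plan does not require the input order to be maintained.

```go
package main

import (
	"bytes"
	"fmt"
	"sort"
)

// span is a hypothetical stand-in for a key span; only the start key matters
// for this sketch.
type span struct {
	key []byte
}

// prepareSpans returns the spans in the order lookups should be issued.
// When maintainOrdering is true, the input order is the required output
// order, so the spans are left as they are. Otherwise they are sorted by key
// so that lower layers see ascending seeks.
func prepareSpans(spans []span, maintainOrdering bool) []span {
	out := append([]span(nil), spans...)
	if !maintainOrdering {
		sort.Slice(out, func(i, j int) bool {
			return bytes.Compare(out[i].key, out[j].key) < 0
		})
	}
	return out
}

// spanKeys renders the span keys as strings for printing.
func spanKeys(spans []span) []string {
	keys := make([]string, len(spans))
	for i, s := range spans {
		keys[i] = string(s.key)
	}
	return keys
}

func main() {
	in := []span{{[]byte("c")}, {[]byte("a")}, {[]byte("b")}}
	fmt.Println(spanKeys(prepareSpans(in, true)))
	fmt.Println(spanKeys(prepareSpans(in, false)))
}
```

Copying the slice before sorting keeps the caller's view of the input order intact, which matters if that order is used later to match results back to input rows.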

asubiotto

LGTM minus test

Reviewed 1 of 2 files at r1, 5 of 5 files at r2.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @helenmhe, @rytaft, and @yuzefovich)

pkg/sql/opt/exec/execbuilder/testdata/ddl, line 208 at r2 (raw file):

Previously, sumeerbhola wrote…

I want to confirm that changing the expected output is ok here. I don't really understand what ordering this is asking for: query T suggests there is only 1 column, but this mentions ordinality, which doesn't seem to contribute to the output. Does the DESC of message apply to the whole column when it matches fetched:%? If so, the reordering below seems wrong.

All the logictests are passing and the CI test failures seem unrelated.

cc @rytaft on the expectations

rytaft · 2020-08-21T19:55:33Z

pkg/sql/opt/exec/execbuilder/testdata/ddl, line 208 at r2 (raw file):

Previously, asubiotto (Alfonso Subiotto Marqués) wrote…

cc @rytaft on the expectations

Looks fine to me... I think the order by clause is just ordering on the boolean result of the LIKE expression, ensuring that the output row results come after the fetched results

sumeerbhola

Reviewable status: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @asubiotto, @helenmhe, @rytaft, and @yuzefovich)

pkg/sql/opt/exec/execbuilder/testdata/ddl, line 208 at r2 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

Looks fine to me... I think the order by clause is just ordering on the boolean result of the LIKE expression, ensuring that the output row results come after the fetched results

I see. Looking at this again, these are the rows from the preceding SELECT * FROM t@b_desc, so it is explainable by the sort in the index join. If I change it to SELECT * FROM t@b_desc ORDER BY b DESC the index join doesn't do the sort, as expected.

sumeerbhola · 2020-08-21T21:10:18Z

TFTRs!

These small optimizations are for index joins with large numbers of rows, which are common with geospatial queries.

- No longer uses the keyToInputRowIndices map, which was consuming about 1.5% in CPU profiles
- Sorts the spans to use optimizations at the storage layer, like cockroachdb/pebble#860
- Avoids constructing the partial key string since it is not used

Release note: None

sumeerbhola · 2020-08-25T20:10:00Z

bors r+

craig · 2020-08-25T21:48:47Z

Build succeeded:

GitHub CI (Cockroach)

sumeerbhola requested review from yuzefovich, asubiotto and a team August 18, 2020 00:31

asubiotto requested a review from helenmhe-zz August 18, 2020 08:30

sumeerbhola commented Aug 18, 2020

helenmhe-zz reviewed Aug 18, 2020

sumeerbhola commented Aug 18, 2020

helenmhe-zz reviewed Aug 18, 2020

asubiotto reviewed Aug 19, 2020

sumeerbhola commented Aug 19, 2020

sumeerbhola force-pushed the indexjoin_opt branch from 48b2983 to bc79176 Compare August 21, 2020 12:12

sumeerbhola requested a review from a team as a code owner August 21, 2020 12:12

sumeerbhola commented Aug 21, 2020

asubiotto approved these changes Aug 21, 2020

sumeerbhola force-pushed the indexjoin_opt branch from bc79176 to 2b664b3 Compare August 21, 2020 21:03

sumeerbhola commented Aug 21, 2020

sumeerbhola force-pushed the indexjoin_opt branch from 2b664b3 to 1f14d1a Compare August 25, 2020 12:36

sumeerbhola force-pushed the indexjoin_opt branch from 1f14d1a to 511d98d Compare August 25, 2020 18:19

craig bot merged commit 62214d5 into cockroachdb:master Aug 25, 2020
