release-21.1: opt: calculate more accurate selectivity for disjunctions #67730

mgartner · 2021-07-17T00:00:04Z

Backport 2/2 commits from #64886.

/cc @cockroachdb/release

I've changed the original commits, so please be mindful of these differences
while reviewing:

There is now a cluster setting which gates this feature.
The tests were moved to execbuilder tests so that the cluster
setting could be changed; optimizer tests do not currently allow
for changing cluster settings.

Release justification: This is needed to prevent the optimizer from choosing
query plans with full table scans for a customer's workload. It improves the
selectivity estimations for query filters with disjunctions. The change is gated
behind a cluster setting which is disabled by default to prevent other customers'
query plans from changing if they upgrade to a new patch release.

opt: refactor statisticsBuilder.applyFilter

This commit refactors statisticsBuilder.applyFilter by renaming it to
applyFilters and moving the logic in the applyConjunction closure to
a separate applyFiltersItem function.

Release note: None

opt: calculate more accurate selectivity for disjunctions

The optimal plan for a query with a disjunctive filter is often a
DistinctOn+UnionAll of the results of two index scans, when two separate
indexes can satisfy either side of the disjunction. For example:

SELECT * FROM t WHERE a = 'foo' OR b = 'foo' LIMIT 5

limit
 ├── distinct-on
 │    └── union-all
 │         ├── scan t@a_idx
 │         │    └── constraint: /a: [/'foo' - /'foo']
 │         └── scan t@a_idx
 │              └── constraint: /b: [/'foo' - /'foo']
 └── 5

However, this optimal plan is not always chosen. A constraint.Set
cannot be built for a disjunction involving multiple columns, so the
statistics builder assumes the selectivity of the disjunction to be 1/3,
often resulting in an over-estimated row count. When the DistinctOn is
built during exploration it is added to the same memo group as the
Select that it replaces, so it shares the same row count estimate. Even
though the UnionAll's cost is low because it produces a small subset of
the table, the DistinctOn's cost is high because the coster is under the
assumption that the DistinctOn will produce 1/3 of the rows in the
table. The overhead of producing so many rows adds significant overhead
to the overall cost, preventing this plan from being chosen by the
optimizer.

This commit fixes the issue by attempting to build a constraint set for
each side of a disjunction in a filter. By unioning the selectivity of
each constraint set, a more accurate row count estimate is calculated
for the filter. As a result, the cost of the DistinctOn is more accurate
and the optimal plan is chosen.

This fix is only enabled if the cluster setting is enabled:
sql.optimizer_improve_disjunction_selectivity.enabled.

Informs #58744

Release note (performance improvement): A new cluster setting
sql.optimizer_improve_disjunction_selectivity.enabled enables
more accurate selectivity estimation of query filters with OR
expressions. This improves query plans in some cases. The cluster
setting is disabled by default.

This commit refactors `statisticsBuilder.applyFilter` by renaming it to `applyFilters` and moving the logic in the `applyConjunction` closure to a separate `applyFiltersItem` function. Release note: None

blathers-crl · 2021-07-17T00:00:07Z

Thanks for opening a backport.

Please check the backport criteria before merging:

Patches should only be created for serious issues.
Patches should not break backwards-compatibility.
Patches should change as little code as possible.
Patches should not change on-disk formats or node communication protocols.
Patches should not add new functionality.

If some of the basic criteria cannot be satisfied, ensure that the exceptional criteria are satisfied within.

There is a high priority need for the functionality that cannot wait until the next release and is difficult to address in another way.
The new functionality is additive-only and only runs for clusters which have specifically “opted in” to it (e.g. by a cluster setting).
New code is protected by a conditional check that is trivial to verify and ensures that it only runs for opt-in clusters.
The PM and TL on the team that owns the changed code have signed off that the change obeys the above rules.

Add a brief release justification to the body of your PR to justify this backport.

Some other things to consider:

What did we do to ensure that a user that doesn’t know & care about this backport, has no idea that it happened?
Will this work in a cluster of mixed patch versions? Did we test that?
If a user upgrades a patch version, uses this feature, and then downgrades, what happens?

cockroach-teamcity · 2021-07-17T00:00:14Z

This change is

rytaft

Reviewed 1 of 1 files at r1, 5 of 5 files at r2.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @mgartner)

pkg/sql/opt/memo/statistics_builder.go, line 34 at r2 (raw file):

// efficient query plans in some cases.
var improveDisjunctionSelectivityEnabled = settings.RegisterBoolSetting(
	"sql.defaults.optimizer_improve_disjunction_selectivity.enabled",

I think that typically when we use sql.defaults.... there is a corresponding session variable. Maybe that's worth considering here as well?

If you don't want to add a session setting (maybe it requires too many code changes?) I think you should probably use a different name. I was looking around for some naming guidelines but all I could find was this: https://github.com/cockroachdb/cockroach/blob/master/docs/RFCS/20170317_settings_table.md, which doesn't explicitly include that rule about sql.defaults.

mgartner

Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on @rytaft)

pkg/sql/opt/memo/statistics_builder.go, line 34 at r2 (raw file):

Previously, rytaft (Rebecca Taft) wrote…

I think that typically when we use sql.defaults.... there is a corresponding session variable. Maybe that's worth considering here as well?

If you don't want to add a session setting (maybe it requires too many code changes?) I think you should probably use a different name. I was looking around for some naming guidelines but all I could find was this: https://github.com/cockroachdb/cockroach/blob/master/docs/RFCS/20170317_settings_table.md, which doesn't explicitly include that rule about sql.defaults.

My reasoning was to not add anything that's not absolutely necessary, so I didn't add a session variable.

I see what you mean about the naming. defaults implies that it is a cluster setting to set the default value of a session setting. So maybe sql.optimizer_improve_disjunction_selectivity.enabled would be better?

The optimal plan for a query with a disjunctive filter is often a DistinctOn+UnionAll of the results of two index scans, when two separate indexes can satisfy either side of the disjunction. For example: SELECT * FROM t WHERE a = 'foo' OR b = 'foo' LIMIT 5 limit ├── distinct-on │ └── union-all │ ├── scan t@a_idx │ │ └── constraint: /a: [/'foo' - /'foo'] │ └── scan t@a_idx │ └── constraint: /b: [/'foo' - /'foo'] └── 5 However, this optimal plan is not always chosen. A `constraint.Set` cannot be built for a disjunction involving multiple columns, so the statistics builder assumes the selectivity of the disjunction to be 1/3, often resulting in an over-estimated row count. When the DistinctOn is built during exploration it is added to the same memo group as the Select that it replaces, so it shares the same row count estimate. Even though the UnionAll's cost is low because it produces a small subset of the table, the DistinctOn's cost is high because the coster is under the assumption that the DistinctOn will produce 1/3 of the rows in the table. The overhead of producing so many rows adds significant overhead to the overall cost, preventing this plan from being chosen by the optimizer. This commit fixes the issue by attempting to build a constraint set for each side of a disjunction in a filter. By unioning the selectivity of each constraint set, a more accurate row count estimate is calculated for the filter. As a result, the cost of the DistinctOn is more accurate and the optimal plan is chosen. This fix is only enabled if the cluster setting is enabled: `sql.defaults.optimizer_improve_disjunction_selectivity.enabled`. Informs cockroachdb#58744 Release note (performance improvement): A new cluster setting `sql.defaults.optimizer_improve_disjunction_selectivity.enabled` enables more accurate selectivity estimation of query filters with OR expressions. This improves query plans in some cases. The cluster setting is disabled by default.

rytaft

Reviewed 2 of 2 files at r3.
Reviewable status: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @mgartner)

pkg/sql/opt/memo/statistics_builder.go, line 34 at r2 (raw file):

Previously, mgartner (Marcus Gartner) wrote…

My reasoning was to not add anything that's not absolutely necessary, so I didn't add a session variable.

I see what you mean about the naming. defaults implies that it is a cluster setting to set the default value of a session setting. So maybe sql.optimizer_improve_disjunction_selectivity.enabled would be better?

That seems fine to me. Thanks!

mgartner · 2021-07-19T20:12:15Z

Merging despite the failing blathers check - the requirements are met but the check is erroneously failing.

RaduBerinde · 2021-07-20T19:35:53Z

Note that using a cluster setting like this doesn't play nice with memo caching. All other knobs are stored in Memo so IsStale can check them. This could mean that, depending on what memos are cached, the cluster setting might not take effect for a particular query (until node restart).

mgartner · 2021-07-21T00:37:55Z

Note that using a cluster setting like this doesn't play nice with memo caching. All other knobs are stored in Memo so IsStale can check them. This could mean that, depending on what memos are cached, the cluster setting might not take effect for a particular query (until node restart).

Welp... I guess I'll have to adjust this to be a session setting that is plumbed into a memo field. Thanks for pointing this out!

Previously, only a cluster setting could enable improved optimizer calculations for the selectivity of filters with disjunctions. This prevented the optimizer from detecting stale memos in the cache. See cockroachdb#67730 (comment). This commit adds the `optimizer_improve_disjunction_selectivity` session setting to fix this. The cluster setting, which is now called `sql.defaults.optimizer_improve_disjunction_selectivity.enable` controls the default value of the session setting. Release note: None

opt: refactor statisticsBuilder.applyFilter

b7abd20

This commit refactors `statisticsBuilder.applyFilter` by renaming it to `applyFilters` and moving the logic in the `applyConjunction` closure to a separate `applyFiltersItem` function. Release note: None

mgartner requested review from rytaft and a team July 17, 2021 00:00

mgartner force-pushed the backport21.1-64886 branch from 887dc5b to d2b1d9b Compare July 17, 2021 00:00

rytaft approved these changes Jul 19, 2021

View reviewed changes

mgartner commented Jul 19, 2021

View reviewed changes

mgartner force-pushed the backport21.1-64886 branch from d2b1d9b to 54e2c6e Compare July 19, 2021 16:12

rytaft approved these changes Jul 19, 2021

View reviewed changes

mgartner merged commit 3f198fa into cockroachdb:release-21.1 Jul 19, 2021

mgartner deleted the backport21.1-64886 branch July 19, 2021 20:12

RaduBerinde mentioned this pull request Jul 20, 2021

sql: gate setting IntervalStyle with a setting #67792

Merged

mgartner mentioned this pull request Jul 21, 2021

release-21.1: sql: make a session setting for improving disjunction selectivity #67836

Merged

mikeCRL mentioned this pull request Aug 31, 2021

opt: calculate more accurate selectivity for disjunctions cockroachdb/docs#11189

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

release-21.1: opt: calculate more accurate selectivity for disjunctions #67730

release-21.1: opt: calculate more accurate selectivity for disjunctions #67730

mgartner commented Jul 17, 2021 •

edited

blathers-crl bot commented Jul 17, 2021 •

edited by mgartner

cockroach-teamcity commented Jul 17, 2021

rytaft left a comment

mgartner left a comment

rytaft left a comment

mgartner commented Jul 19, 2021

RaduBerinde commented Jul 20, 2021

mgartner commented Jul 21, 2021

release-21.1: opt: calculate more accurate selectivity for disjunctions #67730

release-21.1: opt: calculate more accurate selectivity for disjunctions #67730

Conversation

mgartner commented Jul 17, 2021 • edited

opt: refactor statisticsBuilder.applyFilter

opt: calculate more accurate selectivity for disjunctions

blathers-crl bot commented Jul 17, 2021 • edited by mgartner

cockroach-teamcity commented Jul 17, 2021

rytaft left a comment

Choose a reason for hiding this comment

mgartner left a comment

Choose a reason for hiding this comment

rytaft left a comment

Choose a reason for hiding this comment

mgartner commented Jul 19, 2021

RaduBerinde commented Jul 20, 2021

mgartner commented Jul 21, 2021

mgartner commented Jul 17, 2021 •

edited

blathers-crl bot commented Jul 17, 2021 •

edited by mgartner