fix(influxql): don't expand large regex char sets #30

dgnorton · 2019-09-11T17:44:36Z

InfluxQL optimizes some queries by expanding regular expressions in the
WHERE clause to literal expressions. This causes a problem when the
expression expands to a large number of literals. This change caps it at
100 literals. If the expression would expand to more, it is not
optimized at all (i.e., no partial optimization).
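
As a rough illustration of the all-or-nothing cap described above, here is a minimal sketch. It is not the code in ast.go: `expandCharClass` is a hypothetical helper that handles only a pattern made of a single character class, and it uses Go's standard `regexp/syntax` package to count the runes before expanding anything.

```go
package main

import (
	"fmt"
	"regexp/syntax"
)

// maxLiterals mirrors the cap discussed in this PR; the value is illustrative.
const maxLiterals = 100

// expandCharClass is a hypothetical helper, not the function in ast.go.
// It parses a pattern that consists of a single character class, such as
// "[a-c]", and returns one literal per matching rune. If the class would
// expand to more than max literals, or the pattern is not a plain
// character class, it returns (nil, false) and the caller keeps the regex.
func expandCharClass(pattern string, max int) ([]string, bool) {
	re, err := syntax.Parse(pattern, syntax.Perl)
	if err != nil || re.Op != syntax.OpCharClass {
		return nil, false
	}

	// re.Rune holds inclusive [lo, hi] pairs. Count first so that an
	// oversized class is rejected before any literal is built.
	total := 0
	for i := 0; i+1 < len(re.Rune); i += 2 {
		total += int(re.Rune[i+1]-re.Rune[i]) + 1
		if total > max {
			return nil, false // all or nothing: no partial expansion
		}
	}

	lits := make([]string, 0, total)
	for i := 0; i+1 < len(re.Rune); i += 2 {
		for r := re.Rune[i]; r <= re.Rune[i+1]; r++ {
			lits = append(lits, string(r))
		}
	}
	return lits, true
}

func main() {
	// A small class is expanded; a negated class covers far too many runes.
	for _, p := range []string{"[a-c]", "[^a]"} {
		lits, ok := expandCharClass(p, maxLiterals)
		fmt.Println(p, ok, len(lits))
	}
}
```

Counting before allocating is the point of the sketch: a class that exceeds the cap is rejected without building any literals.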

jsternberg

I think the max should be lower, but the code is solid, so this can be approved whichever number is chosen.

jsternberg · 2019-09-11T17:46:40Z

ast.go

+	// this is exceeded, no expansion will be done. This allows reasonable
+	// optimizations of regex by expansion to literals but prevents cases
+	// where that expansion would result in a large number of literals.
+	const maxLiterals = 1000


I think 100 is a better value. 1000 is likely too high.

e-dard

This is the right approach, but we need to change a couple of things here:

We can't introduce a change like this that could break existing users' queries without a form of control. We need to propagate maxLiterals all the way back to a configuration file option in InfluxDB. The default of 100 seems reasonable.
When I tested this, I just got a normal info message saying the query executed, but no error or indication that the query failed. We need to be able to inform the user that their query failed, if indeed it did.

Further, when the PR to InfluxDB is made, we need to add the config option to the demo config with some description of what it does. We also need to let the docs team know about this change.

dgnorton · 2019-09-12T11:38:07Z

@e-dard

This change shouldn't break any existing queries because it is just limiting a query optimization. E.g., take a query with a regex that would expand to 100,000 literal comparisons. Prior to this change, the expansion would have happened and the rewritten query might have executed on a machine with sufficient resources. After this change, the query will not be rewritten and it will execute correctly on any machine.
No error occurs. It's just a matter of whether a query was optimized or not.
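
To make the point concrete, here is a small sketch (hypothetical helper names, not InfluxQL code) showing that filtering with the original regex and filtering with its expanded literals select the same values. The map lookup stands in for the rewritten chain of equality comparisons.

```go
package main

import (
	"fmt"
	"regexp"
)

// filterByRegex keeps the values matched by the pattern (the unoptimized path).
func filterByRegex(pattern string, values []string) []string {
	re := regexp.MustCompile(pattern)
	var out []string
	for _, v := range values {
		if re.MatchString(v) {
			out = append(out, v)
		}
	}
	return out
}

// filterByLiterals keeps the values equal to any literal (the optimized path).
func filterByLiterals(literals, values []string) []string {
	set := make(map[string]struct{}, len(literals))
	for _, l := range literals {
		set[l] = struct{}{}
	}
	var out []string
	for _, v := range values {
		if _, ok := set[v]; ok {
			out = append(out, v)
		}
	}
	return out
}

func main() {
	values := []string{"a", "b", "d", "ab"}
	// Both calls select the same values; only the evaluation strategy differs.
	fmt.Println(filterByRegex(`^[a-c]$`, values))
	fmt.Println(filterByLiterals([]string{"a", "b", "c"}, values))
}
```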

e-dard · 2019-09-12T11:55:08Z

@dgnorton ah OK I misunderstood the function's purpose. This makes a lot of sense! We should still document in the InfluxDB release notes (top of the changelog) that the optimisation has now been limited to XYZ expressions.

dgnorton requested review from jsternberg and e-dard September 11, 2019 17:44

jsternberg approved these changes Sep 11, 2019

dgnorton force-pushed the dn-fix-regex-stack-overflow branch from be022f0 to 3af2fe0 Compare September 11, 2019 17:51

e-dard suggested changes Sep 12, 2019

e-dard approved these changes Sep 12, 2019

dgnorton merged commit 57f403b into master Sep 12, 2019

dgnorton deleted the dn-fix-regex-stack-overflow branch September 12, 2019 15:35

williamhbaker mentioned this pull request Sep 17, 2021

fix: forward-port changes to tag values/key for better predicate handling influxdata/influxdb#22500

Merged
