Don't honor lowercase expanded terms in regex query generated from query string query #3922

mattweber · 2013-10-16T18:36:21Z

On the mailing list a user was using a query string query against an unanalyzed field that contained values such as:

5.2
5.3-SNAPSHOT

He wanted to return all documents that were not a snapshot version so they used a query string like -version:/.*SNAPSHOT/. This did not work. Turns out that lowercase_expanded_terms was turning the regex into -version:/.*snapshot/ which doesn't match the unanalyzed field due to case.

He was using Kibana which does not currently expose the lowercaseExpandedTerms option, so the user really had no way to fix this without changing the mapping and reindexing or patching kibana.

Should the lowercase_expanded_terms option even be honored in a regex query? I would think no.

I did not see a way to set a case insensitive flag on the lucene regex query and the (?i) flag from java regex did not work.

The text was updated successfully, but these errors were encountered:

clintongormley · 2013-10-16T18:51:04Z

Or at least we shouldn't lowercase expanded terms when we're querying a not_analyzed field

amitelad7 · 2014-03-26T09:33:38Z

+1 running into this issue as well

clintongormley · 2014-09-05T10:14:16Z

We should add a (default) third setting for the lowercase_expanded_terms of auto. That will only lowercase if the field is analyzed, but not if it is not analyzed.

NickMeves · 2015-06-09T18:45:24Z

+1

Was about to open a new issue then I stumbled across this. Running into this alot lately with both Kibana and Graylog.

Issue is with not_analyzed field using regex for us as well.

khadrin · 2015-07-20T19:58:39Z

👍

pickypg · 2015-08-19T23:20:20Z

I wonder why we even need this setting. The analysis process should take care of lowercasing, if it's desirable for the fields in question, which is not a query-global decision, and it should otherwise not touch the entry.

pickypg · 2015-08-19T23:21:20Z

Highly related to #9978

clintongormley · 2015-08-24T11:19:10Z

Closing in favour of #9978

aaijazi mentioned this issue Apr 24, 2014

Advanced option: lowercase_expanded_terms elastic/kibana#1181

Closed

clintongormley added the discuss label Aug 8, 2014

clintongormley added adoptme and removed discuss labels Sep 5, 2014

NickMeves mentioned this issue Jun 30, 2015

Intelligently add "lowercase_expanded_terms: false" to appropriate queries Graylog2/graylog2-server#1276

Closed

clintongormley closed this as completed Aug 24, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't honor lowercase expanded terms in regex query generated from query string query #3922

Don't honor lowercase expanded terms in regex query generated from query string query #3922

mattweber commented Oct 16, 2013

clintongormley commented Oct 16, 2013

amitelad7 commented Mar 26, 2014

clintongormley commented Sep 5, 2014

NickMeves commented Jun 9, 2015

khadrin commented Jul 20, 2015

pickypg commented Aug 19, 2015

pickypg commented Aug 19, 2015

clintongormley commented Aug 24, 2015

Don't honor lowercase expanded terms in regex query generated from query string query #3922

Don't honor lowercase expanded terms in regex query generated from query string query #3922

Comments

mattweber commented Oct 16, 2013

clintongormley commented Oct 16, 2013

amitelad7 commented Mar 26, 2014

clintongormley commented Sep 5, 2014

NickMeves commented Jun 9, 2015

khadrin commented Jul 20, 2015

pickypg commented Aug 19, 2015

pickypg commented Aug 19, 2015

clintongormley commented Aug 24, 2015