Query parsers to throw exception when multiple field names are provided #19791

javanna · 2016-08-03T20:26:44Z

Most of our queries support a single field name, the field that gets queried. That is the key of the object which contains the query options. For such queries, in case multiple fields are presented, in most of the cases the query goes through and either the last or the first field only will be read and queried.

This PR changes the behaviour of all those parsers to throw exception in case multiple field names are provided.

Closes #19547

javanna · 2016-08-03T20:27:58Z

core/src/main/java/org/elasticsearch/index/query/CommonTermsQueryBuilder.java

            }
        }

-        if (text == null) {


I removed these checks at the end of parsing as they are not needed. The query builder throws error anyways when required fields are missing as we validate its input and that is also what we test already (testIllegalArguments test method).

jpountz · 2016-08-05T06:34:05Z

I left some comments, this looks great!

tlrx · 2016-08-05T08:36:04Z

core/src/main/java/org/elasticsearch/index/query/MatchQueryBuilder.java

+                    throw new ParsingException(parser.getTokenLocation(), "[match] query doesn't support multiple fields, found ["
+                            + fieldName + "] and [" + currentFieldName + "]");
+                }
+                fieldName = currentFieldName;


Match query is tricky because it can be written as:

{ "match" : { "message1" : { "query" : "this is a test" } } }

but also:

{ "match" : { "message1" : "this is a test" } }

and in case of multiple fields we just catch the first form, not the second which is very used too.

many queries actually have a short syntax like the match one. I'm afraid what you bring up is a problem that's common to them all. I will dig.

Yes, I know, sorry for that :(

I suspect that this situation is already at least partially caught by making QueryParseContext#parseInnerQueryBuilder stricter as in checking what is the current token after the query gets parsed. I have ideas on how to test it too but I'd prefer to do this in a followup PR if you don' mind. We already test these short syntaxes but we never inject bogus objects in them like we do with the "standard" json output, we can totally introduce that for more coverage.

Sure.

We already test these short syntaxes but we never inject bogus objects in them like we do with the "standard" json output, we can totally introduce that for more coverage.

Since query builders now implements ToXContent I think we could introduce bogus object using a XContentBuilder that randomly duplicates fields. Just an idea.

tlrx · 2016-08-05T08:45:17Z

It's a nice change, thanks for doing that. I left comments concerning the queries that can be written differently like match* queries. I don't know if we can handle every format here but that would be great.

Range Query, like many other queries, used to parse when the query refers to multiple fields and the last one would win. We rather throw an exception now instead. Closes elastic#19547

Prefix Query, like many other queries, used to parse when the query refers to multiple fields and the last one would win. We rather throw an exception now instead. Also added tests for short prefix quer variant.

Regexp Query, like many other queries, used to parse even when the query referred to multiple fields and the last one would win. We rather throw an exception now instead. Also added test for short prefix query variant.

Wildcard Query, like many other queries, used to parse even when the query referred to multiple fields and the first one would win. We rather throw an exception now instead. Also added test for short prefix query variant and modified the parsing code to consume the whole query object.

Match phrase Query, like many other queries, used to parse even when the query referred to multiple fields and the first one would win. We rather throw an exception now instead. Also added test for short prefix query variant and modified the parsing code to consume the whole query object.

Geo distance Query, like many other queries, used to parse even when the query referred to multiple fields and the last one would win. We rather throw an exception now instead.

…elds Match phrase prefix Query, like many other queries, used to parse even when the query referred to multiple fields and the first one would win. We rather throw an exception now instead. Also added test for short prefix query variant and modified the parsing code to consume the whole query object.

Match Query, like many other queries, used to parse even when the query referred to multiple fields and the first one would win. We rather throw an exception now instead. Also added test for short prefix query variant and modified the parsing code to consume the whole query object.

Common Terms Query, like many other queries, used to parse even when the query referred to multiple fields and the first one would win. We rather throw an exception now instead. Also added test for short prefix query variant and modified the parsing code to consume the whole query object.

Span term Query, like many other queries, used to parse even when the query referred to multiple fields and the first one would win. We rather throw an exception now instead. Also modified the parsing code to consume the whole query object.

…ields

Fuzzy Query, like many other queries, used to parse even when the query referred to multiple fields and the first one would win. We rather throw an exception now instead. Also added test for short prefix query variant and modified the parsing code to consume the whole query object.

…BJECT token Instead of being lenient in QueryParseContext#parseInnerQueryBuilder we check that the token where the parser stopped reading was END_OBJECT, and throw error otherwise. This is a best effort to verify that the parsers read a whole object rather than stepping out in the middle of it due to malformed queries.

javanna added >enhancement review v5.0.0-beta1 labels Aug 3, 2016

javanna reviewed Aug 3, 2016
View reviewed changes

tlrx reviewed Aug 5, 2016
View reviewed changes

javanna added 17 commits August 5, 2016 10:58

Throw parsing error if range query contains multiple fields

11e4b01

Range Query, like many other queries, used to parse when the query refers to multiple fields and the last one would win. We rather throw an exception now instead. Closes elastic#19547

Throw parsing error if prefix query contains multiple fields

69c2dee

Prefix Query, like many other queries, used to parse when the query refers to multiple fields and the last one would win. We rather throw an exception now instead. Also added tests for short prefix quer variant.

Throw parsing error if regexp query contains multiple fields

003a7b6

Regexp Query, like many other queries, used to parse even when the query referred to multiple fields and the last one would win. We rather throw an exception now instead. Also added test for short prefix query variant.

[TEST] check validation error messages in IdsQueryBuilderTests

195320f

Throw parsing error if geo_distance query contains multiple fields

ad8f5e7

Geo distance Query, like many other queries, used to parse even when the query referred to multiple fields and the last one would win. We rather throw an exception now instead.

[TEST] check validation error messages in AbstractTermQueryTestCase

389bd06

[TEST] test that term query throws error when made against multiple f…

6d228bb

…ields

fix line length in FuzzyQueryBuilder

6a5c44a

[TEST] use expectThrows wherever possible in query builder unit tests

7f0bd56

javanna force-pushed the fix/multiple_fields_queries branch from a4c5df3 to 7f0bd56 Compare August 5, 2016 11:55

javanna merged commit 4c1a3b9 into elastic:master Aug 5, 2016

javanna mentioned this pull request Aug 8, 2016

Throw exception when multiple field names are provided as part of query short syntax #19871

Merged

javanna mentioned this pull request Sep 16, 2016

Throw error if query element doesn't end with END_OBJECT #20528

Merged

clintongormley added :Search/Search Search-related issues that do not fall into other categories and removed :Query DSL labels Feb 14, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Query parsers to throw exception when multiple field names are provided #19791

Query parsers to throw exception when multiple field names are provided #19791

javanna commented Aug 3, 2016

javanna Aug 3, 2016 •

edited

Loading

jpountz commented Aug 5, 2016

tlrx Aug 5, 2016 •

edited

Loading

javanna Aug 5, 2016

tlrx Aug 5, 2016

javanna Aug 5, 2016

tlrx Aug 5, 2016

tlrx commented Aug 5, 2016

Query parsers to throw exception when multiple field names are provided #19791

Query parsers to throw exception when multiple field names are provided #19791

Conversation

javanna commented Aug 3, 2016

javanna Aug 3, 2016 • edited Loading

Choose a reason for hiding this comment

jpountz commented Aug 5, 2016

tlrx Aug 5, 2016 • edited Loading

Choose a reason for hiding this comment

javanna Aug 5, 2016

Choose a reason for hiding this comment

tlrx Aug 5, 2016

Choose a reason for hiding this comment

javanna Aug 5, 2016

Choose a reason for hiding this comment

tlrx Aug 5, 2016

Choose a reason for hiding this comment

tlrx commented Aug 5, 2016

javanna Aug 3, 2016 •

edited

Loading

tlrx Aug 5, 2016 •

edited

Loading