RangeQuery without lower term and inclusive=false skips blank fields [LUCENE-38] #1116

asfimport · 2002-06-11T22:02:41Z

This was reported by "James Ricci" <james@riccinursery.com> at:
http://nagoya.apache.org/eyebrowse/ReadMsg?listName=lucene-user@jakarta.apache.org&msgNo=1835

When you create a range query and omit the lower term, my expectation
would be that I would find everything less than the upper term. Now if I pass
false for the inclusive flag, then I would expect to find all
terms less than the upper term, excluding the upper term itself.

What is happening in the case of lower_term=null, upper_term=x,
inclusive=false is that empty strings are being excluded because
inclusive is set to false, and the implementation of RangeQuery creates a default
lower term of Term(fieldName, ""). Since it's not inclusive, it excludes "".
This isn't what I intended, and I don't think it's what most people would
imagine RangeQuery would do in the case I've mentioned.

I equate lower=null, upper=x, inclusive=false to Field < x. lower=null,
upper=x, inclusive=true would be Field <= x. In both cases, the only
difference should be whether or not Field = x is true for the query.
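
To make the distinction concrete, here is a minimal Java sketch of the two behaviors described above. It is a plain-string model, not Lucene's actual RangeQuery code: the class and method names (RangeSemanticsSketch, reportedMatch, expectedMatch) are hypothetical, and the upper term is assumed to be non-null.

```java
// Hypothetical model of the range check; not Lucene's actual RangeQuery code.
final class RangeSemanticsSketch {

    // Reported behavior: a null lower term is replaced by "" and the single
    // inclusive flag applies to both ends, so "" is dropped when inclusive=false.
    static boolean reportedMatch(String term, String lower, String upper, boolean inclusive) {
        String effectiveLower = (lower == null) ? "" : lower;
        int lo = term.compareTo(effectiveLower);
        int hi = term.compareTo(upper);
        return inclusive ? (lo >= 0 && hi <= 0) : (lo > 0 && hi < 0);
    }

    // Expected behavior: a null lower term means "no lower bound", so the
    // inclusive flag only decides whether term == upper matches.
    static boolean expectedMatch(String term, String lower, String upper, boolean inclusive) {
        boolean aboveLower = true;
        if (lower != null) {
            int lo = term.compareTo(lower);
            aboveLower = inclusive ? lo >= 0 : lo > 0;
        }
        int hi = term.compareTo(upper);
        boolean belowUpper = inclusive ? hi <= 0 : hi < 0;
        return aboveLower && belowUpper;
    }

    public static void main(String[] args) {
        // Empty field value, lower=null, upper="m", inclusive=false
        System.out.println(reportedMatch("", null, "m", false)); // false
        System.out.println(expectedMatch("", null, "m", false)); // true
    }
}
```

With expectedMatch, switching inclusive from false to true changes the result only for a term equal to the upper term, which is the Field < x versus Field <= x distinction.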

Migrated from LUCENE-38 by Otis Gospodnetic, resolved Nov 13 2008

Environment:
Operating System: other
Platform: Other

Attachments: LUCENE-38.patch, TestRangeQuery.patch

asfimport · 2006-08-28T07:11:42Z

Dejan Nenov (migrated from JIRA)

Added additional tests, using "null" as the lower term in the range query. The tests are commented to indicate how they should be modified to behave once this LUCENE-38 is fixed.

asfimport · 2008-11-12T23:45:58Z

Mark Miller (@markrmiller) (migrated from JIRA)

Does this need to be 'fixed'? RangeQuery now uses the semantics from ConstantScoreRangeQuery, which decided that open-ended sides of a range must be inclusive (and are converted as such if not). Is that acceptable, so that we can close this bug? Or do we jump through a hoop or two for this rather niche case?
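
For readers unfamiliar with that rule, a rough Java sketch of the "open-ended sides become inclusive" conversion might look like the following. This is an illustration under assumptions, not the ConstantScoreRangeQuery source: OpenEndedRangeSketch and its members are hypothetical names, and terms are modeled as plain strings.

```java
// Hypothetical sketch; the class and member names are illustrative, not Lucene's API.
final class OpenEndedRangeSketch {
    private final String lower;          // null means "no lower bound"
    private final String upper;          // null means "no upper bound"
    private final boolean includeLower;
    private final boolean includeUpper;

    OpenEndedRangeSketch(String lower, String upper, boolean includeLower, boolean includeUpper) {
        this.lower = lower;
        this.upper = upper;
        // An open-ended side is always treated as inclusive, whatever the caller passed.
        this.includeLower = (lower == null) || includeLower;
        this.includeUpper = (upper == null) || includeUpper;
    }

    boolean includesLower() { return includeLower; }
    boolean includesUpper() { return includeUpper; }

    boolean matches(String term) {
        if (lower != null) {
            int cmp = term.compareTo(lower);
            if (cmp < 0 || (cmp == 0 && !includeLower)) return false;
        }
        if (upper != null) {
            int cmp = term.compareTo(upper);
            if (cmp > 0 || (cmp == 0 && !includeUpper)) return false;
        }
        return true;
    }

    public static void main(String[] args) {
        // lower=null with includeLower=false is converted to an inclusive open end.
        OpenEndedRangeSketch range = new OpenEndedRangeSketch(null, "m", false, false);
        System.out.println(range.includesLower()); // true
        System.out.println(range.matches(""));     // true
        System.out.println(range.matches("m"));    // false
    }
}
```

Under this rule the case in the report (lower=null, inclusive=false) would still match an empty-string term, because the exclusive flag only takes effect on a side that actually has a bound.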

asfimport · 2008-11-13T04:32:19Z

Otis Gospodnetic (@otisg) (migrated from JIRA)

This thing is 6+ years old and I don't recall this being mentioned on the list in the last half a decade. I'll leave you the Won't Fix pleasure, Mark.

asfimport · 2008-11-13T10:06:30Z

Michael McCandless (@mikemccand) (migrated from JIRA)

Actually, this should have already worked, because RangeTermEnum forces includeLower to be true when lowerTermText is null.

But indeed the test still fails, so I dug into it a bit and I think the test is faulty. The test expects the empty string doc ("") to be returned as a result, but the problem is that the empty string doc, when analyzed, does not produce an empty string Token. So I modified the test (attached) to use an analyzer that emits an empty string token, and then the test passes as expected.

I'll commit shortly.
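
The analyzer point can be illustrated with a small Java sketch that does not use Lucene at all. It assumes a tokenizer that splits on whitespace and never emits zero-length tokens, and contrasts it with one that emits the whole field value as a single token; whitespaceTokens and singleToken are hypothetical helpers, not the analyzers used in the attached test.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical tokenizers for illustration only; not Lucene Analyzer implementations.
final class EmptyTokenSketch {

    // Splits on whitespace and skips zero-length pieces, so "" yields no tokens
    // and therefore no "" term would reach the index.
    static List<String> whitespaceTokens(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.split("\\s+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    // Emits the entire value as one token, so "" yields a single empty-string token.
    static List<String> singleToken(String text) {
        List<String> tokens = new ArrayList<>();
        tokens.add(text);
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(whitespaceTokens("").size()); // 0
        System.out.println(singleToken("").size());      // 1
    }
}
```

Only the second style gives a range query an empty-string term to match, which is consistent with the test passing once the analyzer was changed.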

asfimport · 2008-11-13T10:08:21Z

Michael McCandless (@mikemccand) (migrated from JIRA)

Committed revision 713696.

asfimport closed this as completed Nov 13, 2008
