SKIPLIST index not used in AQL query #2116

thierry-github · 2016-10-16T21:18:03Z

my environment running ArangoDB

I'm using the latest ArangoDB of the respective release series:

3.0 (3.0.10)

On this operating system:

[x ] Windows, version: 10

I'm issuing AQL via:

[x ] web interface with this browser: CHROME running on this OS: WIN10

In the documentation, it's written :

A skiplist index will only be used if at least its first attribute is used in a FILTER condition
https://docs.arangodb.com/3.0/Manual/Indexing/IndexUtilization.html

i have on collection named "content" with the following index :

type : skiplist
unique : false
sparse : true
fields : [field_one , field_two]

i'have one record in the collection :
{
"field_one": "one",
"field_two": "two"
}

using this AQL query works correctly but doesn't involve index :
for record in content filter record.field_one == "one" return record
Associated explain result :

Query string:
for record in content filter record.field_one == "one" return record

Execution plan:
Id NodeType Est. Comment
1 SingletonNode 1 * ROOT
2 EnumerateCollectionNode 1 - FOR record IN content /* full collection scan /
3 CalculationNode 1 - LET #1 = (record.field_one == "one") / simple expression / / collections used: record : content */
4 FilterNode 1 - FILTER #1
5 ReturnNode 1 - RETURN record

Indexes used:
none

Optimization rules applied:
none

using this query makes use of index :
for record in content FILTER record.field_one == "one" AND record.field_two == "two" return record
Indexes used:
By Type Collection Unique Sparse Selectivity Fields Ranges
6 skiplist content false true n/a [ field_one, field_two ](%28record.field_one ==) && (record.field_two == "two"))

Am 'i missing somthing in index utilization ?

The text was updated successfully, but these errors were encountered:

DeShadow · 2016-10-16T22:40:42Z

@thierry-github You use sparse index.
The sparse index doesn't store documents with null values. That's why when you make second query for record in content FILTER record.field_one == "one" AND record.field_two == "two" return record
the optimiser knows that no documents with field_one == null or field_two == null will be in the result of query.

But when you make query for record in content filter record.field_one == "one" return record, optimiser knows that there can be record with field_two == null. But this record not in index, because this index is sparse. That's why full scan is required.

You can change your query to:

for record in content filter record.field_one == "one" && record._field_two != null return record

or

for record in content filter record.field_one == "one" && record._field_two > null return record

These two queries say to optimiser that there are no records with null values in result. And optimiser will use your index. :)

Or you can simply use non-sparse index, which indexes documents with null-values too.

thierry-github · 2016-10-16T22:50:26Z

just tried this
for record in content filter record.field_one == "one" AND record.field_two != null return record

but same result. Here is the explanation result

Query string:
for record in content filter record.field_one == "one" AND record.field_two != null return record

Execution plan:
Id NodeType Est. Comment
1 SingletonNode 1 * ROOT
2 EnumerateCollectionNode 1 - FOR record IN content /* full collection scan /
3 CalculationNode 1 - LET #1 = ((record.field_one == "one") && (record.field_two != null)) / simple expression / / collections used: record : content */
4 FilterNode 1 - FILTER #1
5 ReturnNode 1 - RETURN record

Indexes used:
none

Optimization rules applied:
none
but the second one works
for record in content filter record.field_one == "one" && record.field_two > null return record
Query string:
for record in content filter record.field_one == "one" && record.field_two > null return record

Execution plan:
Id NodeType Est. Comment
1 SingletonNode 1 * ROOT
6 IndexNode 1 - FOR record IN content /* skiplist index scan */
5 ReturnNode 1 - RETURN record

Indexes used:
By Type Collection Unique Sparse Selectivity Fields Ranges
6 skiplist content false true n/a [ field_one, field_two ](%28record.field_one ==) && (record.field_two > null))

Optimization rules applied:
Id RuleName
1 use-indexes
2 remove-filter-covered-by-index
3 remove-unnecessary-calculations-2

DeShadow · 2016-10-16T23:06:30Z

@thierry-github Optimiser doesn't works good with expression !=, that's why it's not use index in obvious case. :( ArangoDB knows about it and will improve it :)

thierry-github · 2016-10-16T23:28:58Z

@DeShadow As you said that ArangoDB knows about it, i close the issue. Thanks for your answers.

thierry-github closed this as completed Oct 16, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SKIPLIST index not used in AQL query #2116

SKIPLIST index not used in AQL query #2116

thierry-github commented Oct 16, 2016

DeShadow commented Oct 16, 2016 •

edited

thierry-github commented Oct 16, 2016 •

edited

DeShadow commented Oct 16, 2016

thierry-github commented Oct 16, 2016

SKIPLIST index not used in AQL query #2116

SKIPLIST index not used in AQL query #2116

Comments

thierry-github commented Oct 16, 2016

my environment running ArangoDB

DeShadow commented Oct 16, 2016 • edited

thierry-github commented Oct 16, 2016 • edited

DeShadow commented Oct 16, 2016

thierry-github commented Oct 16, 2016

DeShadow commented Oct 16, 2016 •

edited

thierry-github commented Oct 16, 2016 •

edited