ES|QL: Fix wrong pruning of plans with no output columns #133405

luigidellaquila · 2025-08-22T16:30:17Z

Fixes: #120272

Fixes queries where all the columns from the original index are projected out.
The expected outcome is that the number of rows is preserved.

This means that a query like

FROM idx
| LIMIT 3
| KEEP foo
| DROP foo             // drop all the columns
| EVAL bar = 1

that currently returns nothing

bar
----

will start returning values:

bar
----
1
1
1

This will also apply to aggregations, e.g.

FROM idx
| KEEP foo
| DROP foo             // drop all the columns
| EVAL bar = 1
| LIMIT 3
| STATS count = count(*)

that currently returns 0, will start returning the actual row count (3 in this case).

I don't know if it's breaking (IMHO it's a bug, but it still changes the behavior).

TODO

Need to add all the queries from the original issue to the CSV tests.

elasticsearchmachine · 2025-08-22T16:31:15Z

Hi @luigidellaquila, I've created a changelog YAML for you.

luigidellaquila · 2025-08-25T12:50:24Z

...k/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/optimizer/LogicalPlanOptimizer.java

            new PruneRedundantSortClauses(),
-            new PruneLeftJoinOnNullMatchingField()
+            new PruneLeftJoinOnNullMatchingField(),
+            new PruneEmptyAggregates()


We need this because our aggs don't know how to deal with no grouping and no aggs at the same time.

luigidellaquila · 2025-08-25T12:50:55Z

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/session/FieldNameUtils.java

            // there cannot be an empty list of fields, we'll ask the simplest and lightest one instead: _index
            return new PreAnalysisResult(enrichResolution, IndexResolver.INDEX_METADATA_FIELD, wildcardJoinIndices);
        } else {
+            fieldNames.add(MetadataAttribute.INDEX);


This is the actual fix.

Here I think it would be more correct to add this field after FieldNameUtils::withSubfields is called on the next line. _index.* doesn't actually make sense, as we know there is only one _index metadata field.

elasticsearchmachine · 2025-08-25T15:07:03Z

Pinging @elastic/es-analytical-engine (Team:Analytics)

ivancea

LGTM!
However, I would wait for Andrei/Alex to check it, as the _index fix was initially discarded in favor of a more complete approach when the issue was opened.
This PR covers more areas than the initial approach though, and initially this wasn't a common issue.

x-pack/plugin/esql/qa/testFixtures/src/main/resources/drop.csv-spec

...c/internalClusterTest/java/org/elasticsearch/xpack/esql/action/CrossClusterLookupJoinIT.java

astefan

I think it looks good. Some comments:

from employees | keep salary | eval c = 1 | drop c, salary

returns

{
    "took": 29,
    "is_partial": false,
    "documents_found": 100,
    "values_loaded": 0,
    "columns": [],
    "values": [
        [],
        [],
        [],
        [],
        [],
        [],
        [],
        [],
        [],
		....

This, to me at least (as a regular user of ESQL), seems surprising. It represents a bunch of "empty" virtual rows, so to speak. This looks like an implementation detail leaking into the UX; I would have expected the "no columns and X rows" use case to be handled in a more user-friendly way. That would also save some KBs of transferred data when the query runs on a large data set.

Conceptually speaking, we are introducing the notion of "nothing/no columns + X rows", but the impact of this concept in the UX is a bit too intrusive imho.

Looking at the logs of this query:

ProjectExec[[<all-fields-projected>{r$}#204]]
\_EvalExec[[null[NULL] AS <all-fields-projected>#204]]
  \_EsQueryExec[employees], indexMode[standard], [_doc...

This "no columns + X rows" is represented as <all-fields-projected>, and it is a query that reaches ES in a particular way. I am wondering whether it couldn't be implemented, more efficiently, as a count: we would ask ES for a count of rows and then, potentially on the coordinator node, "expand" this result into "count" number of "empty" rows.

But this can be regarded as food for thought for the future.
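
The "count, then expand on the coordinator" idea above could be sketched roughly as follows. This is only an illustration under stated assumptions: `EmptyRowExpander` and `expand` are hypothetical names, not existing ES|QL classes, and the real engine works with pages and blocks rather than Java lists.

```java
import java.util.Collections;
import java.util.List;

// Hypothetical helper for illustration only; nothing here is an existing ES|QL class.
final class EmptyRowExpander {
    private EmptyRowExpander() {}

    /** Expands a row count into that many zero-column rows. */
    static List<List<Object>> expand(long count) {
        if (count < 0 || count > Integer.MAX_VALUE) {
            throw new IllegalArgumentException("unsupported row count: " + count);
        }
        // nCopies returns an immutable list backed by a single element,
        // so memory use does not grow with the count.
        return Collections.nCopies((int) count, List.<Object>of());
    }

    public static void main(String[] args) {
        List<List<Object>> values = expand(3);
        System.out.println(values); // [[], [], []]
    }
}
```

With this shape, only the count would need to travel from the data nodes; the empty rows would be materialized (or not) where the response is built.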

I would add the following test as well. It is essentially an empty result set, but obtained while also dropping all columns:

from employees | keep salary | eval c = 1 | drop c, salary | where false

Other tests to add are those related to inline stats, because it also uses aggregates (which were changed as part of this PR).

There is also the story around drop *, which is not allowed at the moment.
Should we allow it now that we can handle a "drop all" scenario? Again, this is something to put in a separate issue and maybe discuss there.
I would also add something that goes to more than one index:

from employees* | keep salary | drop salary | eval x = 1 | stats count()

[LATER EDIT] Some tests that use fork would help:

astefan · 2025-10-01T12:14:31Z

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/analysis/Analyzer.java

                blocks[i++] = column.values();
            }
-            LocalSupplier supplier = LocalSupplier.of(blocks);
+            LocalSupplier supplier = LocalSupplier.of(blocks.length > 0 ? new Page(blocks) : new Page(0));


I am wondering if we could integrate the logic "if the blocks size is 0 then new Page(0) otherwise new Page(blocks)" somehow with LocalSupplier.

We don't want this in general, in some cases we want no blocks but new Page(N)
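
To illustrate why the choice cannot be derived from the blocks alone, here is a minimal sketch. `SimplePage` is a hypothetical model, not the real `Page` class: with at least one block the row count can be read from the data, but with zero blocks it has to be supplied by the caller, and both 0 and N are legitimate values.

```java
import java.util.List;

// Hypothetical model for illustration; not org.elasticsearch.compute.data.Page.
record SimplePage(int positionCount, List<int[]> blocks) {

    /** Row count is derived from the first block; requires at least one block. */
    static SimplePage ofBlocks(List<int[]> blocks) {
        if (blocks.isEmpty()) {
            throw new IllegalArgumentException("no blocks: the row count must be given explicitly");
        }
        return new SimplePage(blocks.get(0).length, blocks);
    }

    /** Zero columns, but still positionCount rows. */
    static SimplePage ofEmptyRows(int positionCount) {
        return new SimplePage(positionCount, List.of());
    }
}
```

In this model, `SimplePage.ofEmptyRows(0)` and `SimplePage.ofEmptyRows(3)` both have no blocks, yet they describe different results, which is why a supplier that only sees the blocks cannot pick between them.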

astefan · 2025-10-01T12:19:28Z

...ain/java/org/elasticsearch/xpack/esql/optimizer/rules/logical/ReplaceRowAsLocalRelation.java

        fields.forEach(f -> values.add(f.child().fold(context.foldCtx())));
        var blocks = BlockUtils.fromListRow(PlannerUtils.NON_BREAKING_BLOCK_FACTORY, values);
-        return new LocalRelation(row.source(), row.output(), new CopyingLocalSupplier(blocks));
+        return new LocalRelation(row.source(), row.output(), new CopyingLocalSupplier(blocks.length == 0 ? new Page(0) : new Page(blocks)));


Same here about this common logic where a Page is built this way.

astefan · 2025-10-01T12:21:26Z

...esql/src/main/java/org/elasticsearch/xpack/esql/plan/logical/local/CopyingLocalSupplier.java


-    public CopyingLocalSupplier(Block[] blocks) {
-        delegate = new ImmediateLocalSupplier(blocks);
+    public CopyingLocalSupplier(Page page) {


The javadoc of this class needs an update given the change of the constructor?

astefan · 2025-10-01T12:25:25Z

...src/main/java/org/elasticsearch/xpack/esql/optimizer/rules/logical/PruneEmptyAggregates.java

+
+import java.util.List;
+
+public final class PruneEmptyAggregates extends OptimizerRules.OptimizerRule<Aggregate> {


It would help to have a javadoc on this rule. I mean, it is obvious what it does, but it would be helpful to read about the situations that lead to an Aggregate with no aggregates and no groupings.

astefan · 2025-10-01T12:42:08Z

...main/java/org/elasticsearch/xpack/esql/optimizer/rules/physical/local/PushStatsToSource.java

            // for the moment support pushing count just for one field
            List<EsStatsQueryExec.Stat> stats = tuple.v2();
-            if (stats.size() > 1) {
+            if (stats.size() != 1) {


What led to this change?

It's because now all the stats can be pruned, so also the case with size == 0 has to be taken into consideration
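
The difference between the two guards can be seen in a small standalone sketch. `StatsPushdownGuard` and `canPushToSource` are hypothetical stand-ins, not the actual `PushStatsToSource` code; stats are modeled as plain strings.

```java
import java.util.List;

// Hypothetical stand-in for the pushdown guard; not the actual PushStatsToSource code.
final class StatsPushdownGuard {
    private StatsPushdownGuard() {}

    /** Returns true only when exactly one stat is present. */
    static boolean canPushToSource(List<String> stats) {
        // A check of `stats.size() > 1` would let an empty list through;
        // requiring exactly one stat rejects the empty case as well.
        return stats.size() == 1;
    }

    public static void main(String[] args) {
        System.out.println(canPushToSource(List.of()));                       // false
        System.out.println(canPushToSource(List.of("count(*)")));             // true
        System.out.println(canPushToSource(List.of("count(a)", "count(b)"))); // false
    }
}
```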

astefan · 2025-10-01T12:47:39Z

x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/session/FieldNameUtils.java

            // there cannot be an empty list of fields, we'll ask the simplest and lightest one instead: _index
            return new PreAnalysisResult(enrichResolution, IndexResolver.INDEX_METADATA_FIELD, wildcardJoinIndices);
        } else {
+            fieldNames.add(MetadataAttribute.INDEX);


Here I think it would be more correct to add this field after FieldNameUtils::withSubfields is called on the next line. _index.* doesn't actually make sense, as we know there is only one _index metadata field.
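
A rough sketch of the suggested ordering, under the assumption (not confirmed in this thread) that withSubfields expands each name into the name itself plus a `name.*` pattern. `FieldNamesSketch` and `resolve` are hypothetical names used only for illustration.

```java
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical sketch; the real FieldNameUtils::withSubfields may behave differently.
final class FieldNamesSketch {
    private FieldNamesSketch() {}

    /** ASSUMPTION: each name is expanded to itself plus a "name.*" subfield pattern. */
    static Set<String> withSubfields(Set<String> names) {
        Set<String> out = new LinkedHashSet<>();
        for (String name : names) {
            out.add(name);
            out.add(name + ".*");
        }
        return out;
    }

    /** Adds _index only after the expansion, so no "_index.*" pattern is produced. */
    static Set<String> resolve(Set<String> names) {
        Set<String> out = withSubfields(names);
        out.add("_index");
        return out;
    }
}
```

Adding `_index` before the expansion would, in this model, also request `_index.*`, which is the pattern the comment calls out as meaningless.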

astefan · 2025-10-01T14:00:26Z

...qa/testFixtures/src/main/java/org/elasticsearch/xpack/esql/generator/EsqlQueryGenerator.java

        ChangePointGenerator.INSTANCE,
        DissectGenerator.INSTANCE,
        DropGenerator.INSTANCE,
+        DropAllGenerator.INSTANCE,


luigidellaquila · 2025-10-02T09:14:00Z

Thanks for the review @astefan, I just pushed a commit that incorporates your suggestions.

Please let me know if you have further remarks or if you think it's worth discussing it with a wider audience.

When it's done, I'll open two follow-up issues, one to allow DROP * and one for possible optimizations.

astefan

LGTM. Thanks @luigidellaquila

luigidellaquila added 2 commits August 22, 2025 18:11

ES|QL:Fix wrong pruning of plans with no output columns

ad961ae

Fix test

21dc299

luigidellaquila added >bug :Analytics/ES|QL AKA ESQL labels Aug 22, 2025

elasticsearchmachine added the v9.2.0 label Aug 22, 2025

Update docs/changelog/133405.yaml

de95b79

luigidellaquila added 6 commits August 22, 2025 19:57

Fix BWC

42d364f

Merge remote-tracking branch 'luigidellaquila/esql/fix_no_columns' in…

544d6dd

…to esql/fix_no_columns

Restore original tests

c2f01d6

Merge branch 'main' into esql/fix_no_columns

ba01afe

Fix tests

764cace

Refactor local suppliers to return a Page

642b50a

luigidellaquila commented Aug 25, 2025

Fix flaky test

3ae2314

luigidellaquila marked this pull request as ready for review August 25, 2025 15:06

elasticsearchmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Aug 25, 2025

luigidellaquila mentioned this pull request Aug 25, 2025

ESQL: No rows with JOIN/ENRICH/SAMPLE + EVAL + KEEP #120272

Closed

ivancea approved these changes Aug 25, 2025

Merge branch 'main' into esql/fix_no_columns

e09cabd

astefan requested a review from alex-spies August 26, 2025 13:40

luigidellaquila added 7 commits August 27, 2025 14:44

Merge branch 'main' into esql/fix_no_columns

a1aaa5b

More tests

0421de5

Tests

34b995a

Merge branch 'main' into esql/fix_no_columns

2e35ab3

More tests

be75856

Merge branch 'main' into esql/fix_no_columns

657c94e

Merge branch 'main' into esql/fix_no_columns

a3cccde

luigidellaquila added 8 commits September 8, 2025 09:52

Merge branch 'main' into esql/fix_no_columns

dd74c0b

More tests

afd0350

Merge branch 'main' into esql/fix_no_columns

b45fc9a

Fix pushdown stats and new tests

b42b09a

BWC

3a388a7

Merge branch 'main' into esql/fix_no_columns

076db41

Merge branch 'main' into esql/fix_no_columns

b0a9d02

Merge branch 'main' into esql/fix_no_columns

130fa2e

luigidellaquila mentioned this pull request Sep 29, 2025

ES|QL Empty index resolution #135601

Closed

alex-spies requested review from astefan and removed request for alex-spies September 30, 2025 08:15

luigidellaquila added 7 commits September 30, 2025 14:41

Merge branch 'main' into esql/fix_no_columns

c0ac934

Fix compile and add transport version

710fdcc

Fix test

edc3890

Merge branch 'main' into esql/fix_no_columns

d2a5631

Merge branch 'main' into esql/fix_no_columns

921ecb8

Merge branch 'main' into esql/fix_no_columns

59207cb

Merge remote-tracking branch 'luigidellaquila/esql/fix_no_columns' in…

bb45498

…to esql/fix_no_columns

astefan reviewed Oct 1, 2025

elasticsearchmachine added v9.3.0 and removed v9.2.0 labels Oct 2, 2025

luigidellaquila added 3 commits October 2, 2025 09:37

Merge branch 'main' into esql/fix_no_columns

d059447

More tests

f8ca703

Implement suggestions

e9ebf8b

astefan self-requested a review October 2, 2025 11:29

astefan approved these changes Oct 2, 2025

luigidellaquila merged commit a399e29 into elastic:main Oct 2, 2025
34 checks passed

