feat: pass ignore_nulls flag to first and last #1866
base: main
Conversation
Force-pushed from 2b122ed to 692332f
The linked PR did not add a test. Would you be able to?
Codecov Report — Attention: Patch coverage is

```
@@            Coverage Diff             @@
##              main    #1866      +/-  ##
============================================
+ Coverage    56.12%   59.44%    +3.31%
- Complexity     976     1151      +175
============================================
  Files          119      130       +11
  Lines        11743    12663      +920
  Branches      2251     2374      +123
============================================
+ Hits          6591     7527      +936
+ Misses        4012     3924       -88
- Partials      1140     1212       +72
```
All the tests for first and last are currently disabled.

Yes, we should figure out a test approach for these non-deterministic functions before we start making changes to the implementation.

@andygrove perhaps we can merge this while we wait for the tests to be made more accurate?

I will create a PR today to add correctness tests for first/last. We can then rebase this PR and make sure that there are no regressions.
@rluvaton Here is a test that fails in main and passes with your changes in this PR. Could you add this?

```scala
test("first/last with ignore null") {
  val data = Range(0, 8192).flatMap(n => Seq((n, 1), (n, 2))).toDF("a", "b")
  withTempDir { dir =>
    val filename = s"${dir.getAbsolutePath}/first_last_ignore_null.parquet"
    data.write.parquet(filename)
    withSQLConf(CometConf.COMET_BATCH_SIZE.key -> "100") {
      spark.read.parquet(filename).createOrReplaceTempView("t1")
      for (expr <- Seq("first", "last")) {
        // deterministic query that should return one non-null value per group
        val df = spark.sql(
          s"SELECT a, $expr(IF(b==1,null,b)) IGNORE NULLS FROM t1 GROUP BY a ORDER BY a")
        checkSparkAnswerAndOperator(df)
      }
    }
  }
}
```
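Since the existing first/last tests are disabled, it may help to spell out the semantics the test above checks. The following self-contained Rust sketch is illustrative only — the `first`/`last` helpers are hypothetical models, not Comet's native implementation — and shows what the `ignore_nulls` flag changes for one group of the test, where `b == 1` is mapped to null:

```rust
// Hypothetical model of first/last semantics (not Comet's actual code):
// with IGNORE NULLS, null inputs are skipped; without it, a leading or
// trailing null is returned as the result.
fn first(values: &[Option<i32>], ignore_nulls: bool) -> Option<i32> {
    if ignore_nulls {
        // Skip nulls: take the first non-null value, if any.
        values.iter().copied().flatten().next()
    } else {
        // Respect nulls: take whatever the first value is, even null.
        values.first().copied().flatten()
    }
}

fn last(values: &[Option<i32>], ignore_nulls: bool) -> Option<i32> {
    if ignore_nulls {
        values.iter().rev().copied().flatten().next()
    } else {
        values.last().copied().flatten()
    }
}

fn main() {
    // One group from the test: the row with b == 1 becomes null, b == 2 stays.
    let group = vec![None, Some(2)];
    assert_eq!(first(&group, true), Some(2)); // IGNORE NULLS skips the null
    assert_eq!(first(&group, false), None);   // RESPECT NULLS returns the null
    assert_eq!(last(&group, true), Some(2));
    println!("ok");
}
```

The small `COMET_BATCH_SIZE` in the test forces groups to span batch boundaries, which is exactly where a dropped `ignore_nulls` flag would produce a wrong (null) answer.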
Which issue does this PR close?
N/A
Rationale for this change
Actually use the `ignore_nulls` flag from the `ignoreNulls` support added to `first_value` and `last_value` in #1626.

What changes are included in this PR?
Forward `ignore_nulls` in `first` and `last`, and no longer mark these functions as unsupported.
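A hedged sketch of the shape of this change, with hypothetical names (`AggSpec` and `plan_first` are illustrative, not Comet's actual API): previously the Spark-side `ignoreNulls` flag was effectively dropped when `first`/`last` were translated to the native plan; the fix threads it through:

```rust
// Illustrative only: models forwarding the flag from the Spark expression
// into the native aggregate spec, instead of silently discarding it.
#[derive(Debug, PartialEq)]
struct AggSpec {
    func: &'static str,
    ignore_nulls: bool,
}

// After the change: the flag from the Spark expression is forwarded as-is.
fn plan_first(spark_ignore_nulls: bool) -> AggSpec {
    AggSpec {
        func: "first",
        ignore_nulls: spark_ignore_nulls,
    }
}

fn main() {
    assert_eq!(plan_first(true).ignore_nulls, true);
    assert_eq!(plan_first(false).ignore_nulls, false);
    println!("forwarded");
}
```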
How are these changes tested?
This has the same problem as #1626: all the tests for `first` and `last` are disabled.