Skip to content

Conversation

@andygrove
Copy link
Member

@andygrove andygrove commented Jan 27, 2026

Which issue does this PR close?

Closes #3300

Rationale for this change

Enable by default for better performance and reduced GC for JVM shuffle where the child plan is a Comet plan.

What changes are included in this PR?

  • Enable config by default
  • Disable config in stability suite (so that I didn't have to update golden files and make this PR huge). I will do this as a separate PR
  • Update diffs for some Spark SQL tests because the plan is different now

How are these changes tested?

@codecov-commenter
Copy link

codecov-commenter commented Jan 27, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 59.95%. Comparing base (f09f8af) to head (f1944c7).
⚠️ Report is 901 commits behind head on main.

Additional details and impacted files
@@             Coverage Diff              @@
##               main    #3299      +/-   ##
============================================
+ Coverage     56.12%   59.95%   +3.82%     
- Complexity      976     1473     +497     
============================================
  Files           119      175      +56     
  Lines         11743    16167    +4424     
  Branches       2251     2682     +431     
============================================
+ Hits           6591     9693    +3102     
- Misses         4012     5126    +1114     
- Partials       1140     1348     +208     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

andygrove and others added 3 commits January 27, 2026 10:15
Add support for native columnar-to-row conversion in Spark SQL tests:
- SparkPlanSuite: Handle CometNativeColumnarToRowExec in SPARK-37779 test
- BucketedReadSuite: Handle CometNativeColumnarToRowExec when extracting
  join operators from execution plans

This fixes test failures when native C2R is enabled by default.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Add support for native columnar-to-row conversion in Spark SQL tests:
- SparkPlanSuite: Handle CometNativeColumnarToRowExec in SPARK-37779 test
- BucketedReadSuite: Handle CometNativeColumnarToRowExec when extracting
  join operators from execution plans

This fixes test failures when native C2R is enabled by default.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Add support for native columnar-to-row conversion in Spark SQL tests:
- SparkPlanSuite: Handle CometNativeColumnarToRowExec in SPARK-37779 test
- BucketedReadSuite: Handle CometNativeColumnarToRowExec when extracting
  join operators from execution plans

This fixes test failures when native C2R is enabled by default.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@andygrove andygrove changed the title feat: Enable native columnar-to-row by default [WIP] feat: Enable native columnar-to-row by default Jan 27, 2026
@andygrove andygrove marked this pull request as ready for review January 27, 2026 19:14
Skip WholeStageCodegenSuite tests when Comet is enabled, matching the
behavior in Spark 3.5. This fixes test failures like "Sort should be
included in WholeStageCodegen" when native C2R is enabled.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
// Disable AQE because the WholeStageCodegenExec is added when running QueryStageExec
class WholeStageCodegenSuite extends QueryTest with SharedSparkSession
- with DisableAdaptiveExecutionSuite {
+ with DisableAdaptiveExecutionSuite with IgnoreCometSuite {
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This matches what we were already doing for 3.5

Co-authored-by: Oleks V <comphead@users.noreply.github.com>
Copy link
Contributor

@comphead comphead left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @andygrove would be a nice note on the release notes

@andygrove andygrove merged commit 22fdec9 into apache:main Jan 28, 2026
170 of 171 checks passed
@andygrove andygrove deleted the native-c2r-enabled-by-default branch January 28, 2026 03:22
vigneshsiva11 pushed a commit to vigneshsiva11/datafusion-comet that referenced this pull request Jan 29, 2026
mbutrovich added a commit to mbutrovich/datafusion-comet that referenced this pull request Jan 29, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Spark SQL test failures when native columnar-to-row is enabled by default

3 participants