Skip to content

fix: add TimestampLTZ-as-NTZ correctness tests and compatibility docs#4220

Open
andygrove wants to merge 9 commits intoapache:mainfrom
andygrove:parquet-ntz
Open

fix: add TimestampLTZ-as-NTZ correctness tests and compatibility docs#4220
andygrove wants to merge 9 commits intoapache:mainfrom
andygrove:parquet-ntz

Conversation

@andygrove
Copy link
Copy Markdown
Member

@andygrove andygrove commented May 5, 2026

Summary

Test plan

  • Verify tests are skipped on Spark 3.5 with default scan impl (auto)
  • Verify tests pass on Spark 4.0+ with checkSparkAnswerAndOperator

Closes #4218

🤖 Generated with Claude Code

andygrove added 9 commits May 4, 2026 18:15
Spark 3.x raises per SPARK-36182; Comet should match. Skip test
for native_datafusion/auto scans until apache#4219 is fixed. On Spark
4.0+ (SPARK-47447), verify Comet matches Spark instead.
- Add ParquetInt96NtzCorrectnessSuite to pr_build workflows
- Update SPARK-36182 ignore link from apache#3720 to apache#4219 in 3.4/3.5 diffs
- Remove IgnoreCometNativeDataFusion from SPARK-47447 test in 4.0/4.1
  diffs since Comet should match Spark 4 behavior
Remove IgnoreCometNativeDataFusion from SPARK-47447 test by applying
diffs to Spark v4.0.2/v4.1.1, editing, and regenerating with git diff.
The issue is not specific to INT96; it applies to any TimestampLTZ
column read as TimestampNTZ regardless of physical encoding.
Broaden scans.md to cover all LTZ encodings and add note about
unsupported type conversions (apache#3720). Add known limitations to
Spark 3.4/3.5 sections linking to the scan compat doc.
@andygrove andygrove changed the title test: add INT96 TimestampNTZ correctness tests test/docs: add TimestampLTZ-as-NTZ correctness tests and compatibility docs May 5, 2026
@andygrove andygrove changed the title test/docs: add TimestampLTZ-as-NTZ correctness tests and compatibility docs fix: add TimestampLTZ-as-NTZ correctness tests and compatibility docs May 5, 2026
@andygrove andygrove added this to the 0.16.0 milestone May 6, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Native DataFusion scan silently returns wrong values reading INT96 as TimestampNTZ prior to Spark 4.0

1 participant