Spark (3.4,3.5,4.0) : Include snapshotId and branch in SparkTable equals and hashCode by bharos · Pull Request #15840 · apache/iceberg

bharos · 2026-03-31T22:34:16Z

SparkTable.equals() and hashCode() only compared the table name, causing Spark to return cached query results for time-travel and branch queries. When a user reads from branch 'main' and then branch 'audit', Spark's cache considered them equal and returned stale data from 'main'.

Include snapshotId and branch fields in equals() and hashCode() so that Spark correctly distinguishes tables loaded at different snapshots or branches. This matches the fix already applied in Spark 4.1 (current main).

Closes #15741

… and hashCode SparkTable.equals() and hashCode() only compared the table name, causing Spark to return cached query results for time-travel and branch queries. When a user reads from branch 'main' and then branch 'audit', Spark's cache considered them equal and returned stale data from 'main'. Include snapshotId and branch fields in equals() and hashCode() so that Spark correctly distinguishes tables loaded at different snapshots or branches. This matches the fix already applied in Spark 4.1 (main). Closes apache#15741

github-actions bot added the spark label Mar 31, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spark (3.4,3.5,4.0) : Include snapshotId and branch in SparkTable equals and hashCode#15840

Spark (3.4,3.5,4.0) : Include snapshotId and branch in SparkTable equals and hashCode#15840
bharos wants to merge 1 commit intoapache:mainfrom
bharos:fix/spark-table-equals-snapshot-caching

bharos commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bharos commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant