Skip to content

Core: Add partition to TrackedFile#16253

Merged
nastra merged 2 commits into
apache:mainfrom
nastra:v4-add-partition-to-trackedfile
May 12, 2026
Merged

Core: Add partition to TrackedFile#16253
nastra merged 2 commits into
apache:mainfrom
nastra:v4-add-partition-to-trackedfile

Conversation

@nastra
Copy link
Copy Markdown
Contributor

@nastra nastra commented May 8, 2026

Adds the v1-v3 DataFile.partition (field ID 102) to the v4 TrackedFile, so partition data continues to be tracked per-file in v4 manifests

@github-actions github-actions Bot added the core label May 8, 2026
@nastra nastra force-pushed the v4-add-partition-to-trackedfile branch from 5a97adf to b10fb2e Compare May 8, 2026 14:23
@nastra nastra requested a review from amogh-jahagirdar May 8, 2026 14:24

List<FieldStats<?>> fieldStatsList =
ImmutableList.of(
(FieldStats<?>)
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this and the suppression were unnecessary

@nastra nastra force-pushed the v4-add-partition-to-trackedfile branch from b10fb2e to 6f21db2 Compare May 8, 2026 14:26
FileContent.DATA,
"s3://bucket/data/file.parquet",
FileFormat.PARQUET,
new PartitionData(Types.StructType.of()),
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Optional: Would it make sense to test with an actual partition value?

@nastra nastra force-pushed the v4-add-partition-to-trackedfile branch from 6f21db2 to eaf1f72 Compare May 11, 2026 15:05
Copy link
Copy Markdown
Contributor

@amogh-jahagirdar amogh-jahagirdar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall looks good to me thanks @nastra just a minor comment

private long recordCount = -1L;
private long fileSizeInBytes = -1L;
private Integer specId = null;
private PartitionData partitionData = null;
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor: Would it simplify some of the below code if we just set this to EMPTY_PARTITION_DATA?

@nastra nastra force-pushed the v4-add-partition-to-trackedfile branch from eaf1f72 to 9f08873 Compare May 12, 2026 09:49
@nastra nastra merged commit 64ba246 into apache:main May 12, 2026
36 checks passed
@nastra nastra deleted the v4-add-partition-to-trackedfile branch May 12, 2026 13:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants