Skip to content

Expose numFiles and numPartitions metrics for native Iceberg scan #2219

@weimingdiit

Description

@weimingdiit

Describe

Native file scans expose driver-side metrics such as numFiles and numPartitions, but native Iceberg scans currently only expose the default native file scan metrics. This makes it harder to inspect how many Iceberg files and Spark file partitions were planned for a native Iceberg scan.

Solution

Add numFiles and numPartitions SQL metrics to NativeIcebergTableScanExec, consistent with NativeFileSourceScanBase.

The metrics should be populated on the driver after Iceberg FileScanTasks are converted into Spark FilePartitions.

Additional context
Entry points:

  • NativeIcebergTableScanExec.scala
  • NativeFileSourceScanBase.scala

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions