Skip to content

Input column missing in Spark UI when using S3 Analytics Accelerator #13295

Open
ranveer-git/iceberg
#2
@hussein-awala

Description

@hussein-awala

Apache Iceberg version

1.9.1 (latest release)

Query engine

Spark

Please describe the bug 🐞

I've recently tested the new integration with the S3 Analytics Accelerator and noticed a small improvement in the processing time for stages that read data from S3, which is great!

However, I also observed that the Input column in the Spark UI (and in the Spark History Server) is consistently empty for these stages. This makes it harder to understand the volume of data being read and to debug or optimize performance.

Spark UI with S3 Analytics Accelerator disabled (default):
Image

Spark UI with S3 Analytics Accelerator enabled:
Image

Willingness to contribute

  • I can contribute a fix for this bug independently
  • I would be willing to contribute a fix for this bug with guidance from the Iceberg community
  • I cannot contribute a fix for this bug at this time

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions