@PatrickRen
Contributor

Purpose

Linked issue: closes #6284

This pull request updates the Azure Blob Storage integration and improves object store detection in the codebase. The most important changes are grouped below:

Dependency updates:

  • Upgraded the Hadoop shaded filesystem dependency from version 3.3.4 to 3.4.2 in paimon-filesystems/pom.xml, which adds support for Azure Workload Identity authentication in Hadoop's Azure filesystem.

Azure Blob Storage integration:

  • Removed the requiredOptions method from the AzureLoader class. The original implementation required both the account name and the account key to be set, which blocked other authentication mechanisms such as OAuth.
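
To illustrate why a mandatory account name/key check gets in the way, here is a minimal sketch. It is not the AzureLoader code: the class name, the method, and the `example.*` option keys are all hypothetical and exist only for this illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; this is NOT Paimon's AzureLoader.
final class RequiredOptionsSketch {

    // Assumed option keys, invented for this example.
    static final String ACCOUNT_NAME = "example.azure.account.name";
    static final String ACCOUNT_KEY = "example.azure.account.key";

    // Returns the required keys that are absent or empty in the given options.
    static List<String> missingKeys(Map<String, String> options) {
        List<String> missing = new ArrayList<>();
        for (String key : List.of(ACCOUNT_NAME, ACCOUNT_KEY)) {
            String value = options.get(key);
            if (value == null || value.isEmpty()) {
                missing.add(key);
            }
        }
        return missing;
    }
}
```

With a check of this shape, a configuration that only carries OAuth settings reports both keys as missing and would be rejected before Hadoop ever sees it, even though it is a valid way to authenticate.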

Object store detection improvements:

  • Updated the logic in FlinkFileIO to recognize the abfs scheme as an object store, ensuring proper handling of Azure Blob Storage.
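
A scheme-based check of this kind can be sketched roughly as follows. This is an assumption about the general shape, not the actual FlinkFileIO code: the class and method names are hypothetical, and the prefixes other than `abfs` are included only as plausible examples.

```java
import java.net.URI;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only; this is NOT the FlinkFileIO implementation.
final class ObjectStoreSchemeSketch {

    // "abfs" comes from this PR; the other prefixes are assumptions for the example.
    private static final List<String> OBJECT_STORE_PREFIXES =
            List.of("s3", "oss", "wasb", "abfs");

    // Returns true when the URI scheme starts with a known object store prefix.
    static boolean isObjectStoreScheme(URI uri) {
        String scheme = uri.getScheme();
        if (scheme == null) {
            return false; // relative path without a scheme
        }
        String lower = scheme.toLowerCase(Locale.ROOT);
        for (String prefix : OBJECT_STORE_PREFIXES) {
            if (lower.startsWith(prefix)) {
                return true;
            }
        }
        return false;
    }
}
```

Because the match is by prefix, the secure variant `abfss` is covered by the `abfs` entry in this sketch.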

Tests

Unfortunately, this pull request does not include any test coverage: testing requires real Azure resources, and the Azure emulator Azurite does not support ABFS.

However, I tested it with my own Azure account using OAuth authentication.

API and Format

No.

Documentation

No.

@PatrickRen PatrickRen force-pushed the support-azure-workload-identity branch 3 times, most recently from 6222f28 to 80de5e5 Compare September 23, 2025 13:26
@PatrickRen PatrickRen force-pushed the support-azure-workload-identity branch from fa12bef to 78b405b Compare September 25, 2025 07:55
@PatrickRen PatrickRen force-pushed the support-azure-workload-identity branch from 2558688 to 630a298 Compare September 25, 2025 11:39
@PatrickRen PatrickRen changed the title [fs] Upgrade filesystem Hadoop to 3.4.2 to Support Azure Workload Identity authentication [fs] Upgrade filesystem Hadoop version to 3.4.2 Sep 25, 2025
Contributor

@JingsongLi JingsongLi left a comment

Thanks @PatrickRen , looks good to me!

@JingsongLi JingsongLi merged commit 7962e22 into apache:master Sep 30, 2025
24 checks passed
XiaoHongbo-Hope pushed a commit to XiaoHongbo-Hope/paimon that referenced this pull request Oct 10, 2025
JingsongLi added a commit to JingsongLi/paimon that referenced this pull request Oct 10, 2025

Development

Successfully merging this pull request may close these issues.

[Bug] Bump Hadoop version in filesystem to support Azure Workload Identity

2 participants