[HUDI-7797] Use HoodieIOFactory to return pluggable FileFormatUtils implementation #11302

Merged
yihua merged 2 commits into apache:master from yihua:HUDI-7797-pluggable-format-utils on May 26, 2024

@yihua
Contributor

@yihua yihua commented May 25, 2024

Change Logs

This PR makes the FileFormatUtils implementation pluggable based on the HoodieStorage instance. Two new HoodieIOFactory#getFileFormatUtils APIs are added so that an IO factory implementation class can choose which FileFormatUtils implementation to return. The hardcoded Hadoop-based logic in FileFormatUtils#getInstance is moved to the HoodieHadoopIOFactory class. New tests are added in TestHoodieHadoopIOFactory.
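
To illustrate the idea, here is a minimal, self-contained sketch of the pattern. It does not use the real Hudi classes: `IoFactory`, `FormatUtils`, `HadoopIoFactory` and `CustomIoFactory` are simplified, hypothetical stand-ins for HoodieIOFactory, FileFormatUtils and HoodieHadoopIOFactory, and the two lookup signatures (by format and by file path) are an assumption, since the description above does not spell them out. The HoodieStorage side is omitted.

```java
import java.util.Locale;

// Simplified stand-ins, NOT the real Hudi classes. Names echo the PR description,
// but every signature below is an assumption made for illustration only.
final class PluggableFormatUtilsSketch {

    enum FileFormat { PARQUET, ORC }

    /** Stand-in for FileFormatUtils: format-specific helpers sit behind this type. */
    interface FormatUtils {
        FileFormat format();
    }

    static class ParquetFormatUtils implements FormatUtils {
        public FileFormat format() { return FileFormat.PARQUET; }
    }

    static class OrcFormatUtils implements FormatUtils {
        public FileFormat format() { return FileFormat.ORC; }
    }

    /** Stand-in for HoodieIOFactory: the factory, not a static helper, picks the utils. */
    interface IoFactory {
        FormatUtils getFileFormatUtils(FileFormat format);

        // Second lookup (assumed): resolve the format from the file extension,
        // then delegate to the format-based lookup.
        default FormatUtils getFileFormatUtils(String path) {
            String lower = path.toLowerCase(Locale.ROOT);
            if (lower.endsWith(".parquet")) {
                return getFileFormatUtils(FileFormat.PARQUET);
            }
            if (lower.endsWith(".orc")) {
                return getFileFormatUtils(FileFormat.ORC);
            }
            throw new UnsupportedOperationException("Unsupported file format: " + path);
        }
    }

    /** Stand-in for HoodieHadoopIOFactory: owns the formerly hardcoded mapping. */
    static class HadoopIoFactory implements IoFactory {
        @Override
        public FormatUtils getFileFormatUtils(FileFormat format) {
            switch (format) {
                case PARQUET: return new ParquetFormatUtils();
                case ORC: return new OrcFormatUtils();
                default: throw new UnsupportedOperationException("Unsupported: " + format);
            }
        }
    }

    /** A custom factory swaps in its own implementation for a single format. */
    static final class CustomIoFactory extends HadoopIoFactory {
        static final class CustomParquetUtils extends ParquetFormatUtils { }

        @Override
        public FormatUtils getFileFormatUtils(FileFormat format) {
            return format == FileFormat.PARQUET
                ? new CustomParquetUtils()
                : super.getFileFormatUtils(format);
        }
    }

    public static void main(String[] args) {
        IoFactory factory = new CustomIoFactory();
        // The path-based lookup routes through the overridden format-based lookup.
        System.out.println(factory.getFileFormatUtils("part-0001.parquet").getClass().getSimpleName());
        System.out.println(factory.getFileFormatUtils(FileFormat.ORC).getClass().getSimpleName());
    }
}
```

In this sketch, callers only ever ask the factory for the utils, so replacing the factory is enough to replace the utils implementation, and the path-based lookup picks up the override without further changes.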

Impact

Allows a pluggable FileFormatUtils implementation to be returned by the HoodieIOFactory.

Risk level

none

Documentation Update

none

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

github-actions bot added the "size:M" label (PR with lines of changes in (100, 300]) on May 25, 2024
yihua force-pushed the HUDI-7797-pluggable-format-utils branch from 6b8370a to 049a4ff on May 25, 2024 08:28
@hudi-bot
Collaborator

CI report:

Bot commands: @hudi-bot supports the following commands:
  • @hudi-bot run azure: re-run the last Azure build

yihua changed the title from "[HUDI-7797] Use HoodieIOFactory to return pluggable FileFormatUtils i…" to "[HUDI-7797] Use HoodieIOFactory to return pluggable FileFormatUtils implementation" on May 25, 2024
yihua force-pushed the HUDI-7797-pluggable-format-utils branch from 049a4ff to 9800805 on May 25, 2024 15:34
@codope
Member

codope commented May 25, 2024

Thanks for addressing the comments. Please also update the PR description.

@yihua
Contributor Author

yihua commented May 26, 2024

Azure CI is green.
[Image: Screenshot 2024-05-25 at 17 51 55]

Labels

release-0.15.0, size:M (PR with lines of changes in (100, 300])

Projects

None yet

3 participants