feat: improve bucket assignment for MOR with bucket index#18444

Merged: danny0405 merged 1 commit into apache:master from HuangZhenQiu:bucket-index-assigner on Apr 2, 2026

@HuangZhenQiu (Member) commented Apr 1, 2026

Describe the issue this Pull Request addresses

Improve bucket assignment for MOR tables that use a bucket index.

Closes #18166

Summary and Changelog

  1. Bring HoodieSplitBucketAssigner to feature parity with #18103 (fix(flink): Improve splits distribution strategy for mor table w/ bucket index).
  2. Update the test cases for HoodieSplitBucketAssigner.
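
For readers unfamiliar with the idea, here is a rough sketch of what bucket-aware split distribution can look like. It is not the code from this PR or from #18103: the class name, the `assign` and `loadPerTask` helpers, and the `partitionOffset` parameter are all hypothetical. It only illustrates the general technique of mapping a bucket id to a reader subtask with a modulo, shifted per partition so that the same bucket number in different partitions does not always land on the same subtask.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: not the implementation in HoodieSplitBucketAssigner. */
public class BucketSplitAssignSketch {

    /**
     * Hypothetical mapping from a bucket to a reader subtask.
     * partitionOffset is an assumed per-partition shift (for example derived
     * from a hash of the partition path) that staggers buckets across subtasks.
     */
    static int assign(int bucketId, int partitionOffset, int parallelism) {
        return Math.floorMod(bucketId + partitionOffset, parallelism);
    }

    /** Counts how many of numBuckets buckets each subtask would receive. */
    static List<Integer> loadPerTask(int numBuckets, int partitionOffset, int parallelism) {
        List<Integer> load = new ArrayList<>();
        for (int i = 0; i < parallelism; i++) {
            load.add(0);
        }
        for (int bucket = 0; bucket < numBuckets; bucket++) {
            int task = assign(bucket, partitionOffset, parallelism);
            load.set(task, load.get(task) + 1);
        }
        return load;
    }

    public static void main(String[] args) {
        // 8 buckets, offset 3, 4 subtasks: every subtask gets 2 buckets.
        System.out.println(loadPerTask(8, 3, 4)); // prints [2, 2, 2, 2]
    }
}
```

With a modulo mapping like this, all splits of one bucket go to the same subtask, and the load is even whenever the bucket count is a multiple of the parallelism. The actual assigner may use a different formula.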

Impact

none

Risk Level

none

Documentation Update

none

Contributor's checklist

  • Read through contributor's guide
  • Enough context is provided in the sections above
  • Adequate tests were added if applicable

@HuangZhenQiu changed the title from "feat (flink): feature parity of bucket assignment for MOR with bucket index" to "feat (flink): improvement bucket assignment for MOR with bucket index" (Apr 1, 2026)
@HuangZhenQiu changed the title from "feat (flink): improvement bucket assignment for MOR with bucket index" to "feat (flink): improve bucket assignment for MOR with bucket index" (Apr 1, 2026)
@HuangZhenQiu force-pushed the bucket-index-assigner branch from 3108349 to a512db2 (April 1, 2026 20:15)
@HuangZhenQiu force-pushed the bucket-index-assigner branch from a512db2 to 8c27e99 (April 1, 2026 21:04)
@HuangZhenQiu changed the title from "feat (flink): improve bucket assignment for MOR with bucket index" to "feat: improve bucket assignment for MOR with bucket index" (Apr 1, 2026)
@github-actions (bot) added the label "size:L" (PR with lines of changes in (300, 1000]) (Apr 1, 2026)
@codecov-commenter

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 68.51%. Comparing base (447af5a) to head (8c27e99).
⚠️ Report is 3 commits behind head on master.

Additional details and impacted files
@@             Coverage Diff              @@
##             master   #18444      +/-   ##
============================================
- Coverage     68.55%   68.51%   -0.04%     
+ Complexity    27988    27984       -4     
============================================
  Files          2442     2443       +1     
  Lines        134534   134634     +100     
  Branches      16249    16278      +29     
============================================
+ Hits          92225    92248      +23     
- Misses        35038    35110      +72     
- Partials       7271     7276       +5     
Flag                          Coverage Δ
common-and-other-modules      44.33% <100.00%> (-0.03%) ⬇️
hadoop-mr-java-client         44.94% <ø> (-0.05%) ⬇️
spark-client-hadoop-common    48.39% <ø> (+<0.01%) ⬆️
spark-java-tests              48.77% <ø> (-0.03%) ⬇️
spark-scala-tests             45.63% <ø> (-0.02%) ⬇️
utilities                     38.32% <ø> (-0.05%) ⬇️

Flags with carried forward coverage won't be shown.

Files with missing lines                                  Coverage Δ
...hudi/source/split/assign/HoodieSplitAssigners.java     83.33% <100.00%> (ø)
...source/split/assign/HoodieSplitBucketAssigner.java     100.00% <100.00%> (ø)

... and 10 files with indirect coverage changes

@hudi-bot (Collaborator) commented Apr 1, 2026

CI report:

Bot commands: @hudi-bot supports the following commands:
  • @hudi-bot run azure: re-run the last Azure build

@danny0405 danny0405 merged commit 3e662f9 into apache:master Apr 2, 2026
59 of 60 checks passed

Development

Successfully merging this pull request may close these issues.

Improve splits distribution strategy for mor table
