Skip to content

perf: Avoid materializing into intermediate list in metadata table path#14033

Merged
nsivabalan merged 1 commit intoapache:masterfrom
the-other-tim-brown:lazy-iter-to-stream
Oct 3, 2025
Merged

perf: Avoid materializing into intermediate list in metadata table path#14033
nsivabalan merged 1 commit intoapache:masterfrom
the-other-tim-brown:lazy-iter-to-stream

Conversation

@the-other-tim-brown
Copy link
Contributor

@the-other-tim-brown the-other-tim-brown commented Oct 2, 2025

Describe the issue this Pull Request addresses

There is one path in the metadata table that is not properly leveraging iterators and needs a fix

Summary and Changelog

  • Added a new method for HoodieListData that takes in an iterator. This method will convert to a stream and add the proper hook to close the iterator if required.
  • Updates the readSliceAndFilterByKeysIntoList to use the above constructor

Impact

Avoids reading data into a list just to create an iterator

Risk Level

None

Documentation Update

Contributor's checklist

  • Read through contributor's guide
  • Enough context is provided in the sections above
  • Adequate tests were added if applicable

@github-actions github-actions bot added the size:S PR with lines of changes in (10, 100] label Oct 2, 2025
Copy link
Contributor

@yihua yihua left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@hudi-bot
Copy link
Collaborator

hudi-bot commented Oct 2, 2025

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@nsivabalan nsivabalan merged commit 4d20d4f into apache:master Oct 3, 2025
136 of 137 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size:S PR with lines of changes in (10, 100]

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants