Skip to content

[HUDI-7062] Implement Caching Iterator for new filegroup reader#10036

Merged
yihua merged 2 commits intoapache:masterfrom
jonvex:use_caching_iterator_new_filegroup_reader
Nov 9, 2023
Merged

[HUDI-7062] Implement Caching Iterator for new filegroup reader#10036
yihua merged 2 commits intoapache:masterfrom
jonvex:use_caching_iterator_new_filegroup_reader

Conversation

@jonvex
Copy link
Copy Markdown
Contributor

@jonvex jonvex commented Nov 9, 2023

Change Logs

Sometimes df.count() ends up calling hasnext multiple times before calling next() so we need to use caching iterator style

Impact

fix issue that was causing tests to fail

Risk level (write none, low medium or high below)

low

Documentation Update

N/A

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

this.baseFileIterator = baseFileIterator;
}

protected abstract boolean doHasNext() throws IOException;
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's add a comment here on why the caching iterator is needed here.

@yihua yihua added priority:blocker Production down; release blocker release-1.0.0 labels Nov 9, 2023
@hudi-bot
Copy link
Copy Markdown
Collaborator

hudi-bot commented Nov 9, 2023

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@yihua yihua merged commit 96cb780 into apache:master Nov 9, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

priority:blocker Production down; release blocker release-1.0.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants