Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[HUDI-2461] Support out of order commits in MDT with completion time view #9871

Merged
merged 6 commits into from Nov 6, 2023

Conversation

codope
Copy link
Member

@codope codope commented Oct 16, 2023

Change Logs

Metadata table (MDT) has special handling for compaction. This PR ensures MDT compaction is handled in completion time based filesystem view. Previously, out-of-order commit tests were disabled. This PR now enables that test to show the file slicing happens correctly.

NOTE: This PR does not remove the limitation of single writer or OCC concurrency mode from MDT. That will be tackled separately.

Impact

Support out of order commits due to compaction in MDT with completion time view.

Risk level (write none, low medium or high below)

low

Documentation Update

Describe any necessary documentation update if there is any new feature, config, or user-facing change

  • The config description must be updated if new configs are added or the default value of the configs are changed
  • Any new feature or user-facing change requires updating the Hudi website. Please create a Jira ticket, attach the
    ticket number here and follow the instruction to make
    changes to the website.

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

Copy link
Contributor

@danny0405 danny0405 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the contribution, I have left some comments.

…view

Use normal instant time for compaction in MDT

Revert "Use normal instant time for compaction in MDT"

This reverts commit 6176139.

Fix some tests

Fix more tests
Copy link
Member Author

@codope codope left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@danny0405 Please review again, i've moved the check based on your suggestion.

@@ -1221,35 +1222,6 @@ public void testArchiveTableWithMetadataTableCompaction() throws Exception {
commitsAfterArchival);
}

@Test
public void testArchiveCommitsWithCompactionCommitInMetadataTableTimeline() throws Exception {
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this test is redundant and uses dummy instants. we're already checking archival with compaction in metadata table in several other tests such as testArchiveTableWithMetadataTableCompaction and testArchivalAndCompactionInMetadataTable

@hudi-bot
Copy link

hudi-bot commented Nov 3, 2023

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@danny0405
Copy link
Contributor

Thanks for the contribution @codope , I have reviewed and created a patch:

2461.patch.zip

Copy link
Contributor

@danny0405 danny0405 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1, we can land it once all the CI tests pass.

@codope codope merged commit 61f35eb into apache:master Nov 6, 2023
28 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants