[HUDI-3710] Fixing async clustering job test in TestHoodieDeltaStreamer #5317

Merged: codope merged 1 commit into apache:master from nsivabalan:fixAsyncClusterinJobTest on Apr 18, 2022

Conversation

nsivabalan (Contributor) commented Apr 13, 2022

What is the purpose of the pull request

  • TestHoodieDeltaStreamer.testHoodieAsyncClusteringJob has been flaky in CI and was therefore disabled. This patch fixes it.
    Context: async clustering cannot be scheduled while regular writers are making commits that overlap with it. For now, the flakiness is fixed by stopping ingestion so that the clustering job can be scheduled and executed. Test data generation still needs a more general fix; a follow-up ticket, https://issues.apache.org/jira/browse/HUDI-3872, tracks reworking it to cover many more async clustering cases.
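
    To illustrate the ordering described above, here is a minimal, self-contained sketch. It does not use any Hudi API: `ToyTable`, `tryScheduleClustering`, and `stopIngestionThenCluster` are hypothetical names made up for this example, and the "writer active" flag is only a stand-in for overlapping commits. The point is simply that scheduling is attempted after the ingestion thread has been stopped and joined.

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

// Toy model only; none of these types exist in Hudi.
class AsyncClusteringSketch {

    // Hypothetical stand-in for a table that tracks an active writer and its commits.
    static final class ToyTable {
        private final AtomicBoolean writerActive = new AtomicBoolean(false);
        private final AtomicInteger commits = new AtomicInteger(0);

        void commit() {
            commits.incrementAndGet();
        }

        // Assumption for this sketch: scheduling is refused while a writer is active.
        boolean tryScheduleClustering() {
            return !writerActive.get() && commits.get() > 0;
        }
    }

    // Runs ingestion until at least minCommits (>= 1) commits exist, stops it,
    // and only then schedules clustering. Returns true if scheduling was refused
    // during ingestion and accepted after ingestion stopped.
    static boolean stopIngestionThenCluster(ToyTable table, int minCommits)
            throws InterruptedException {
        AtomicBoolean keepRunning = new AtomicBoolean(true);
        table.writerActive.set(true);

        Thread ingestion = new Thread(() -> {
            while (keepRunning.get()) {
                table.commit();
                Thread.yield();
            }
            // The writer is marked inactive only once the loop has exited.
            table.writerActive.set(false);
        });
        ingestion.start();

        // Wait until the writer has produced some data to cluster.
        while (table.commits.get() < minCommits) {
            Thread.sleep(1);
        }

        // Attempted while the writer is still active, so this is refused.
        boolean scheduledWhileIngesting = table.tryScheduleClustering();

        // Stop ingestion and wait for the writer thread to finish.
        keepRunning.set(false);
        ingestion.join();

        // With no writer in flight, scheduling can go through.
        return !scheduledWhileIngesting && table.tryScheduleClustering();
    }
}
```

    Joining the ingestion thread before scheduling is what removes the race in this toy version; the actual test change in this PR may differ in its details.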

Brief change log

  • Fixed flakiness around testHoodieAsyncClusteringJob in tests.

Verify this pull request

  • The fix itself touches tests only.

Committer checklist

  • Has a corresponding JIRA in PR title & commit

  • Commit message is descriptive of the change

  • CI is green

  • Necessary doc changes done or have another open PR

  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

@codope codope merged commit 05dfc39 into apache:master Apr 18, 2022