Skip to content

[HUDI-6342] Fixing flaky Continuous mode multi writer tests#8910

Merged
danny0405 merged 1 commit intoapache:masterfrom
nsivabalan:MultiWriterFlakyTestJune8
Jun 9, 2023
Merged

[HUDI-6342] Fixing flaky Continuous mode multi writer tests#8910
danny0405 merged 1 commit intoapache:masterfrom
nsivabalan:MultiWriterFlakyTestJune8

Conversation

@nsivabalan
Copy link
Contributor

Change Logs

Fixing flaky Continuous mode multi writer tests. Exception thrown when continuous mode job fails is different than exception thrown while backfill job fails. So, had to fix the tests accounting for that.

Impact

Fixing flaky Continuous mode multi writer tests. Exception thrown when continuous mode job fails is different than exception thrown while backfill job fails. So, had to fix the tests accounting for that.

Risk level (write none, low medium or high below)

Stabilizes CI

Documentation Update

Describe any necessary documentation update if there is any new feature, config, or user-facing change

  • The config description must be updated if new configs are added or the default value of the configs are changed
  • Any new feature or user-facing change requires updating the Hudi website. Please create a Jira ticket, attach the
    ticket number here and follow the instruction to make
    changes to the website.

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@nsivabalan nsivabalan added release-0.14.0 priority:critical Production degraded; pipelines stalled labels Jun 9, 2023
if (expectConflict && backfillFailed.get() && e.getCause().getMessage().contains(ConcurrentModificationException.class.getName())) {
// expected ConcurrentModificationException since ingestion & backfill will have overlapping writes
if (backfillFailed.get()) {
if (!continuousFailed.get()) {
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

NTR: in most cases backfill job fails and hence the test succeeds. but if continuous job fails, the test times out.

@nsivabalan
Copy link
Contributor Author

nsivabalan commented Jun 9, 2023

we have an unrelated test failure (GH action failure)

Test Call run_clustering Procedure Order Strategy *** FAILED ***

since this patch is also fixing a flaky test, prefer to go ahead w/ landing. we should probably triage and put a follow up patch on this flaky test.

@hudi-bot
Copy link
Collaborator

hudi-bot commented Jun 9, 2023

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

Copy link
Member

@codope codope left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for fixing this.

throw new HoodieException("Both backfilling and ingestion job failed ", e);
}
} else if (expectConflict && continuousFailed.get() && e.getCause().getMessage().contains("Ingestion service was shut down with exception")) {
// incase of regular ingestion job failing, ConcurrentModificationException is not throw all the way.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: thrown

@danny0405 danny0405 merged commit f1c8049 into apache:master Jun 9, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

priority:critical Production degraded; pipelines stalled release-0.14.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants