Skip to content

[HUDI-2502] Refactor index in hudi-client module#3778

Merged
nsivabalan merged 8 commits intoapache:masterfrom
yihua:HUDI-2502-refactor-index
Oct 28, 2021
Merged

[HUDI-2502] Refactor index in hudi-client module#3778
nsivabalan merged 8 commits intoapache:masterfrom
yihua:HUDI-2502-refactor-index

Conversation

@yihua
Copy link
Contributor

@yihua yihua commented Oct 11, 2021

What is the purpose of the pull request

This PR refactors the index in hudi-client module by using the HoodieData abstraction to extract common indexing logic, reduce LoC, and keep only divergent engine-specific logic in the corresponding engine client packages.

Brief change log

Verify this pull request

This pull request is already covered by existing tests around indexing.

Committer checklist

  • Has a corresponding JIRA in PR title & commit

  • Commit message is descriptive of the change

  • CI is green

  • Necessary doc changes done or have another open PR

  • For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.

@hudi-bot
Copy link
Collaborator

hudi-bot commented Oct 11, 2021

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run travis re-run the last Travis build
  • @hudi-bot run azure re-run the last Azure build

@yihua yihua force-pushed the HUDI-2502-refactor-index branch 2 times, most recently from 9ed5c80 to bc4493a Compare October 12, 2021 06:06
@yihua
Copy link
Contributor Author

yihua commented Oct 12, 2021

@hudi-bot run azure

@yihua yihua force-pushed the HUDI-2502-refactor-index branch from bc4493a to 57f7040 Compare October 15, 2021 16:28
@yihua yihua changed the title [WIP][HUDI-2502] Refactor index in hudi-client module [HUDI-2502] Refactor index in hudi-client module Oct 15, 2021
@yihua yihua force-pushed the HUDI-2502-refactor-index branch from 57f7040 to ac66c74 Compare October 15, 2021 16:38
@vinothchandar vinothchandar self-assigned this Oct 15, 2021
Copy link
Contributor

@nsivabalan nsivabalan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pretty good job on this PR. You have done a great job buddy!
mostly minor comments.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yeah, its been a long due to fix the naming to factory :) thanks.

Copy link
Member

@vinothchandar vinothchandar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I skimmed the approach. Looks reasonable. Main thing to think thru is any public API change,. Is it possible to retain the existing methods tagLocation(..) with a @deprecated warning for the release. We can then retire/remove them over time. Otherwise, this is a breaking change, which is a problem with rollouts

@nsivabalan nsivabalan added the priority:blocker Production down; release blocker label Oct 22, 2021
@yihua yihua force-pushed the HUDI-2502-refactor-index branch from ac66c74 to 6a9e2f6 Compare October 26, 2021 05:27
@yihua
Copy link
Contributor Author

yihua commented Oct 27, 2021

@nsivabalan @vinothchandar I addressed all the comments. Could you take another pass?

@yihua
Copy link
Contributor Author

yihua commented Oct 27, 2021

@hudi-bot run azure

Copy link
Contributor

@nsivabalan nsivabalan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

few minor comments.

@nsivabalan
Copy link
Contributor

Good job on the PR. commendable work!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

priority:blocker Production down; release blocker

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants