-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[HUDI-2658] When disable auto clean, do not check if MIN_COMMITS_TO_KEEP was larger CLEANER_COMMITS_RETAINED or not. #3897
Conversation
…er CLEANER_COMMITS_RETAINED
@hudi-bot run azure |
Hi @xushiyan Friendly ping. Could you please take a look at this PR at you convince? Thanks a lot! |
Just fix conflict from master @xushiyan PTAL :) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@zhangyue19921010 thanks for the patch. Can you explain what is the downside of keep the logic as is? in another word: even if auto clean disabled, why wouldn't you increase min instants to keep to be greater than commits retained?
Hi @xushiyan Thanks a lot for your attention. Actually this is a minor patch, and just make hudi‘s behavior maybe more appropriate. |
@zhangyue19921010 I think putting this conditional validity could compromise the integrity of min-instant as user can toggle auto clean any time. What if on the same table there is a writer and a compactor with different auto clean settings: the writer could disable auto clean and trigger archival and have less number of commits, then compactor runs and see actual instants less than min-instants? I found having consistency over the logic here is important. |
Hi @xushiyan Thanks a lot for your explaining. I will close this pr and keep the behavior as before :) |
What is the purpose of the pull request
When disable auto clean, do not check if MIN_COMMITS_TO_KEEP was larger CLEANER_COMMITS_RETAINED or not.
For current master branch
Exception mentioned blow will throw even though disabled auto clean.
Brief change log
(for example:)
Verify this pull request
(Please pick either of the following options)
This pull request is a trivial rework / code cleanup without any test coverage.
(or)
This pull request is already covered by existing tests, such as (please describe tests).
(or)
This change added tests and can be verified as follows:
(example:)
Committer checklist
Has a corresponding JIRA in PR title & commit
Commit message is descriptive of the change
CI is green
Necessary doc changes done or have another open PR
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.