Skip to content

[core] provide changelog-producer.row-deduplicate to deduplicate same change#1100

Merged
JingsongLi merged 2 commits into
apache:masterfrom
liming30:paimon-634
May 9, 2023
Merged

[core] provide changelog-producer.row-deduplicate to deduplicate same change#1100
JingsongLi merged 2 commits into
apache:masterfrom
liming30:paimon-634

Conversation

@liming30
Copy link
Copy Markdown
Contributor

@liming30 liming30 commented May 8, 2023

Purpose

Provide changelog-producer.row-deduplicate to deduplicate same change. Closes #634

Tests

Modify the following unit test to verify:

  • org.apache.paimon.mergetree.compact.FullChangelogMergeFunctionWrapperTestBase
  • org.apache.paimon.mergetree.compact.LookupChangelogMergeFunctionWrapperTest

Modify the following integration tests to verify:

  • org.apache.paimon.flink.FullCompactionFileStoreITCase#testUpdate
  • org.apache.paimon.flink.LookupChangelogWithAggITCase#testMultipleCompaction

API and Format

No

Documentation

No

return options.get(CHANGELOG_PRODUCER);
}

public boolean rowDeduplicate() {
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Use changelogRowDeduplicate()?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

resolved.

Copy link
Copy Markdown
Contributor

@JingsongLi JingsongLi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @liming30 for the contribution!
The PR looks pretty good. +1

We can create an Issue for introducing RecordEqualiser, it is bit faster than comparator.

@JingsongLi JingsongLi merged commit 99ff847 into apache:master May 9, 2023
@liming30 liming30 deleted the paimon-634 branch May 9, 2023 08:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[feature] Provide changelog-producer.row-deduplicate to deduplicate same change

3 participants