Skip to content

Conversation

@JingsongLi
Copy link
Contributor

There are many jobs that do not require a downstream normalize node, and the key is that the node has a very large cost.

Introduce a option:
Whether to force the removal of the normalize node when streaming read. Note: This is dangerous and is likely to cause data errors if downstream is used to calculate aggregation.

return ChangelogMode.insertOnly();
} else if (table instanceof ChangelogValueCountFileStoreTable) {
return ChangelogMode.all();
} else if (table instanceof ChangelogWithKeyFileStoreTable) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nit: in the long run, I think we can simplify the configuration to some extent, e.g., depend on ChangelogProducer

Copy link
Contributor

@LadyForest LadyForest left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the contribution, LGTM

@JingsongLi JingsongLi merged commit fa11e87 into apache:release-0.2 Aug 23, 2022
JingsongLi added a commit that referenced this pull request Aug 23, 2022
@JingsongLi JingsongLi deleted the remove branch January 3, 2024 06:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants