Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Improvement]: Improve optimizer's compaction performance in large delete scenarios #2266

Closed
3 tasks done
Tracked by #2448
zhongqishang opened this issue Nov 8, 2023 · 0 comments · Fixed by #2268
Closed
3 tasks done
Tracked by #2448

Comments

@zhongqishang
Copy link
Contributor

zhongqishang commented Nov 8, 2023

Search before asking

  • I have searched in the issues and found no similar issues.

What would you like to be improved?

In the case of large data volume, we can enable rocksdb as overflow storage.

The following scenarios the merge performance needs to be improved:

  • Write full data through flink cdc
  • A large amount of data is refreshed in a short period of time, include insert update delete

How should we improve?

When eq delete total record count is greater than 1 million, we write the data file primary key into bloom filter.
When reading eq-delete files, we determine whether it exists in data bloom filter. If it does not exist, we can directly ignore it (currently all written to eq delete StructLikeMap), which can greatly reduce the storage operation of eq delete. In this way, the size of bloom filter is controllable, so the memory usage is also controllable, depending on the size of the file.

Overflow writing to rocksdb will cause a large number of disk operations, so reduce the storage amount of eq-delete will speeds up merge operations.

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Subtasks

No response

Code of Conduct

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant