Optimize merge of uniqExact without_key #43072

nickitat · 2022-11-09T00:03:27Z

Changelog category (leave one):

Performance Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Parallelized merging of uniqExact states for aggregation without a key, i.e. queries like SELECT uniqExact(number) FROM table. The improvement becomes noticeable when the number of unique keys approaches 10^6.
Also uniq performance is slightly optimized.
This closes #4510.

x86:

11.880	2.753	-4.315x	-0.769	0.768	uniq_without_key	26	SELECT uniqExact(number) from numbers_mt(1e8)

arm:

5.777	1.077	-5.36x	-0.814	0.813	uniq_without_key	26	SELECT uniqExact(number) from numbers_mt(1e8)

nickitat · 2022-11-13T14:36:47Z

AST fuzzer (asan) — #43199

nickitat · 2022-11-15T13:00:07Z

AST fuzzer (debug) - #42819
AST fuzzer (ubsan) - #42691
Integration tests (tsan) [1/4] - #43169
Stateless tests (release, s3 storage) - 01079_parallel_alter_detach_table_zookeeper doesn't look relevant

KochetovNicolai · 2022-11-15T15:19:40Z

src/AggregateFunctions/UniqExactSet.h

+                    }
+                };
+
+                for (size_t i = 0; i < thread_pool->getMaxThreads(); ++i)


This can be a dangerous code a bit. If somebody common background I/O pool with 1000 threads, it will add 1000 tasks. let's add at least <= NUM_BUCKETS tasks.

Also, with current implementation we can't share such a pool with a multiple uniq functions.
Maybe it's better to avoid calling wait() at all and handle exceptions in every task separately. (But out ThreadPool interface is not so good for it).

let's add at least <= NUM_BUCKETS tasks.

ok

Also, with current implementation we can't share such a pool with a multiple uniq functions.

yes, we cannot merge states of two different uniqExact-s simultaneously, but it shouldn't be a problem since we aim to utilise all the threads by the current merge. if we would share, client code would need to manually call wait before destroying states which is also not ideal.

KochetovNicolai · 2022-11-15T15:31:10Z

src/AggregateFunctions/AggregateFunctionUniq.h

  * Used for partial specialization to add strings.
  */
-template <typename T, typename Data>
-struct OneAdder
+template <typename T, typename Data, bool is_variadic = false, bool is_exact = false, bool argument_is_tuple = false>


I had an idea that probably we can put this 3 flags into Data.
Probably it will make a code a bit more readable. (If it is possible to do).

And maybe we can do it with is_able_to_parallelize_merge flag as well to those Data which support it, like

if (settings->max_threads > 1) return createAggregateFunctionUniq< ..., AggregateFunctionUniqExactData<..., true /* is_able_to_parallelize_merge> else return createAggregateFunctionUniq< ..., AggregateFunctionUniqExactData<..., false /* is_able_to_parallelize_merge>

also thought about that, looks more accurate

KochetovNicolai · 2022-11-15T15:40:11Z

src/Common/HashTable/HashTable.h

@@ -1263,30 +1251,6 @@ class HashTable :
                ptr->write(wb);
    }

-    void writeText(DB::WriteBuffer & wb) const


I am confused a bit why this code is removed.
Probably it's not used, but may be still helpful for debugging.
I would prefer to do it in a separate pr if possible.

yeah, let's left it untouched

KochetovNicolai

Generally looks ok.

This reverts commit a7e7480.

Add #43072

nickitat added the force tests Force test ignoring fast test output. label Nov 9, 2022

robot-clickhouse added the pr-performance Pull request with some performance improvements label Nov 9, 2022

nickitat added 2 commits November 10, 2022 22:31

impl for uniqExact

adafbcf

rm unused (read|write)Text methods

a7e7480

nickitat force-pushed the optimize_merge_uniqExact branch from 2ba1b0c to a7e7480 Compare November 10, 2022 21:32

nickitat added 5 commits November 10, 2022 22:41

fix style

9839db2

small fixes

88b3725

impl for variadic uniqExact

58e29e2

refactor

5ff7c45

fix style

9eb4422

nickitat changed the title ~~[WIP] Optimize merge of uniqExact without_key~~ Optimize merge of uniqExact without_key Nov 11, 2022

more agressive inlining

a715e79

disable if max_threads=1

84fc8a7

nickitat force-pushed the optimize_merge_uniqExact branch from b97a39e to 84fc8a7 Compare November 13, 2022 22:44

nickitat marked this pull request as ready for review November 14, 2022 16:00

small improvements

0e76c8c

nickitat force-pushed the optimize_merge_uniqExact branch from c1731d6 to 0e76c8c Compare November 14, 2022 18:25

Merge branch 'master' into optimize_merge_uniqExact

1bd2d95

KochetovNicolai self-assigned this Nov 15, 2022

KochetovNicolai reviewed Nov 15, 2022

View reviewed changes

KochetovNicolai approved these changes Nov 15, 2022

View reviewed changes

nickitat added 4 commits November 16, 2022 00:39

review fixes

e079ef7

Revert "rm unused (read|write)Text methods"

cc4df1e

This reverts commit a7e7480.

encapsulate is_able_to_parallelize_merge in Data

02890ae

encapsulate is_exact & argument_is_tuple in Data

f286cef

KochetovNicolai approved these changes Nov 16, 2022

View reviewed changes

nickitat merged commit 7beb58b into ClickHouse:master Nov 17, 2022

nickitat added a commit that referenced this pull request Nov 17, 2022

Add #43072

b826772

alexey-milovidov added a commit that referenced this pull request Nov 17, 2022

Merge pull request #43345 from ClickHouse/nickitat-patch-8

a5821f8

Add #43072

CurtizJ mentioned this pull request Mar 12, 2024

Fix possible incorrect result of aggregate function uniqExact #61257

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize merge of uniqExact without_key #43072

Optimize merge of uniqExact without_key #43072

nickitat commented Nov 9, 2022 •

edited

nickitat commented Nov 13, 2022

nickitat commented Nov 15, 2022

KochetovNicolai Nov 15, 2022

nickitat Nov 15, 2022

KochetovNicolai Nov 15, 2022

nickitat Nov 15, 2022

KochetovNicolai Nov 15, 2022

nickitat Nov 15, 2022

KochetovNicolai left a comment

Optimize merge of uniqExact without_key #43072

Optimize merge of uniqExact without_key #43072

Conversation

nickitat commented Nov 9, 2022 • edited

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

nickitat commented Nov 13, 2022

nickitat commented Nov 15, 2022

KochetovNicolai Nov 15, 2022

Choose a reason for hiding this comment

nickitat Nov 15, 2022

Choose a reason for hiding this comment

KochetovNicolai Nov 15, 2022

Choose a reason for hiding this comment

nickitat Nov 15, 2022

Choose a reason for hiding this comment

KochetovNicolai Nov 15, 2022

Choose a reason for hiding this comment

nickitat Nov 15, 2022

Choose a reason for hiding this comment

KochetovNicolai left a comment

Choose a reason for hiding this comment

nickitat commented Nov 9, 2022 •

edited