[feat](hll) add to_hll(bigint) scalar function by HappenLee · Pull Request #60996 · apache/doris

HappenLee · 2026-03-03T12:07:54Z

Add a new built-in scalar function to_hll(bigint) that constructs an HLL value from a BIGINT directly, bypassing any hash — the bigint is fed as-is into HyperLogLog::update(uint64_t).

Changes:

BE (vec/functions/function_hll.cpp): implement struct ToHll with Status-returning execute/vector/vector_nullable methods; register as FunctionAlwaysNotNullable<ToHll, true> so InvalidArgument status is propagated on negative input.
BE test (be/test/vec/function/function_hll_test.cpp): add unit test function_hll_test.function_to_hll_test covering values 0, 1, 2, 100.
FE Nereids (scalar/ToHll.java): new ScalarFunction with signature BIGINT -> HLL.
FE catalog (BuiltinScalarFunctions.java): register to_hll.
FE visitor (ScalarFunctionVisitor.java): add visitToHll visitor hook.

What problem does this PR solve?

Issue Number: close #xxx

Related PR: #xxx

Problem Summary:

Release note

None

Check List (For Author)

Test
- Regression test
- Unit Test
- Manual test (add detailed scripts or steps below)
- No need to test or manual test. Explain why:
  - This is a refactor/code format and no logic has been changed.
  - Previous test can cover this change.
  - No code files have been changed.
  - Other reason
Behavior changed:
- No.
- Yes.
Does this need documentation?
- No.
- Yes.

Check List (For Reviewer who merge this PR)

Confirm the release note
Confirm test cases
Confirm document
Add branch pick label

hello-stephen · 2026-03-03T12:08:00Z

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

What problem was fixed (it's best to include specific error reporting information). How it was fixed.
Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
What features were added. Why was this function added?
Which code was refactored and why was this part of the code refactored?
Which functions were optimized and what is the difference before and after the optimization?

HappenLee · 2026-03-03T12:08:06Z

run buildall

Copilot

Pull request overview

Adds a new built-in scalar function to_hll(BIGINT) -> HLL to construct an HLL value directly from a BIGINT (without hashing), with FE/BE registration and a BE unit test.

Changes:

BE: implement and register to_hll in vectorized HLL functions (including negative-input validation via Status::InvalidArgument).
FE (Nereids): add ToHll scalar expression + visitor hook and register it as a built-in scalar function.
Tests: add BE unit test for basic positive inputs.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
fe/fe-core/src/main/java/org/apache/doris/nereids/trees/expressions/visitor/ScalarFunctionVisitor.java	Adds a visitor entrypoint for the new `ToHll` scalar function.
fe/fe-core/src/main/java/org/apache/doris/nereids/trees/expressions/functions/scalar/ToHll.java	Introduces the Nereids scalar function definition and signature `BIGINT -> HLL`.
fe/fe-core/src/main/java/org/apache/doris/catalog/BuiltinScalarFunctions.java	Registers `to_hll` as a built-in scalar function in FE.
be/src/vec/functions/function_hll.cpp	Implements `ToHll` BE vectorized function and registers it in the HLL function factory.
be/test/vec/function/function_hll_test.cpp	Adds a unit test covering several positive BIGINT inputs for `to_hll`.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-03T12:13:34Z

be/src/vec/functions/function_hll.cpp

+                if (arg_is_nullable && (*nullmap)[i]) {
+                    continue;
+                } else {
+                    int64_t value = data[i];
+                    if (value < 0) {
+                        return Status::InvalidArgument(
+                                "to_hll does not support negative bigint value: {}", value);
+                    }
+                    res_data[i].update(static_cast<uint64_t>(value));
+                }


In the non-nullable instantiation, nullmap is passed as nullptr, but the loop still references (*nullmap)[i] inside a normal if condition. While short-circuiting avoids evaluation when arg_is_nullable is false, rewriting this as an if constexpr (arg_is_nullable) block (or splitting nullable/non-nullable loops) avoids relying on that subtlety and keeps static analyzers/sanitizers from flagging a potential null dereference.

Suggested change

if (arg_is_nullable && (*nullmap)[i]) {

continue;

} else {

int64_t value = data[i];

if (value < 0) {

return Status::InvalidArgument(

"to_hll does not support negative bigint value: {}", value);

}

res_data[i].update(static_cast<uint64_t>(value));

}

if constexpr (arg_is_nullable) {

if ((*nullmap)[i]) {

continue;

}

}

int64_t value = data[i];

if (value < 0) {

return Status::InvalidArgument(

"to_hll does not support negative bigint value: {}", value);

}

res_data[i].update(static_cast<uint64_t>(value));

Copilot · 2026-03-03T12:13:34Z

be/test/vec/function/function_hll_test.cpp

+
+    DataSet data_set = {{{(int64_t)1}, &hll1},
+                        {{(int64_t)2}, &hll2},
+                        {{(int64_t)100}, &hll3},
+                        {{(int64_t)0}, &hll4}};
+
+    static_cast<void>(check_function<DataTypeHLL>(func_name, input_types, data_set));
+}


The new to_hll behavior has an explicit error path for negative BIGINT values (returns Status::InvalidArgument) and a special null-handling path (skips update and returns a default/empty HLL). The unit test currently only covers a few positive values; please add coverage for at least one negative input (expect execute failure) and a NULL input row (verify it returns an empty HLL rather than NULL).

Suggested change

DataSet data_set = {{{(int64_t)1}, &hll1},

{{(int64_t)2}, &hll2},

{{(int64_t)100}, &hll3},

{{(int64_t)0}, &hll4}};

static_cast<void>(check_function<DataTypeHLL>(func_name, input_types, data_set));

}

HyperLogLog empty_hll;

DataSet data_set = {{{(int64_t)1}, &hll1},

{{(int64_t)2}, &hll2},

{{(int64_t)100}, &hll3},

{{(int64_t)0}, &hll4},

{{Null()}, &empty_hll}};

static_cast<void>(check_function<DataTypeHLL, true>(func_name, input_types, data_set));

}

TEST(function_hll_test, function_to_hll_negative_input_test) {

std::string func_name = "to_hll";

InputTypeSet input_types = {PrimitiveType::TYPE_BIGINT};

HyperLogLog dummy_hll;

// Negative BIGINT input should cause function execution to fail.

DataSet data_set = {{{(int64_t)-1}, &dummy_hll}};

EXPECT_FALSE(check_function<DataTypeHLL>(func_name, input_types, data_set));

}

HappenLee · 2026-03-03T12:38:15Z

run buidall

doris-robot · 2026-03-03T13:59:38Z

BE UT Coverage Report

Increment line coverage 70.00% (21/30) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	52.58% (19644/37357)
Line Coverage	36.20% (183376/506586)
Region Coverage	32.48% (142169/437764)
Branch Coverage	33.45% (61705/184449)

HappenLee · 2026-03-04T01:54:52Z

run buildall

doris-robot · 2026-03-04T02:29:37Z

TPC-H: Total hot run time: 28759 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit cbea9552a1c1b59c015a9b22f7676dc7a3f2a5a8, data reload: false

------ Round 1 ----------------------------------
============================================
q1	17693	4451	4336	4336
q2	q3	10656	788	519	519
q4	4681	350	253	253
q5	7558	1189	1031	1031
q6	172	174	147	147
q7	794	831	679	679
q8	9291	1456	1367	1367
q9	4895	4796	4679	4679
q10	6826	1867	1637	1637
q11	466	263	236	236
q12	730	573	465	465
q13	17764	4207	3410	3410
q14	227	232	209	209
q15	954	813	794	794
q16	766	726	659	659
q17	735	861	403	403
q18	5910	5469	5195	5195
q19	1109	961	621	621
q20	502	492	399	399
q21	4715	1969	1459	1459
q22	378	324	261	261
Total cold run time: 96822 ms
Total hot run time: 28759 ms

----- Round 2, with runtime_filter_mode=off -----
============================================
q1	4633	4610	4624	4610
q2	q3	1791	2165	1795	1795
q4	865	1193	772	772
q5	4100	4398	4302	4302
q6	185	173	138	138
q7	1799	1687	1586	1586
q8	2509	2723	2519	2519
q9	7473	7421	7349	7349
q10	2721	2816	2403	2403
q11	503	422	413	413
q12	510	615	478	478
q13	4030	4409	3602	3602
q14	282	300	295	295
q15	880	828	815	815
q16	725	752	712	712
q17	1217	1567	1324	1324
q18	7171	6851	6812	6812
q19	866	849	889	849
q20	2072	2204	2074	2074
q21	4106	3691	3402	3402
q22	468	432	380	380
Total cold run time: 48906 ms
Total hot run time: 46630 ms

doris-robot · 2026-03-04T02:40:59Z

TPC-DS: Total hot run time: 183883 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit cbea9552a1c1b59c015a9b22f7676dc7a3f2a5a8, data reload: false

query5	4320	630	520	520
query6	328	214	195	195
query7	4215	479	278	278
query8	352	255	240	240
query9	8733	2782	2751	2751
query10	526	396	350	350
query11	16904	17378	17120	17120
query12	263	167	173	167
query13	2069	486	361	361
query14	6999	3462	3094	3094
query14_1	2920	2903	2867	2867
query15	207	207	185	185
query16	1031	504	480	480
query17	1271	746	639	639
query18	2620	467	362	362
query19	280	216	187	187
query20	141	140	130	130
query21	218	138	118	118
query22	5570	5540	4992	4992
query23	17325	16809	16646	16646
query23_1	16667	16726	16714	16714
query24	7365	1601	1234	1234
query24_1	1229	1241	1246	1241
query25	605	445	391	391
query26	1230	272	156	156
query27	2734	462	281	281
query28	4443	1853	1890	1853
query29	786	555	480	480
query30	309	243	216	216
query31	867	724	639	639
query32	84	72	71	71
query33	502	338	281	281
query34	907	913	573	573
query35	619	671	603	603
query36	1058	1120	982	982
query37	127	95	83	83
query38	2957	2935	2867	2867
query39	1019	856	872	856
query39_1	835	835	832	832
query40	231	152	136	136
query41	64	61	58	58
query42	106	102	105	102
query43	386	380	347	347
query44	
query45	205	193	178	178
query46	883	979	603	603
query47	2111	2156	2029	2029
query48	308	308	226	226
query49	654	502	382	382
query50	683	293	221	221
query51	4071	4117	4166	4117
query52	106	107	93	93
query53	289	330	294	294
query54	293	262	260	260
query55	96	82	78	78
query56	311	311	309	309
query57	1371	1333	1249	1249
query58	307	272	268	268
query59	2548	2651	2559	2559
query60	335	330	323	323
query61	147	144	151	144
query62	640	591	552	552
query63	311	283	272	272
query64	4849	1264	985	985
query65	
query66	1444	451	345	345
query67	16203	16336	16250	16250
query68	
query69	404	301	282	282
query70	999	983	969	969
query71	342	311	305	305
query72	2796	2605	2407	2407
query73	543	545	325	325
query74	9994	9922	9769	9769
query75	2870	2765	2492	2492
query76	2300	1040	674	674
query77	364	382	310	310
query78	11172	11352	10645	10645
query79	2525	805	633	633
query80	1792	640	582	582
query81	566	274	249	249
query82	1012	147	113	113
query83	333	258	241	241
query84	248	119	94	94
query85	876	485	435	435
query86	414	336	290	290
query87	3183	3070	3060	3060
query88	3546	2677	2684	2677
query89	424	370	341	341
query90	2000	174	169	169
query91	166	150	143	143
query92	82	79	72	72
query93	1059	824	515	515
query94	635	324	275	275
query95	581	394	304	304
query96	621	517	231	231
query97	2506	2487	2395	2395
query98	251	233	239	233
query99	1007	1004	952	952
Total cold run time: 256345 ms
Total hot run time: 183883 ms

hello-stephen · 2026-03-04T03:10:54Z

FE UT Coverage Report

Increment line coverage 8.33% (1/12) 🎉
Increment coverage report
Complete coverage report

hello-stephen · 2026-03-04T03:21:17Z

BE UT Coverage Report

Increment line coverage 70.00% (21/30) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	52.61% (19657/37361)
Line Coverage	36.21% (183489/506691)
Region Coverage	32.51% (142336/437868)
Branch Coverage	33.47% (61747/184492)

hello-stephen · 2026-03-04T04:56:48Z

BE Regression && UT Coverage Report

Increment line coverage 70.00% (21/30) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	73.36% (26837/36583)
Line Coverage	56.63% (286073/505138)
Region Coverage	54.07% (238973/442005)
Branch Coverage	55.71% (103098/185056)

hello-stephen · 2026-03-04T05:07:13Z

FE Regression Coverage Report

Increment line coverage 8.33% (1/12) 🎉
Increment coverage report
Complete coverage report

hello-stephen · 2026-03-04T10:17:38Z

BE Regression && UT Coverage Report

Increment line coverage 70.00% (21/30) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	73.35% (26832/36583)
Line Coverage	56.61% (285955/505138)
Region Coverage	54.05% (238882/442005)
Branch Coverage	55.68% (103045/185056)

hello-stephen · 2026-03-04T10:27:48Z

FE Regression Coverage Report

Increment line coverage 8.33% (1/12) 🎉
Increment coverage report
Complete coverage report

HappenLee · 2026-03-04T11:49:14Z

run buildall

doris-robot · 2026-03-04T12:30:04Z

TPC-H: Total hot run time: 28842 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 644a371416e33ab20451719fcd53eb467923cef9, data reload: false

------ Round 1 ----------------------------------
============================================
q1	17647	4463	4285	4285
q2	q3	10644	765	516	516
q4	4671	351	256	256
q5	7555	1208	1028	1028
q6	175	176	144	144
q7	778	849	656	656
q8	9708	1465	1328	1328
q9	5275	4756	4721	4721
q10	6838	1871	1638	1638
q11	474	253	227	227
q12	732	557	461	461
q13	17775	4200	3433	3433
q14	227	222	208	208
q15	938	794	788	788
q16	738	726	667	667
q17	731	867	388	388
q18	5890	5406	5311	5311
q19	1283	962	599	599
q20	517	489	374	374
q21	4810	1964	1550	1550
q22	336	294	264	264
Total cold run time: 97742 ms
Total hot run time: 28842 ms

----- Round 2, with runtime_filter_mode=off -----
============================================
q1	4624	4607	4714	4607
q2	q3	1800	2191	1790	1790
q4	885	1176	765	765
q5	4043	4350	4397	4350
q6	181	176	136	136
q7	1754	1678	1524	1524
q8	2446	2734	2544	2544
q9	7548	7295	7509	7295
q10	2625	2891	2452	2452
q11	500	422	406	406
q12	490	595	455	455
q13	4135	4422	3613	3613
q14	285	296	273	273
q15	867	804	805	804
q16	772	757	723	723
q17	1197	1543	1285	1285
q18	7094	6892	6675	6675
q19	883	834	895	834
q20	2088	2191	2038	2038
q21	4073	3450	3436	3436
q22	454	417	370	370
Total cold run time: 48744 ms
Total hot run time: 46375 ms

doris-robot · 2026-03-04T12:41:29Z

TPC-DS: Total hot run time: 184251 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 644a371416e33ab20451719fcd53eb467923cef9, data reload: false

query5	4651	651	510	510
query6	329	211	196	196
query7	4218	466	266	266
query8	346	251	231	231
query9	8801	2749	2692	2692
query10	513	395	338	338
query11	17048	17816	17338	17338
query12	209	134	134	134
query13	1325	480	379	379
query14	7381	3418	2991	2991
query14_1	2949	2942	3034	2942
query15	210	206	183	183
query16	1008	465	462	462
query17	1301	749	607	607
query18	4031	437	334	334
query19	215	206	178	178
query20	141	131	133	131
query21	221	162	132	132
query22	5813	5232	5053	5053
query23	17126	16798	16660	16660
query23_1	16773	16632	16692	16632
query24	7057	1638	1255	1255
query24_1	1248	1240	1226	1226
query25	550	478	393	393
query26	1222	255	156	156
query27	2751	465	293	293
query28	4498	1854	1864	1854
query29	797	554	467	467
query30	312	245	209	209
query31	876	730	633	633
query32	80	67	78	67
query33	519	331	279	279
query34	908	910	561	561
query35	630	676	591	591
query36	1087	1133	1014	1014
query37	135	95	84	84
query38	2995	2913	2932	2913
query39	1001	847	855	847
query39_1	819	825	825	825
query40	235	150	133	133
query41	61	60	60	60
query42	104	102	101	101
query43	364	374	346	346
query44	
query45	199	188	182	182
query46	871	978	617	617
query47	2109	2166	2060	2060
query48	318	318	232	232
query49	624	456	414	414
query50	683	288	217	217
query51	4074	4111	4097	4097
query52	107	111	98	98
query53	290	340	279	279
query54	294	270	258	258
query55	92	84	82	82
query56	329	304	298	298
query57	1367	1349	1264	1264
query58	299	282	272	272
query59	2611	2669	2527	2527
query60	335	336	327	327
query61	150	138	151	138
query62	636	588	547	547
query63	312	277	275	275
query64	4838	1255	969	969
query65	
query66	1389	455	349	349
query67	16430	16387	16386	16386
query68	
query69	393	302	279	279
query70	960	981	944	944
query71	348	315	299	299
query72	2707	2834	2445	2445
query73	551	540	321	321
query74	10019	9962	9743	9743
query75	2838	2745	2477	2477
query76	2284	1029	664	664
query77	386	413	314	314
query78	11114	11269	10681	10681
query79	2989	800	596	596
query80	1762	639	566	566
query81	561	290	248	248
query82	987	151	118	118
query83	344	274	249	249
query84	257	134	106	106
query85	978	550	518	518
query86	408	301	328	301
query87	3092	3078	3017	3017
query88	3542	2675	2648	2648
query89	440	363	349	349
query90	1894	180	175	175
query91	173	174	149	149
query92	81	79	78	78
query93	1190	840	510	510
query94	643	345	282	282
query95	585	400	315	315
query96	647	512	226	226
query97	2457	2467	2464	2464
query98	234	220	210	210
query99	1008	997	913	913
Total cold run time: 258426 ms
Total hot run time: 184251 ms

Add a new built-in scalar function to_hll(bigint) that constructs an HLL value from a BIGINT directly, bypassing any hash — the bigint is fed as-is into HyperLogLog::update(uint64_t). Changes: - BE (vec/functions/function_hll.cpp): implement struct ToHll with Status-returning execute/vector/vector_nullable methods; register as FunctionAlwaysNotNullable<ToHll, true> so InvalidArgument status is propagated on negative input. - BE test (be/test/vec/function/function_hll_test.cpp): add unit test function_hll_test.function_to_hll_test covering values 0, 1, 2, 100. - FE Nereids (scalar/ToHll.java): new ScalarFunction with signature BIGINT -> HLL. - FE catalog (BuiltinScalarFunctions.java): register to_hll. - FE visitor (ScalarFunctionVisitor.java): add visitToHll visitor hook.

HappenLee · 2026-03-04T12:58:35Z

run buildall

doris-robot · 2026-03-04T13:21:23Z

TPC-H: Total hot run time: 28948 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 697bd4d9548658b747f0762d9c258b127e1ec128, data reload: false

------ Round 1 ----------------------------------
============================================
q1	17631	4455	4331	4331
q2	q3	10653	779	521	521
q4	4682	361	254	254
q5	7554	1228	1009	1009
q6	176	172	153	153
q7	759	843	656	656
q8	9291	1440	1317	1317
q9	4754	4714	4736	4714
q10	6761	1860	1643	1643
q11	434	261	234	234
q12	706	568	484	484
q13	17775	4220	3391	3391
q14	235	232	213	213
q15	912	791	790	790
q16	726	728	692	692
q17	723	874	410	410
q18	5992	5398	5287	5287
q19	1255	969	628	628
q20	493	490	396	396
q21	5227	1982	1553	1553
q22	384	311	272	272
Total cold run time: 97123 ms
Total hot run time: 28948 ms

----- Round 2, with runtime_filter_mode=off -----
============================================
q1	4629	4584	4463	4463
q2	q3	1779	2193	1783	1783
q4	861	1181	789	789
q5	4038	4388	4282	4282
q6	207	169	138	138
q7	1795	1653	1496	1496
q8	2493	2880	2530	2530
q9	7613	7323	7353	7323
q10	2617	2774	2380	2380
q11	509	433	427	427
q12	516	607	444	444
q13	4059	4389	3647	3647
q14	290	315	297	297
q15	913	797	816	797
q16	749	804	732	732
q17	1173	1514	1331	1331
q18	7080	6776	6584	6584
q19	937	992	1026	992
q20	2080	2174	1996	1996
q21	3944	3504	3412	3412
q22	474	441	373	373
Total cold run time: 48756 ms
Total hot run time: 46216 ms

doris-robot · 2026-03-04T13:32:46Z

TPC-DS: Total hot run time: 183969 ms

machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 697bd4d9548658b747f0762d9c258b127e1ec128, data reload: false

query5	5197	666	525	525
query6	345	237	212	212
query7	4231	469	270	270
query8	350	253	240	240
query9	8772	2801	2753	2753
query10	576	379	333	333
query11	16998	17435	17327	17327
query12	226	146	132	132
query13	1302	484	371	371
query14	7584	3351	3120	3120
query14_1	3001	2956	2858	2858
query15	216	198	192	192
query16	1026	514	492	492
query17	1364	767	649	649
query18	2748	430	358	358
query19	219	213	183	183
query20	139	127	123	123
query21	213	136	118	118
query22	4812	4830	4773	4773
query23	17214	16813	16593	16593
query23_1	16708	16752	16676	16676
query24	6889	1621	1230	1230
query24_1	1221	1236	1242	1236
query25	566	495	476	476
query26	1233	256	143	143
query27	2777	467	285	285
query28	4513	1894	1874	1874
query29	796	564	462	462
query30	311	246	214	214
query31	860	746	632	632
query32	82	72	74	72
query33	537	331	288	288
query34	916	924	553	553
query35	633	684	588	588
query36	1095	1145	984	984
query37	148	91	83	83
query38	2995	2976	2856	2856
query39	890	874	843	843
query39_1	831	840	834	834
query40	226	147	134	134
query41	78	74	72	72
query42	111	108	112	108
query43	397	381	359	359
query44	
query45	194	188	182	182
query46	881	969	615	615
query47	2108	2148	2040	2040
query48	305	326	238	238
query49	629	464	371	371
query50	684	276	216	216
query51	4053	4121	4059	4059
query52	109	111	99	99
query53	292	333	277	277
query54	287	277	272	272
query55	85	88	81	81
query56	306	297	348	297
query57	1374	1329	1250	1250
query58	301	288	276	276
query59	2597	2724	2508	2508
query60	342	336	338	336
query61	151	150	153	150
query62	622	583	532	532
query63	308	277	279	277
query64	4809	1268	988	988
query65	
query66	1384	455	358	358
query67	16426	16409	16366	16366
query68	
query69	395	299	282	282
query70	961	959	922	922
query71	349	310	304	304
query72	2766	2746	2435	2435
query73	546	548	319	319
query74	10018	9949	9820	9820
query75	2853	2758	2449	2449
query76	2289	1045	667	667
query77	361	382	314	314
query78	11253	11367	10650	10650
query79	2958	799	619	619
query80	1812	629	545	545
query81	562	280	251	251
query82	1018	147	119	119
query83	341	260	242	242
query84	253	126	106	106
query85	886	521	434	434
query86	422	297	291	291
query87	3118	3130	3019	3019
query88	3553	2682	2681	2681
query89	424	366	340	340
query90	1994	181	174	174
query91	166	158	139	139
query92	79	75	69	69
query93	1169	849	516	516
query94	639	317	283	283
query95	569	341	310	310
query96	638	520	234	234
query97	2432	2494	2465	2465
query98	229	226	216	216
query99	926	997	945	945
Total cold run time: 257319 ms
Total hot run time: 183969 ms

hello-stephen · 2026-03-04T14:10:23Z

FE UT Coverage Report

Increment line coverage 8.33% (1/12) 🎉
Increment coverage report
Complete coverage report

hello-stephen · 2026-03-04T14:11:12Z

BE UT Coverage Report

Increment line coverage 83.87% (26/31) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	52.58% (19656/37382)
Line Coverage	36.20% (183537/506985)
Region Coverage	32.51% (142419/438102)
Branch Coverage	33.45% (61766/184655)

hello-stephen · 2026-03-04T15:52:07Z

BE Regression && UT Coverage Report

Increment line coverage 83.87% (26/31) 🎉

Increment coverage report
Complete coverage report

Category	Coverage
Function Coverage	73.27% (26820/36604)
Line Coverage	56.59% (286006/505432)
Region Coverage	54.05% (239014/442239)
Branch Coverage	55.61% (103005/185219)

hello-stephen · 2026-03-04T16:07:16Z

FE Regression Coverage Report

Increment line coverage 8.33% (1/12) 🎉
Increment coverage report
Complete coverage report

Copilot AI review requested due to automatic review settings March 3, 2026 12:07

HappenLee requested a review from zclllyybb as a code owner March 3, 2026 12:07

Copilot started reviewing on behalf of HappenLee March 3, 2026 12:08 View session

Copilot AI reviewed Mar 3, 2026

View reviewed changes

HappenLee force-pushed the hll branch from b57dd74 to cbea955 Compare March 3, 2026 12:38

HappenLee force-pushed the hll branch from cbea955 to 644a371 Compare March 4, 2026 11:48

HappenLee force-pushed the hll branch 2 times, most recently from 58dc8d4 to 697bd4d Compare March 4, 2026 12:58

-    DataSet data_set = {{{(int64_t)1}, &hll1},
-                        {{(int64_t)2}, &hll2},
-                        {{(int64_t)100}, &hll3},
-                        {{(int64_t)0}, &hll4}};
-    static_cast<void>(check_function<DataTypeHLL>(func_name, input_types, data_set));
-}
+    HyperLogLog empty_hll;
+    DataSet data_set = {{{(int64_t)1}, &hll1},
+                        {{(int64_t)2}, &hll2},
+                        {{(int64_t)100}, &hll3},
+                        {{(int64_t)0}, &hll4},
+                        {{Null()}, &empty_hll}};
+    static_cast<void>(check_function<DataTypeHLL, true>(func_name, input_types, data_set));
+}
+TEST(function_hll_test, function_to_hll_negative_input_test) {
+    std::string func_name = "to_hll";
+    InputTypeSet input_types = {PrimitiveType::TYPE_BIGINT};
+    HyperLogLog dummy_hll;
+    // Negative BIGINT input should cause function execution to fail.
+    DataSet data_set = {{{(int64_t)-1}, &dummy_hll}};
+    EXPECT_FALSE(check_function<DataTypeHLL>(func_name, input_types, data_set));
+}

Conversation

HappenLee commented Mar 3, 2026

What problem does this PR solve?

Release note

Check List (For Author)

Check List (For Reviewer who merge this PR)

Uh oh!

hello-stephen commented Mar 3, 2026

Uh oh!

HappenLee commented Mar 3, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

HappenLee commented Mar 3, 2026

Uh oh!

doris-robot commented Mar 3, 2026

BE UT Coverage Report

Uh oh!

HappenLee commented Mar 4, 2026

Uh oh!

doris-robot commented Mar 4, 2026

Uh oh!

doris-robot commented Mar 4, 2026

Uh oh!

hello-stephen commented Mar 4, 2026

FE UT Coverage Report

Uh oh!

hello-stephen commented Mar 4, 2026

BE UT Coverage Report

Uh oh!

hello-stephen commented Mar 4, 2026

BE Regression && UT Coverage Report

Uh oh!

hello-stephen commented Mar 4, 2026

FE Regression Coverage Report

Uh oh!

hello-stephen commented Mar 4, 2026

BE Regression && UT Coverage Report

Uh oh!

hello-stephen commented Mar 4, 2026

FE Regression Coverage Report

Uh oh!

HappenLee commented Mar 4, 2026

Uh oh!

doris-robot commented Mar 4, 2026

Uh oh!

doris-robot commented Mar 4, 2026

Uh oh!

HappenLee commented Mar 4, 2026

Uh oh!

doris-robot commented Mar 4, 2026

Uh oh!

doris-robot commented Mar 4, 2026

Uh oh!

hello-stephen commented Mar 4, 2026

FE UT Coverage Report

Uh oh!

hello-stephen commented Mar 4, 2026

BE UT Coverage Report

Uh oh!

hello-stephen commented Mar 4, 2026

BE Regression && UT Coverage Report

Uh oh!

hello-stephen commented Mar 4, 2026

FE Regression Coverage Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone