Skip to content

Conversation

@mrhhsg
Copy link
Member

@mrhhsg mrhhsg commented Jul 30, 2025

What problem does this PR solve?

Doc: apache/doris-website#2676

Related PR: #xxx

Problem Summary:

Release note

None

Check List (For Author)

  • Test

    • Regression test
    • Unit Test
    • Manual test (add detailed scripts or steps below)
    • No need to test or manual test. Explain why:
      • This is a refactor/code format and no logic has been changed.
      • Previous test can cover this change.
      • No code files have been changed.
      • Other reason
  • Behavior changed:

    • No.
    • Yes.
  • Does this need documentation?

    • No.
    • Yes.

Check List (For Reviewer who merge this PR)

  • Confirm the release note
  • Confirm test cases
  • Confirm document
  • Add branch pick label

@hello-stephen
Copy link
Contributor

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

  1. What problem was fixed (it's best to include specific error reporting information). How it was fixed.
  2. Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
  3. What features were added. Why was this function added?
  4. Which code was refactored and why was this part of the code refactored?
  5. Which functions were optimized and what is the difference before and after the optimization?

@mrhhsg
Copy link
Member Author

mrhhsg commented Jul 30, 2025

run buildall

@github-actions
Copy link
Contributor

Possible file(s) that should be tracked in LFS detected: 🚨

The following file(s) exceeds the file size limit: 1048576 bytes, as set in the .yml configuration files:

  • regression-test/data/datatype_p0/nested_types/base_cases/one_level_nestedtypes_with_s3data.out

Consider using git-lfs to manage large files.

@github-actions github-actions bot added the lfs-detected! Warning Label for use when LFS is detected in the commits of a Pull Request label Jul 30, 2025
@doris-robot
Copy link

TPC-H: Total hot run time: 33786 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit a3362157c4ad087d2e836a385a9af10906686059, data reload: false

------ Round 1 ----------------------------------
q1	17654	5613	5383	5383
q2	1946	278	188	188
q3	10308	1328	714	714
q4	10244	989	530	530
q5	7471	2399	2302	2302
q6	174	168	130	130
q7	880	748	665	665
q8	9332	1313	1022	1022
q9	7106	5036	5221	5036
q10	6882	2353	1988	1988
q11	462	282	263	263
q12	342	364	225	225
q13	17770	3483	2976	2976
q14	238	251	213	213
q15	515	464	468	464
q16	413	428	375	375
q17	554	848	348	348
q18	7473	7025	6975	6975
q19	1222	981	527	527
q20	321	308	208	208
q21	3412	2442	2247	2247
q22	1082	1127	1007	1007
Total cold run time: 105801 ms
Total hot run time: 33786 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5720	5689	5714	5689
q2	235	303	215	215
q3	2109	2564	2210	2210
q4	1307	1712	1294	1294
q5	4122	4506	4383	4383
q6	208	187	133	133
q7	1944	1983	1891	1891
q8	2574	2553	2528	2528
q9	7418	7364	7367	7364
q10	3128	3313	2960	2960
q11	533	508	485	485
q12	749	758	634	634
q13	3430	3895	3128	3128
q14	297	293	271	271
q15	492	432	444	432
q16	440	500	442	442
q17	1152	1544	1361	1361
q18	11870	8233	7660	7660
q19	8818	958	977	958
q20	2027	1969	1800	1800
q21	14889	4249	4341	4249
q22	1088	1031	1021	1021
Total cold run time: 74550 ms
Total hot run time: 51108 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 173861 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit a3362157c4ad087d2e836a385a9af10906686059, data reload: false

============================================
query1	1019	394	433	394
query2	6536	1763	1674	1674
query3	6737	227	220	220
query4	26490	23565	23248	23248
query5	4329	607	528	528
query6	328	238	233	233
query7	4632	518	302	302
query8	284	240	234	234
query9	8641	3208	3183	3183
query10	505	343	297	297
query11	15569	14958	14845	14845
query12	182	134	142	134
query13	1662	551	432	432
query14	9550	7713	7839	7713
query15	209	188	167	167
query16	7822	646	463	463
query17	1637	787	682	682
query18	2092	457	326	326
query19	302	218	197	197
query20	154	143	137	137
query21	222	125	121	121
query22	3935	4005	4028	4005
query23	34425	34230	34424	34230
query24	8161	2353	2401	2353
query25	565	521	449	449
query26	720	292	160	160
query27	2582	499	348	348
query28	4194	2300	2317	2300
query29	669	601	482	482
query30	296	232	200	200
query31	882	777	695	695
query32	88	76	73	73
query33	568	425	387	387
query34	830	840	521	521
query35	829	813	761	761
query36	1034	1070	943	943
query37	134	106	92	92
query38	3966	3965	3965	3965
query39	1414	1363	1363	1363
query40	234	142	142	142
query41	58	58	52	52
query42	140	120	125	120
query43	520	494	490	490
query44	1403	856	874	856
query45	207	187	184	184
query46	954	1075	672	672
query47	1823	1902	1778	1778
query48	402	419	314	314
query49	694	500	422	422
query50	651	702	403	403
query51	5504	5591	5533	5533
query52	128	130	117	117
query53	264	286	216	216
query54	663	649	558	558
query55	95	89	90	89
query56	359	365	348	348
query57	1215	1234	1145	1145
query58	332	333	335	333
query59	2569	2551	2575	2551
query60	392	399	384	384
query61	126	123	120	120
query62	806	750	658	658
query63	250	211	212	211
query64	3066	1120	779	779
query65	4235	4104	4055	4055
query66	907	459	335	335
query67	query68	17925	1136	840	840
query69	1007	294	286	286
query70	1436	1152	1167	1152
query71	717	356	374	356
query72	9156	2308	2377	2308
query73	3453	643	357	357
query74	9080	8919	8715	8715
query75	7719	3153	2700	2700
query76	8787	1207	783	783
query77	1148	412	333	333
query78	9676	10766	9132	9132
query79	14600	638	595	595
query80	1845	551	502	502
query81	560	269	236	236
query82	355	150	116	116
query83	384	280	270	270
query84	304	95	82	82
query85	898	379	434	379
query86	378	311	318	311
query87	4284	4228	4120	4120
query88	3851	2230	2228	2228
query89	517	362	315	315
query90	2661	223	223	223
query91	144	140	110	110
query92	94	80	67	67
query93	4955	965	658	658
query94	1002	411	264	264
query95	409	327	322	322
query96	501	598	281	281
query97	2734	2752	2585	2585
query98	254	239	236	236
query99	1482	1380	1277	1277
Total cold run time: 304507 ms
Total hot run time: 173861 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 32.59 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit a3362157c4ad087d2e836a385a9af10906686059, data reload: false

query1	0.04	0.04	0.03
query2	0.08	0.04	0.04
query3	0.25	0.08	0.07
query4	1.61	0.11	0.10
query5	0.48	0.43	0.44
query6	1.17	0.69	0.67
query7	0.02	0.02	0.02
query8	0.05	0.03	0.03
query9	0.57	0.48	0.47
query10	0.53	0.52	0.52
query11	0.16	0.10	0.10
query12	0.15	0.11	0.11
query13	0.65	0.66	0.65
query14	0.93	1.09	1.19
query15	0.90	0.87	0.87
query16	0.38	0.40	0.41
query17	1.08	1.02	1.08
query18	0.21	0.21	0.20
query19	1.95	1.77	1.95
query20	0.01	0.01	0.02
query21	15.37	0.84	0.57
query22	0.75	1.18	0.63
query23	15.01	1.13	0.61
query24	6.59	1.72	0.41
query25	0.47	0.12	0.12
query26	0.55	0.16	0.14
query27	0.07	0.05	0.06
query28	10.24	0.86	0.45
query29	12.62	3.80	3.28
query30	3.00	2.98	2.94
query31	2.81	0.57	0.39
query32	3.24	0.57	0.48
query33	2.97	3.15	3.21
query34	15.98	5.34	4.99
query35	4.89	4.93	4.93
query36	0.70	0.51	0.49
query37	0.09	0.07	0.08
query38	0.06	0.04	0.04
query39	0.03	0.03	0.03
query40	0.18	0.15	0.14
query41	0.09	0.03	0.02
query42	0.04	0.03	0.03
query43	0.04	0.03	0.04
Total cold run time: 107.01 s
Total hot run time: 32.59 s

@doris-robot
Copy link

BE UT Coverage Report

Increment line coverage 75.56% (68/90) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 57.77% (16061/27801)
Line Coverage 46.50% (144460/310695)
Region Coverage 35.74% (108688/304071)
Branch Coverage 38.38% (47998/125051)

@hello-stephen
Copy link
Contributor

BE Regression && UT Coverage Report

Increment line coverage 95.56% (86/90) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 81.19% (22159/27294)
Line Coverage 73.86% (229180/310287)
Region Coverage 61.55% (191914/311826)
Branch Coverage 65.35% (82639/126461)

@mrhhsg mrhhsg force-pushed the deduplicate_keys branch from a336215 to 1627002 Compare July 31, 2025 06:46
@mrhhsg
Copy link
Member Author

mrhhsg commented Jul 31, 2025

run buildall

@github-actions
Copy link
Contributor

Possible file(s) that should be tracked in LFS detected: 🚨

The following file(s) exceeds the file size limit: 1048576 bytes, as set in the .yml configuration files:

  • regression-test/data/datatype_p0/nested_types/base_cases/one_level_nestedtypes_with_s3data.out

Consider using git-lfs to manage large files.

@hello-stephen
Copy link
Contributor

FE UT Coverage Report

Increment line coverage 13.89% (5/36) 🎉
Increment coverage report
Complete coverage report

@doris-robot
Copy link

TPC-H: Total hot run time: 34171 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 1627002b6d91ad8c9fe786240dc06656fafa61b1, data reload: false

------ Round 1 ----------------------------------
q1	17772	5683	5585	5585
q2	1914	299	190	190
q3	10705	1358	711	711
q4	10333	989	513	513
q5	9948	2238	2357	2238
q6	221	165	130	130
q7	988	775	602	602
q8	9306	1317	1127	1127
q9	7248	5085	5094	5085
q10	6970	2377	1977	1977
q11	480	273	258	258
q12	368	362	219	219
q13	17803	3402	2962	2962
q14	244	240	223	223
q15	545	476	459	459
q16	420	442	379	379
q17	560	787	417	417
q18	9727	7124	7116	7116
q19	1523	968	530	530
q20	308	324	205	205
q21	8015	3015	2270	2270
q22	1088	1045	975	975
Total cold run time: 116486 ms
Total hot run time: 34171 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5949	5769	5732	5732
q2	285	308	211	211
q3	2097	2650	2538	2538
q4	1408	1764	1475	1475
q5	4568	4371	4405	4371
q6	221	177	127	127
q7	2041	1855	1842	1842
q8	2528	2439	2535	2439
q9	7389	7605	7427	7427
q10	3131	3365	2932	2932
q11	547	515	483	483
q12	710	780	619	619
q13	3376	3688	3265	3265
q14	299	345	331	331
q15	498	456	452	452
q16	457	495	453	453
q17	1222	1523	1329	1329
q18	7896	7892	7313	7313
q19	9535	906	839	839
q20	1855	1949	1752	1752
q21	6221	4291	4253	4253
q22	1062	1043	1023	1023
Total cold run time: 63295 ms
Total hot run time: 51206 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 162234 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 1627002b6d91ad8c9fe786240dc06656fafa61b1, data reload: false

reason	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 22:43:56	2023-12-26 22:44:01	NULL	utf-8	NULL	NULL	
============================================
query1	1005	402	443	402
query2	6547	1834	1694	1694
query3	6740	222	217	217
query4	27628	23623	22921	22921
query5	4323	608	505	505
query6	314	235	215	215
query7	4631	508	300	300
query8	285	248	238	238
query9	8638	3115	3165	3115
query10	489	322	292	292
query11	15418	14901	14702	14702
query12	181	133	129	129
query13	1654	533	395	395
query14	9622	7520	7579	7520
query15	212	203	168	168
query16	7980	645	486	486
query17	1660	818	627	627
query18	2116	470	333	333
query19	294	250	189	189
query20	157	143	141	141
query21	222	132	112	112
query22	3958	4024	3848	3848
query23	34334	34508	34376	34376
query24	7848	2375	2440	2375
query25	579	527	460	460
query26	919	299	163	163
query27	2599	509	425	425
query28	4239	2326	2319	2319
query29	691	603	489	489
query30	307	236	206	206
query31	873	787	812	787
query32	90	80	77	77
query33	571	414	387	387
query34	801	840	519	519
query35	827	830	738	738
query36	1026	1033	930	930
query37	135	108	90	90
query38	4024	4055	4013	4013
query39	1443	1404	1359	1359
query40	251	155	130	130
query41	62	56	54	54
query42	140	122	126	122
query43	520	513	479	479
query44	1387	862	850	850
query45	206	185	186	185
query46	958	1056	672	672
query47	1792	1895	1766	1766
query48	389	437	323	323
query49	703	509	424	424
query50	634	688	417	417
query51	4141	4210	4113	4113
query52	127	134	114	114
query53	277	294	210	210
query54	657	642	558	558
query55	93	86	90	86
query56	355	379	357	357
query57	1209	1256	1123	1123
query58	336	334	328	328
query59	2643	2714	2514	2514
query60	412	425	400	400
query61	125	122	123	122
query62	814	726	648	648
query63	253	218	215	215
query64	3619	1088	776	776
query65	4316	4123	4148	4123
query66	974	452	344	344
query67	query68	17209	594	585	585
query69	1006	294	283	283
query70	1398	1108	1164	1108
query71	726	333	320	320
query72	9213	2287	2386	2287
query73	3578	625	350	350
query74	9006	8919	8642	8642
query75	7487	3138	2654	2654
query76	8699	1201	772	772
query77	1166	402	373	373
query78	9701	11615	query79	16056	613	579	579
query80	3809	631	486	486
query81	571	275	228	228
query82	527	150	117	117
query83	395	290	283	283
query84	294	98	81	81
query85	1026	385	333	333
query86	352	364	293	293
query87	4318	4240	4187	4187
query88	5489	2187	2193	2187
query89	500	377	315	315
query90	2491	223	219	219
query91	145	140	112	112
query92	92	73	67	67
query93	7043	1001	657	657
query94	1414	399	280	280
query95	415	333	310	310
query96	495	584	279	279
query97	2688	2740	2598	2598
query98	257	239	222	222
query99	1455	1431	1312	1312
Total cold run time: 302093 ms
Total hot run time: 162234 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 32.76 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 1627002b6d91ad8c9fe786240dc06656fafa61b1, data reload: false

query1	0.04	0.03	0.03
query2	0.08	0.04	0.04
query3	0.24	0.07	0.07
query4	1.63	0.11	0.11
query5	0.42	0.41	0.44
query6	1.18	0.67	0.69
query7	0.02	0.01	0.02
query8	0.05	0.03	0.03
query9	0.56	0.48	0.46
query10	0.53	0.54	0.51
query11	0.16	0.10	0.10
query12	0.15	0.11	0.11
query13	0.64	0.64	0.66
query14	0.88	1.22	0.96
query15	0.94	0.92	0.87
query16	0.38	0.41	0.40
query17	1.04	1.05	1.05
query18	0.22	0.21	0.20
query19	2.04	1.84	1.91
query20	0.02	0.01	0.01
query21	15.39	0.86	0.55
query22	0.77	1.01	0.59
query23	15.14	1.11	0.61
query24	6.66	0.76	0.64
query25	0.53	0.23	0.14
query26	0.79	0.16	0.12
query27	0.08	0.06	0.06
query28	9.19	0.84	0.43
query29	12.55	3.81	3.32
query30	3.11	3.10	2.96
query31	2.81	0.56	0.39
query32	3.23	0.57	0.50
query33	3.09	3.14	3.22
query34	15.95	5.39	4.99
query35	4.84	4.95	4.96
query36	0.69	0.52	0.50
query37	0.10	0.07	0.07
query38	0.05	0.05	0.04
query39	0.03	0.02	0.03
query40	0.17	0.14	0.14
query41	0.08	0.03	0.03
query42	0.04	0.02	0.03
query43	0.04	0.03	0.03
Total cold run time: 106.55 s
Total hot run time: 32.76 s

@doris-robot
Copy link

BE UT Coverage Report

Increment line coverage 75.56% (68/90) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 57.84% (16081/27804)
Line Coverage 46.55% (144702/310881)
Region Coverage 35.74% (108695/304167)
Branch Coverage 38.42% (48049/125059)

@mrhhsg mrhhsg force-pushed the deduplicate_keys branch from 1627002 to b08eae0 Compare July 31, 2025 09:50
@github-actions
Copy link
Contributor

Possible file(s) that should be tracked in LFS detected: 🚨

The following file(s) exceeds the file size limit: 1048576 bytes, as set in the .yml configuration files:

  • regression-test/data/datatype_p0/nested_types/base_cases/one_level_nestedtypes_with_s3data.out

Consider using git-lfs to manage large files.

@mrhhsg
Copy link
Member Author

mrhhsg commented Jul 31, 2025

run buildall

@hello-stephen
Copy link
Contributor

BE UT Coverage Report

Increment line coverage 75.56% (68/90) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 57.85% (16067/27773)
Line Coverage 46.60% (144431/309940)
Region Coverage 35.82% (108661/303392)
Branch Coverage 38.50% (47974/124596)

@doris-robot
Copy link

TPC-H: Total hot run time: 34142 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit b08eae05dc5b370497c6f5167e24477282176ed9, data reload: false

------ Round 1 ----------------------------------
q1	17611	5637	5463	5463
q2	1936	294	194	194
q3	10314	1329	716	716
q4	10272	978	526	526
q5	8928	2278	2344	2278
q6	251	168	134	134
q7	888	773	626	626
q8	9333	1301	1076	1076
q9	7191	5115	5095	5095
q10	6996	2363	1986	1986
q11	471	281	271	271
q12	371	399	224	224
q13	17834	3506	3035	3035
q14	238	236	242	236
q15	542	464	459	459
q16	428	443	378	378
q17	562	815	387	387
q18	7431	6982	6910	6910
q19	1320	985	550	550
q20	330	315	225	225
q21	3360	3116	2364	2364
q22	1072	1049	1009	1009
Total cold run time: 107679 ms
Total hot run time: 34142 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5936	5865	5886	5865
q2	299	321	217	217
q3	2113	2665	2203	2203
q4	1339	1768	1319	1319
q5	4539	4463	4474	4463
q6	226	183	131	131
q7	1976	1956	1773	1773
q8	2543	2516	2728	2516
q9	7502	7234	7246	7234
q10	3327	3373	2910	2910
q11	549	509	484	484
q12	710	859	620	620
q13	3482	3763	3544	3544
q14	315	339	296	296
q15	490	506	446	446
q16	454	571	631	571
q17	1583	1529	1368	1368
q18	8010	7901	7765	7765
q19	15696	944	993	944
q20	2939	1988	1886	1886
q21	9196	4374	4288	4288
q22	1057	1033	1008	1008
Total cold run time: 74281 ms
Total hot run time: 51851 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 172407 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit b08eae05dc5b370497c6f5167e24477282176ed9, data reload: false

============================================
query1	1023	422	449	422
query2	6522	1839	1750	1750
query3	6740	233	219	219
query4	27239	23432	23471	23432
query5	4327	631	527	527
query6	343	255	231	231
query7	4632	518	300	300
query8	291	244	229	229
query9	8679	3181	3205	3181
query10	490	345	293	293
query11	15879	14896	14850	14850
query12	176	135	132	132
query13	1654	536	410	410
query14	9643	7675	7614	7614
query15	220	198	175	175
query16	7126	647	460	460
query17	970	757	613	613
query18	1980	450	322	322
query19	214	213	191	191
query20	149	154	141	141
query21	217	127	107	107
query22	3959	3927	3798	3798
query23	34663	34453	34261	34261
query24	8228	2424	2447	2424
query25	602	513	443	443
query26	1252	294	156	156
query27	2723	505	349	349
query28	4392	2358	2348	2348
query29	809	596	487	487
query30	299	227	203	203
query31	894	796	701	701
query32	85	79	77	77
query33	545	418	378	378
query34	821	864	522	522
query35	801	880	793	793
query36	1030	1049	951	951
query37	137	112	101	101
query38	3994	4028	3999	3999
query39	1527	1371	1356	1356
query40	245	147	134	134
query41	67	63	57	57
query42	138	129	135	129
query43	524	525	480	480
query44	1452	885	898	885
query45	197	190	184	184
query46	939	1052	668	668
query47	1805	1790	1769	1769
query48	402	428	336	336
query49	747	508	418	418
query50	676	710	411	411
query51	4204	4206	4131	4131
query52	130	139	115	115
query53	272	301	216	216
query54	660	648	567	567
query55	88	90	88	88
query56	363	358	357	357
query57	1200	1199	1142	1142
query58	341	334	333	333
query59	2767	2693	2572	2572
query60	417	393	388	388
query61	125	121	121	121
query62	835	757	651	651
query63	251	215	219	215
query64	4502	1184	849	849
query65	4151	4127	4136	4127
query66	1150	453	329	329
query67	query68	16912	856	857	856
query69	1161	286	300	286
query70	1524	1107	1101	1101
query71	722	321	325	321
query72	9293	2256	2182	2182
query73	3240	717	354	354
query74	9063	8961	8492	8492
query75	7561	3169	2620	2620
query76	8845	1227	779	779
query77	1150	419	338	338
query78	9701	10282	9251	9251
query79	14013	611	589	589
query80	1303	566	497	497
query81	587	277	234	234
query82	316	154	122	122
query83	379	291	277	277
query84	291	98	89	89
query85	812	360	331	331
query86	382	327	298	298
query87	4268	4242	4190	4190
query88	3900	2218	2181	2181
query89	474	368	322	322
query90	2566	229	227	227
query91	149	170	111	111
query92	93	71	69	69
query93	3996	994	667	667
query94	943	402	276	276
query95	404	329	319	319
query96	500	580	280	280
query97	2661	2722	2628	2628
query98	239	222	221	221
query99	1460	1363	1281	1281
Total cold run time: 302831 ms
Total hot run time: 172407 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 33.65 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit b08eae05dc5b370497c6f5167e24477282176ed9, data reload: false

query1	0.04	0.04	0.04
query2	0.08	0.04	0.04
query3	0.24	0.07	0.07
query4	1.62	0.11	0.11
query5	0.41	0.43	0.43
query6	1.15	0.68	0.66
query7	0.03	0.02	0.02
query8	0.05	0.04	0.03
query9	0.56	0.47	0.46
query10	0.52	0.52	0.52
query11	0.16	0.11	0.10
query12	0.15	0.11	0.11
query13	0.64	0.73	0.64
query14	0.88	1.10	1.07
query15	1.02	0.88	0.89
query16	0.39	0.40	0.38
query17	1.08	1.05	1.09
query18	0.23	0.20	0.20
query19	1.92	1.91	1.98
query20	0.01	0.02	0.01
query21	15.38	0.88	0.56
query22	0.75	0.98	0.59
query23	15.23	1.15	0.58
query24	6.47	1.49	1.45
query25	0.54	0.11	0.20
query26	0.44	0.17	0.13
query27	0.06	0.05	0.05
query28	10.69	0.83	0.44
query29	12.76	3.79	3.30
query30	3.03	2.96	2.99
query31	2.81	0.57	0.39
query32	3.24	0.56	0.49
query33	3.03	3.19	3.28
query34	16.33	5.31	4.92
query35	4.88	4.91	5.00
query36	0.70	0.53	0.51
query37	0.10	0.07	0.07
query38	0.06	0.05	0.04
query39	0.04	0.02	0.03
query40	0.17	0.13	0.13
query41	0.08	0.03	0.03
query42	0.03	0.02	0.02
query43	0.04	0.04	0.03
Total cold run time: 108.04 s
Total hot run time: 33.65 s

@hello-stephen
Copy link
Contributor

BE Regression && UT Coverage Report

Increment line coverage 95.56% (86/90) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 81.32% (22169/27262)
Line Coverage 74.07% (229311/309567)
Region Coverage 61.56% (191513/311109)
Branch Coverage 65.54% (82557/125968)

assert_cast<const ColumnArray&>(*k_arr).get_offsets_ptr());
}

Status ColumnMap::deduplicate_keys(bool recursive) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

两个block合并的时候也需要去重

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why?

@mrhhsg
Copy link
Member Author

mrhhsg commented Sep 10, 2025

run buildall

@github-actions
Copy link
Contributor

Possible file(s) that should be tracked in LFS detected: 🚨

The following file(s) exceeds the file size limit: 1048576 bytes, as set in the .yml configuration files:

  • regression-test/data/datatype_p0/nested_types/base_cases/one_level_nestedtypes_with_s3data.out

Consider using git-lfs to manage large files.

@doris-robot
Copy link

TPC-H: Total hot run time: 34630 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 24cea7b82e378aad454caa7028091bd2cf16ec8e, data reload: false

------ Round 1 ----------------------------------
q1	17584	5269	5054	5054
q2	2024	319	213	213
q3	10284	1422	721	721
q4	10232	1025	538	538
q5	7518	2462	2308	2308
q6	182	168	134	134
q7	950	759	648	648
q8	9370	1352	1077	1077
q9	7111	5103	5191	5103
q10	6933	2356	1989	1989
q11	506	308	274	274
q12	374	376	225	225
q13	17772	3689	3031	3031
q14	239	236	217	217
q15	573	499	492	492
q16	1001	1016	959	959
q17	597	872	367	367
q18	7547	7152	7132	7132
q19	1362	955	564	564
q20	342	344	237	237
q21	3840	3164	2336	2336
q22	1076	1051	1011	1011
Total cold run time: 107417 ms
Total hot run time: 34630 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5166	5099	5083	5083
q2	250	337	232	232
q3	2213	2710	2353	2353
q4	1318	1790	1332	1332
q5	4244	4623	4616	4616
q6	216	178	139	139
q7	2060	2016	1904	1904
q8	2668	2607	2593	2593
q9	7315	7464	7504	7464
q10	3102	3300	2878	2878
q11	584	533	524	524
q12	717	778	632	632
q13	3534	4044	3371	3371
q14	289	326	289	289
q15	528	475	479	475
q16	1066	1121	1066	1066
q17	1300	1589	1419	1419
q18	7886	7805	7659	7659
q19	812	813	830	813
q20	1956	2070	1854	1854
q21	4698	4320	4302	4302
q22	1103	1042	1011	1011
Total cold run time: 53025 ms
Total hot run time: 52009 ms

result_col_map_offsets[row] = offset;
}

RETURN_IF_ERROR(map_column->deduplicate_keys());
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

map_column->deduplicate_keys(true)? 比如是:map(2, map(2, 3, 2, 3))

const auto inner_rows = keys_column->size();
const auto rows = offsets_column->size();

if (recursive) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里是不是总是检查嵌套的map ,如果是就去去重一遍,这样调用 deduplicate_keys 不用考虑是否嵌套了。

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

理论上咱们生成一个 map column 时会就会去重,那么大部分场景下可能不需要递归式的去重。

@doris-robot
Copy link

TPC-DS: Total hot run time: 188693 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 24cea7b82e378aad454caa7028091bd2cf16ec8e, data reload: false

query1	1068	474	419	419
query2	6555	1683	1669	1669
query3	6753	231	223	223
query4	26018	23291	23220	23220
query5	4461	638	487	487
query6	344	253	237	237
query7	4649	510	314	314
query8	300	255	249	249
query9	8634	2940	2901	2901
query10	497	364	297	297
query11	15772	15012	14686	14686
query12	163	119	115	115
query13	1681	551	424	424
query14	10141	9231	9107	9107
query15	211	199	167	167
query16	7161	671	486	486
query17	1224	752	624	624
query18	2012	432	340	340
query19	208	200	175	175
query20	131	129	122	122
query21	215	140	123	123
query22	4234	4210	4199	4199
query23	33814	33165	32927	32927
query24	8133	2412	2446	2412
query25	634	554	492	492
query26	1261	277	167	167
query27	2723	510	343	343
query28	4519	2274	2238	2238
query29	843	613	500	500
query30	295	225	196	196
query31	887	798	729	729
query32	92	77	81	77
query33	560	394	358	358
query34	799	863	534	534
query35	865	821	760	760
query36	991	1036	915	915
query37	127	157	91	91
query38	3512	3522	3499	3499
query39	1503	1422	1444	1422
query40	223	136	130	130
query41	65	59	59	59
query42	131	118	118	118
query43	548	505	462	462
query44	1358	879	863	863
query45	180	178	185	178
query46	860	1015	651	651
query47	1779	1805	1743	1743
query48	441	417	319	319
query49	733	511	417	417
query50	650	691	409	409
query51	3890	3911	3869	3869
query52	113	112	108	108
query53	245	278	197	197
query54	637	603	552	552
query55	99	89	91	89
query56	342	342	358	342
query57	1166	1208	1135	1135
query58	297	282	275	275
query59	2510	2706	2516	2516
query60	370	355	349	349
query61	162	156	158	156
query62	831	732	664	664
query63	236	199	197	197
query64	4528	1152	831	831
query65	4109	3951	3955	3951
query66	1178	422	352	352
query67	15648	15154	15141	15141
query68	7062	923	581	581
query69	521	338	299	299
query70	1294	1260	1281	1260
query71	541	349	322	322
query72	5949	5004	5026	5004
query73	672	622	361	361
query74	9259	9276	8638	8638
query75	3302	3194	2744	2744
query76	3182	1182	749	749
query77	498	406	338	338
query78	9481	9579	8821	8821
query79	2492	860	598	598
query80	704	608	553	553
query81	480	262	224	224
query82	221	182	136	136
query83	264	272	269	269
query84	269	122	101	101
query85	938	553	550	550
query86	378	328	308	308
query87	3681	3696	3594	3594
query88	3380	2238	2266	2238
query89	399	313	295	295
query90	1978	230	225	225
query91	171	179	130	130
query92	93	74	69	69
query93	2568	982	635	635
query94	692	415	329	329
query95	411	331	346	331
query96	479	590	282	282
query97	2880	2954	2857	2857
query98	253	223	224	223
query99	1364	1398	1312	1312
Total cold run time: 273175 ms
Total hot run time: 188693 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 29.68 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 24cea7b82e378aad454caa7028091bd2cf16ec8e, data reload: false

query1	0.05	0.05	0.05
query2	0.09	0.04	0.06
query3	0.25	0.08	0.08
query4	1.60	0.11	0.11
query5	0.29	0.27	0.25
query6	1.17	0.65	0.62
query7	0.03	0.03	0.03
query8	0.06	0.04	0.04
query9	0.62	0.53	0.52
query10	0.58	0.57	0.58
query11	0.16	0.12	0.10
query12	0.16	0.12	0.12
query13	0.66	0.63	0.61
query14	1.01	1.04	1.02
query15	0.86	0.87	0.85
query16	0.39	0.40	0.41
query17	1.05	1.08	1.07
query18	0.21	0.20	0.20
query19	1.94	1.83	1.91
query20	0.02	0.01	0.01
query21	15.46	0.96	0.58
query22	0.76	0.99	0.62
query23	15.18	1.37	0.66
query24	6.25	1.60	0.63
query25	0.48	0.13	0.09
query26	0.60	0.16	0.13
query27	0.06	0.05	0.06
query28	9.84	0.89	0.44
query29	12.56	3.94	3.26
query30	0.29	0.12	0.11
query31	2.82	0.58	0.39
query32	3.23	0.56	0.48
query33	3.06	3.11	3.11
query34	15.98	5.51	4.85
query35	4.92	4.90	4.93
query36	0.72	0.51	0.50
query37	0.11	0.07	0.07
query38	0.06	0.06	0.04
query39	0.03	0.03	0.04
query40	0.19	0.15	0.14
query41	0.09	0.04	0.03
query42	0.04	0.03	0.02
query43	0.05	0.03	0.03
Total cold run time: 103.98 s
Total hot run time: 29.68 s

@hello-stephen
Copy link
Contributor

BE UT Coverage Report

Increment line coverage 75.56% (68/90) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 51.97% (17303/33296)
Line Coverage 37.31% (157697/422678)
Region Coverage 31.93% (120228/376482)
Branch Coverage 33.33% (52847/158545)

@mrhhsg
Copy link
Member Author

mrhhsg commented Sep 10, 2025

run buildall

@github-actions
Copy link
Contributor

Possible file(s) that should be tracked in LFS detected: 🚨

The following file(s) exceeds the file size limit: 1048576 bytes, as set in the .yml configuration files:

  • regression-test/data/datatype_p0/nested_types/base_cases/one_level_nestedtypes_with_s3data.out

Consider using git-lfs to manage large files.

@doris-robot
Copy link

TPC-H: Total hot run time: 34873 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 23d61a44ee7b7c54645f3f9dbec0cab52e2cd996, data reload: false

------ Round 1 ----------------------------------
q1	17620	5266	5121	5121
q2	2000	343	256	256
q3	10196	1326	757	757
q4	10217	1016	533	533
q5	7541	2404	2383	2383
q6	185	170	135	135
q7	917	769	638	638
q8	9340	1378	1118	1118
q9	7058	5203	5147	5147
q10	6904	2419	1982	1982
q11	504	297	282	282
q12	348	367	228	228
q13	17769	3860	3055	3055
q14	244	246	220	220
q15	544	486	485	485
q16	1004	1015	925	925
q17	585	870	363	363
q18	7406	7318	7092	7092
q19	1231	939	558	558
q20	349	348	280	280
q21	3769	2549	2337	2337
q22	1070	1013	978	978
Total cold run time: 106801 ms
Total hot run time: 34873 ms

----- Round 2, with runtime_filter_mode=off -----
q1	5160	5111	5116	5111
q2	246	323	234	234
q3	2205	2684	2326	2326
q4	1382	1817	1347	1347
q5	4210	4589	4580	4580
q6	234	178	134	134
q7	2039	2006	1849	1849
q8	2717	2640	2571	2571
q9	7484	7561	7228	7228
q10	3131	3302	2893	2893
q11	571	520	537	520
q12	744	770	673	673
q13	3524	3921	3377	3377
q14	299	335	297	297
q15	536	484	479	479
q16	1120	1119	1062	1062
q17	1186	1574	1376	1376
q18	8071	7894	7626	7626
q19	825	809	858	809
q20	2069	2101	1908	1908
q21	5161	4520	4402	4402
q22	1102	1037	968	968
Total cold run time: 54016 ms
Total hot run time: 51770 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 189719 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 23d61a44ee7b7c54645f3f9dbec0cab52e2cd996, data reload: false

query1	1062	440	412	412
query2	6549	1710	1762	1710
query3	6753	231	232	231
query4	26177	23272	22889	22889
query5	4415	659	509	509
query6	341	248	229	229
query7	4648	522	306	306
query8	295	272	250	250
query9	8647	2939	2994	2939
query10	481	354	304	304
query11	15752	14939	14745	14745
query12	180	124	120	120
query13	1675	561	424	424
query14	10603	9466	9402	9402
query15	216	214	180	180
query16	7668	683	519	519
query17	1264	781	654	654
query18	2058	457	364	364
query19	218	207	190	190
query20	144	129	138	129
query21	221	132	118	118
query22	4069	4187	3981	3981
query23	34239	33155	33048	33048
query24	8235	2397	2442	2397
query25	640	563	496	496
query26	1253	286	180	180
query27	2715	531	356	356
query28	4419	2304	2265	2265
query29	804	632	496	496
query30	296	235	200	200
query31	890	822	702	702
query32	94	83	85	83
query33	584	461	349	349
query34	833	859	518	518
query35	835	833	779	779
query36	981	1058	938	938
query37	128	117	96	96
query38	3542	3513	3521	3513
query39	1511	1437	1453	1437
query40	226	143	128	128
query41	66	62	61	61
query42	133	127	128	127
query43	522	490	462	462
query44	1352	869	866	866
query45	181	176	170	170
query46	866	1042	656	656
query47	1754	1785	1737	1737
query48	417	419	319	319
query49	760	540	418	418
query50	682	697	435	435
query51	3863	3999	3822	3822
query52	120	115	111	111
query53	249	283	198	198
query54	628	619	551	551
query55	100	98	90	90
query56	345	348	331	331
query57	1191	1201	1124	1124
query58	300	293	295	293
query59	2542	2640	2555	2555
query60	360	366	356	356
query61	171	164	161	161
query62	825	729	676	676
query63	238	200	200	200
query64	4516	1161	841	841
query65	4075	4025	4054	4025
query66	1196	448	362	362
query67	15355	15322	15273	15273
query68	8797	979	612	612
query69	512	351	304	304
query70	1423	1367	1341	1341
query71	593	355	328	328
query72	6014	5009	5117	5009
query73	737	614	362	362
query74	8921	8943	8876	8876
query75	4386	3262	2861	2861
query76	3714	1163	772	772
query77	819	427	338	338
query78	9750	9951	8830	8830
query79	2347	818	598	598
query80	663	586	540	540
query81	476	263	225	225
query82	438	168	140	140
query83	295	272	254	254
query84	300	108	99	99
query85	897	475	478	475
query86	353	337	329	329
query87	3779	3673	3645	3645
query88	3136	2223	2224	2223
query89	408	347	302	302
query90	1930	239	239	239
query91	165	173	138	138
query92	98	78	83	78
query93	1716	1003	651	651
query94	695	424	335	335
query95	413	343	342	342
query96	489	588	283	283
query97	2959	2998	2863	2863
query98	257	229	222	222
query99	1466	1417	1299	1299
Total cold run time: 277414 ms
Total hot run time: 189719 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 30.01 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 23d61a44ee7b7c54645f3f9dbec0cab52e2cd996, data reload: false

query1	0.06	0.05	0.05
query2	0.09	0.05	0.06
query3	0.26	0.08	0.08
query4	1.61	0.11	0.11
query5	0.28	0.27	0.26
query6	1.20	0.67	0.64
query7	0.03	0.03	0.03
query8	0.06	0.05	0.05
query9	0.63	0.54	0.53
query10	0.58	0.59	0.58
query11	0.17	0.12	0.12
query12	0.16	0.12	0.12
query13	0.63	0.62	0.62
query14	1.02	1.04	1.02
query15	0.86	0.86	0.85
query16	0.40	0.40	0.41
query17	1.08	1.08	1.06
query18	0.21	0.20	0.20
query19	1.95	1.79	1.82
query20	0.01	0.01	0.01
query21	15.41	0.94	0.60
query22	0.77	1.16	0.81
query23	14.85	1.41	0.65
query24	7.03	1.38	0.60
query25	0.50	0.17	0.10
query26	0.64	0.16	0.13
query27	0.07	0.06	0.05
query28	9.53	0.91	0.43
query29	12.58	3.92	3.28
query30	0.28	0.14	0.11
query31	2.83	0.61	0.40
query32	3.24	0.57	0.47
query33	3.08	3.10	3.10
query34	15.91	5.49	4.92
query35	4.97	4.92	4.91
query36	0.71	0.51	0.52
query37	0.11	0.07	0.07
query38	0.06	0.04	0.05
query39	0.04	0.03	0.03
query40	0.17	0.16	0.13
query41	0.09	0.03	0.03
query42	0.04	0.03	0.03
query43	0.05	0.04	0.04
Total cold run time: 104.25 s
Total hot run time: 30.01 s

@hello-stephen
Copy link
Contributor

FE UT Coverage Report

Increment line coverage 13.89% (5/36) 🎉
Increment coverage report
Complete coverage report

@doris-robot
Copy link

BE UT Coverage Report

Increment line coverage 75.56% (68/90) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 51.97% (17303/33296)
Line Coverage 37.32% (157757/422731)
Region Coverage 31.98% (120430/376626)
Branch Coverage 33.34% (52857/158549)

@hello-stephen
Copy link
Contributor

BE Regression && UT Coverage Report

Increment line coverage 95.56% (86/90) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 70.88% (23173/32695)
Line Coverage 57.27% (241960/422477)
Region Coverage 52.61% (201016/382055)
Branch Coverage 54.28% (86555/159452)

@hello-stephen
Copy link
Contributor

FE Regression Coverage Report

Increment line coverage 94.44% (34/36) 🎉
Increment coverage report
Complete coverage report

@hello-stephen
Copy link
Contributor

BE Regression && UT Coverage Report

Increment line coverage 95.56% (86/90) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 70.85% (23166/32696)
Line Coverage 57.24% (241833/422471)
Region Coverage 52.52% (200702/382126)
Branch Coverage 54.25% (86508/159450)

@hello-stephen
Copy link
Contributor

FE Regression Coverage Report

Increment line coverage 83.33% (30/36) 🎉
Increment coverage report
Complete coverage report

@github-actions
Copy link
Contributor

PR approved by at least one committer and no changes requested.

@github-actions github-actions bot added approved Indicates a PR has been approved by one committer. reviewed labels Sep 11, 2025
@github-actions
Copy link
Contributor

PR approved by anyone and no changes requested.

Copy link
Contributor

@csun5285 csun5285 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@yiguolei yiguolei closed this Sep 11, 2025
@yiguolei yiguolei reopened this Sep 11, 2025
@github-actions
Copy link
Contributor

Possible file(s) that should be tracked in LFS detected: 🚨

The following file(s) exceeds the file size limit: 1048576 bytes, as set in the .yml configuration files:

  • regression-test/data/datatype_p0/nested_types/base_cases/one_level_nestedtypes_with_s3data.out

Consider using git-lfs to manage large files.

@yiguolei yiguolei merged commit b806bce into apache:master Sep 11, 2025
36 of 42 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by one committer. lfs-detected! Warning Label for use when LFS is detected in the commits of a Pull Request reviewed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants