Skip to content

[Enhencement](Outfile/Export) Export data to csv file format with BOM#30533

Merged
morningman merged 7 commits intoapache:masterfrom
BePPPower:ExportBom
Feb 9, 2024
Merged

[Enhencement](Outfile/Export) Export data to csv file format with BOM#30533
morningman merged 7 commits intoapache:masterfrom
BePPPower:ExportBom

Conversation

@BePPPower
Copy link
Contributor

@BePPPower BePPPower commented Jan 29, 2024

Proposed changes

Issue Number: close #xxx

Description

The UTF8 format of the Windows system has BOM.

We add a new user property to Outfile/Export。Therefore, when exporting Doris data, users can choose whether to bring BOM on the beginning of the CSV file.

Usage:

-- outfile:
select * from demo.student
into outfile "file:///xxx/export/exp_"
format as csv
properties(
    "column_separator" = ",",
    "with_bom" = "true"
);

-- Export:
EXPORT TABLE student TO "file:///xx/tmpdata/export/exp_"
PROPERTIES(
    "format" = "csv",
    "with_bom" = "true"
);

Further comments

If this is a relatively large or complex change, kick off the discussion at dev@doris.apache.org by explaining why you chose the solution you did and what alternatives you considered, etc...

@BePPPower
Copy link
Contributor Author

run buildall

@github-actions
Copy link
Contributor

clang-tidy review says "All clean, LGTM! 👍"

@BePPPower
Copy link
Contributor Author

run buildall

@github-actions
Copy link
Contributor

github-actions bot commented Feb 1, 2024

clang-tidy review says "All clean, LGTM! 👍"

@BePPPower
Copy link
Contributor Author

run buildall

@github-actions
Copy link
Contributor

github-actions bot commented Feb 1, 2024

clang-tidy review says "All clean, LGTM! 👍"

@doris-robot
Copy link

TPC-H: Total hot run time: 37019 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit a913e4af5c98136c8fdee5b46a1e19cc35ca29b2, data reload: false

------ Round 1 ----------------------------------
q1	17679	4686	4443	4443
q2	2043	140	137	137
q3	10594	919	924	919
q4	4635	765	721	721
q5	7663	2773	2769	2769
q6	186	119	119	119
q7	1154	721	700	700
q8	9297	2026	2016	2016
q9	7191	6343	6308	6308
q10	8145	2441	2422	2422
q11	421	226	219	219
q12	745	273	284	273
q13	18018	3333	3333	3333
q14	274	242	247	242
q15	529	487	495	487
q16	484	411	413	411
q17	934	578	510	510
q18	6907	5999	5933	5933
q19	1552	1346	1383	1346
q20	570	339	334	334
q21	6947	3104	3067	3067
q22	821	313	310	310
Total cold run time: 106789 ms
Total hot run time: 37019 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4463	4397	4380	4380
q2	333	229	226	226
q3	2987	2892	2843	2843
q4	1839	1750	1699	1699
q5	5219	5338	5254	5254
q6	193	114	115	114
q7	2164	1832	1820	1820
q8	3125	3254	3255	3254
q9	8401	8328	8352	8328
q10	5882	3511	3571	3511
q11	544	449	446	446
q12	732	585	598	585
q13	15042	3075	3084	3075
q14	271	239	247	239
q15	547	496	490	490
q16	512	464	455	455
q17	1839	1639	1682	1639
q18	8104	7718	7619	7619
q19	11136	1553	1570	1553
q20	2146	1906	1896	1896
q21	4953	4711	4642	4642
q22	570	483	476	476
Total cold run time: 81002 ms
Total hot run time: 54544 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 174733 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit a913e4af5c98136c8fdee5b46a1e19cc35ca29b2, data reload: false

query1	932	339	325	325
query2	6567	2017	1861	1861
query3	6721	200	199	199
query4	31703	22087	22029	22029
query5	4459	364	351	351
query6	248	155	161	155
query7	4617	267	255	255
query8	233	176	179	176
query9	8887	2303	2289	2289
query10	411	204	203	203
query11	18797	15402	15373	15373
query12	125	69	68	68
query13	1637	381	396	381
query14	9746	7476	7097	7097
query15	212	177	182	177
query16	7182	273	251	251
query17	1837	489	462	462
query18	1930	260	261	260
query19	179	131	134	131
query20	75	72	67	67
query21	200	138	132	132
query22	4886	4783	4560	4560
query23	31008	30372	30271	30271
query24	6667	2776	2815	2776
query25	474	318	311	311
query26	712	140	142	140
query27	1901	288	280	280
query28	3713	1904	1868	1868
query29	902	631	609	609
query30	209	131	135	131
query31	910	718	736	718
query32	93	55	51	51
query33	415	214	215	214
query34	806	467	476	467
query35	846	783	741	741
query36	1290	1214	1188	1188
query37	97	60	58	58
query38	3298	3138	3150	3138
query39	1304	1270	1249	1249
query40	194	87	88	87
query41	38	41	34	34
query42	87	81	81	81
query43	499	485	460	460
query44	1029	689	709	689
query45	190	182	174	174
query46	1036	618	622	618
query47	1585	1415	1480	1415
query48	410	296	314	296
query49	615	283	286	283
query50	678	311	324	311
query51	5267	5127	5146	5127
query52	87	85	78	78
query53	329	283	259	259
query54	228	187	187	187
query55	82	75	79	75
query56	182	165	166	165
query57	973	889	892	889
query58	190	162	162	162
query59	2353	2437	2147	2147
query60	210	174	177	174
query61	89	84	85	84
query62	488	351	351	351
query63	274	267	266	266
query64	4274	3691	3517	3517
query65	3237	3236	3264	3236
query66	930	327	307	307
query67	14137	14296	14124	14124
query68	4769	512	529	512
query69	459	324	315	315
query70	1558	1462	1499	1462
query71	302	210	217	210
query72	5894	3132	2838	2838
query73	687	320	317	317
query74	6777	6282	6207	6207
query75	2917	2295	2296	2295
query76	2886	976	965	965
query77	398	229	231	229
query78	9175	8662	8411	8411
query79	2742	490	499	490
query80	2243	319	318	318
query81	518	197	202	197
query82	808	85	80	80
query83	275	119	111	111
query84	280	73	74	73
query85	1965	353	329	329
query86	518	396	400	396
query87	3478	3232	3262	3232
query88	3849	2171	2169	2169
query89	416	362	342	342
query90	2031	186	181	181
query91	150	117	119	117
query92	57	48	43	43
query93	4448	435	453	435
query94	1318	157	161	157
query95	507	446	454	446
query96	631	315	321	315
query97	4232	4082	4147	4082
query98	209	189	182	182
query99	1061	676	669	669
Total cold run time: 273478 ms
Total hot run time: 174733 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 32.15 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit a913e4af5c98136c8fdee5b46a1e19cc35ca29b2, data reload: false

query1	0.04	0.03	0.03
query2	0.06	0.03	0.02
query3	0.22	0.06	0.06
query4	1.68	0.10	0.09
query5	0.52	0.52	0.52
query6	1.19	0.62	0.65
query7	0.02	0.02	0.01
query8	0.04	0.02	0.02
query9	0.56	0.51	0.49
query10	0.56	0.55	0.55
query11	0.11	0.08	0.09
query12	0.10	0.09	0.10
query13	0.61	0.62	0.60
query14	0.77	0.80	0.80
query15	0.79	0.77	0.78
query16	0.39	0.38	0.38
query17	0.97	0.97	1.01
query18	0.20	0.27	0.25
query19	1.87	1.77	1.80
query20	0.01	0.01	0.02
query21	15.40	0.58	0.61
query22	2.57	2.55	2.47
query23	17.33	0.73	0.75
query24	2.24	1.40	1.64
query25	0.34	0.22	0.17
query26	0.59	0.13	0.14
query27	0.05	0.05	0.04
query28	11.44	0.85	0.84
query29	12.49	3.16	3.19
query30	0.64	0.54	0.54
query31	2.79	0.36	0.36
query32	3.37	0.48	0.48
query33	3.21	3.21	3.25
query34	15.69	4.33	4.32
query35	4.33	4.37	4.26
query36	1.09	1.05	1.06
query37	0.06	0.05	0.05
query38	0.04	0.03	0.03
query39	0.02	0.02	0.01
query40	0.15	0.14	0.13
query41	0.07	0.02	0.02
query42	0.03	0.02	0.02
query43	0.03	0.02	0.02
Total cold run time: 104.68 s
Total hot run time: 32.15 s

@doris-robot
Copy link

Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit a913e4af5c98136c8fdee5b46a1e19cc35ca29b2 with default session variables
Stream load json:         18 seconds loaded 2358488459 Bytes, about 124 MB/s
Stream load orc:          58 seconds loaded 1101869774 Bytes, about 18 MB/s
Stream load parquet:      31 seconds loaded 861443392 Bytes, about 26 MB/s
Insert into select:       14.0 seconds inserted 10000000 Rows, about 714K ops/s

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.17% (8627/23850)
Line Coverage: 28.23% (70516/249783)
Region Coverage: 27.23% (36385/133635)
Branch Coverage: 24.01% (18634/77598)
Coverage Report: http://coverage.selectdb-in.cc/coverage/a913e4af5c98136c8fdee5b46a1e19cc35ca29b2_a913e4af5c98136c8fdee5b46a1e19cc35ca29b2/report/index.html

@BePPPower
Copy link
Contributor Author

run buildall

@github-actions
Copy link
Contributor

github-actions bot commented Feb 4, 2024

clang-tidy review says "All clean, LGTM! 👍"

@doris-robot
Copy link

TPC-H: Total hot run time: 37277 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit ecf573dc5edb0ea217e92bed79c2dcdbee650019, data reload: false

------ Round 1 ----------------------------------
q1	18313	5004	4498	4498
q2	2464	151	139	139
q3	11646	947	943	943
q4	5040	761	747	747
q5	7801	2883	2920	2883
q6	187	122	122	122
q7	1160	767	751	751
q8	9443	2025	2076	2025
q9	7780	6379	6353	6353
q10	8144	2430	2425	2425
q11	424	229	193	193
q12	784	274	292	274
q13	18024	3293	3337	3293
q14	284	257	253	253
q15	530	495	500	495
q16	494	409	422	409
q17	950	623	508	508
q18	6826	6064	5877	5877
q19	1577	1361	1380	1361
q20	593	339	360	339
q21	6869	3095	3144	3095
q22	815	300	294	294
Total cold run time: 110148 ms
Total hot run time: 37277 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4462	4371	4382	4371
q2	325	240	237	237
q3	3078	2900	2814	2814
q4	1867	1626	1694	1626
q5	5200	5276	5255	5255
q6	193	114	114	114
q7	2115	1789	1756	1756
q8	3110	3236	3286	3236
q9	8371	8290	8311	8290
q10	5854	3535	3552	3535
q11	535	443	463	443
q12	756	585	571	571
q13	8967	3098	3105	3098
q14	287	247	263	247
q15	537	497	489	489
q16	524	485	474	474
q17	1876	1643	1663	1643
q18	7958	7882	7576	7576
q19	8701	1539	1565	1539
q20	2133	1915	1910	1910
q21	4989	4625	4547	4547
q22	555	472	448	448
Total cold run time: 72393 ms
Total hot run time: 54219 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 174108 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit ecf573dc5edb0ea217e92bed79c2dcdbee650019, data reload: false

query1	927	349	340	340
query2	6519	1905	1887	1887
query3	6697	202	204	202
query4	31897	22009	21963	21963
query5	4313	410	423	410
query6	256	172	166	166
query7	4618	283	274	274
query8	253	181	176	176
query9	9023	2341	2324	2324
query10	411	227	214	214
query11	18507	15648	15497	15497
query12	128	80	74	74
query13	1648	426	430	426
query14	9561	6322	6433	6322
query15	234	173	178	173
query16	8086	257	245	245
query17	1855	535	509	509
query18	2108	261	261	261
query19	213	137	142	137
query20	80	78	75	75
query21	198	130	124	124
query22	5027	4769	4655	4655
query23	31035	30112	30092	30092
query24	10599	2788	2754	2754
query25	588	351	339	339
query26	1487	145	157	145
query27	3052	295	302	295
query28	7590	1863	1868	1863
query29	925	612	605	605
query30	279	135	135	135
query31	913	690	718	690
query32	90	57	52	52
query33	733	223	231	223
query34	1145	465	473	465
query35	844	743	765	743
query36	1076	945	947	945
query37	120	61	66	61
query38	3233	3107	3190	3107
query39	1333	1254	1241	1241
query40	269	91	95	91
query41	36	37	35	35
query42	98	92	93	92
query43	527	489	490	489
query44	1092	700	706	700
query45	194	177	181	177
query46	1065	641	636	636
query47	1573	1509	1436	1436
query48	430	356	370	356
query49	1228	286	286	286
query50	759	381	377	377
query51	5276	5145	5210	5145
query52	98	89	88	88
query53	345	275	263	263
query54	276	216	220	216
query55	79	76	77	76
query56	227	199	200	199
query57	958	927	905	905
query58	205	176	176	176
query59	2520	2355	2302	2302
query60	248	212	212	212
query61	86	81	87	81
query62	639	366	363	363
query63	293	279	278	278
query64	6150	3704	3415	3415
query65	3245	3217	3231	3217
query66	1063	305	309	305
query67	14428	14073	14071	14071
query68	4196	544	532	532
query69	466	333	322	322
query70	1273	1247	1245	1245
query71	317	252	246	246
query72	6009	2884	2654	2654
query73	699	314	328	314
query74	6523	6247	6328	6247
query75	3023	2382	2315	2315
query76	2533	922	925	922
query77	410	231	224	224
query78	9134	8764	8508	8508
query79	3045	488	487	487
query80	2044	354	342	342
query81	531	197	195	195
query82	819	84	82	82
query83	262	124	130	124
query84	283	86	84	84
query85	2345	330	321	321
query86	486	294	283	283
query87	3481	3269	3217	3217
query88	4104	2332	2342	2332
query89	450	368	376	368
query90	2027	172	168	168
query91	165	118	121	118
query92	53	43	43	43
query93	3315	503	490	490
query94	1323	175	174	174
query95	8020	362	7814	362
query96	604	278	277	277
query97	4272	4101	4131	4101
query98	220	200	187	187
query99	1111	706	628	628
Total cold run time: 294551 ms
Total hot run time: 174108 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 31.12 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit ecf573dc5edb0ea217e92bed79c2dcdbee650019, data reload: false

query1	0.04	0.03	0.03
query2	0.06	0.02	0.02
query3	0.23	0.06	0.06
query4	1.64	0.09	0.10
query5	0.53	0.51	0.52
query6	1.18	0.64	0.64
query7	0.02	0.01	0.02
query8	0.04	0.03	0.02
query9	0.53	0.50	0.50
query10	0.55	0.54	0.55
query11	0.12	0.08	0.09
query12	0.11	0.09	0.09
query13	0.60	0.61	0.62
query14	0.81	0.79	0.80
query15	0.78	0.77	0.78
query16	0.37	0.39	0.39
query17	1.01	1.04	1.00
query18	0.21	0.26	0.26
query19	1.90	1.80	1.83
query20	0.01	0.02	0.02
query21	15.43	0.57	0.57
query22	2.63	2.89	2.10
query23	17.11	0.82	0.78
query24	2.81	1.43	0.84
query25	0.27	0.25	0.06
query26	0.59	0.14	0.14
query27	0.06	0.06	0.04
query28	11.14	0.84	0.82
query29	12.52	3.27	3.16
query30	0.64	0.54	0.51
query31	2.80	0.35	0.35
query32	3.34	0.48	0.48
query33	3.21	3.20	3.21
query34	15.76	4.38	4.27
query35	4.31	4.27	4.26
query36	1.08	1.04	1.07
query37	0.07	0.05	0.05
query38	0.04	0.03	0.02
query39	0.02	0.01	0.02
query40	0.17	0.13	0.16
query41	0.08	0.02	0.02
query42	0.03	0.02	0.02
query43	0.03	0.02	0.02
Total cold run time: 104.88 s
Total hot run time: 31.12 s

@doris-robot
Copy link

Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit ecf573dc5edb0ea217e92bed79c2dcdbee650019 with default session variables
Stream load json:         19 seconds loaded 2358488459 Bytes, about 118 MB/s
Stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
Stream load parquet:      33 seconds loaded 861443392 Bytes, about 24 MB/s
Insert into select:       13.4 seconds inserted 10000000 Rows, about 746K ops/s

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.16% (8628/23859)
Line Coverage: 28.22% (70552/250026)
Region Coverage: 27.23% (36411/133699)
Branch Coverage: 24.02% (18648/77638)
Coverage Report: http://coverage.selectdb-in.cc/coverage/ecf573dc5edb0ea217e92bed79c2dcdbee650019_ecf573dc5edb0ea217e92bed79c2dcdbee650019/report/index.html

@BePPPower
Copy link
Contributor Author

run buildall

@github-actions
Copy link
Contributor

github-actions bot commented Feb 6, 2024

clang-tidy review says "All clean, LGTM! 👍"

@doris-robot
Copy link

TPC-H: Total hot run time: 37075 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 9e7180a8d2e52d3c934f98f3001aadf9317a96d1, data reload: false

------ Round 1 ----------------------------------
q1	17631	4598	4470	4470
q2	2045	151	137	137
q3	10648	951	915	915
q4	4646	789	709	709
q5	7700	2783	2799	2783
q6	185	122	121	121
q7	1182	733	738	733
q8	9298	1997	2020	1997
q9	7265	6371	6381	6371
q10	8111	2441	2444	2441
q11	410	209	208	208
q12	752	274	281	274
q13	18023	3331	3272	3272
q14	277	251	239	239
q15	534	504	491	491
q16	497	408	408	408
q17	944	481	610	481
q18	6928	5955	5970	5955
q19	1591	1382	1358	1358
q20	595	359	328	328
q21	6841	3096	3154	3096
q22	806	307	288	288
Total cold run time: 106909 ms
Total hot run time: 37075 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4378	4396	4377	4377
q2	338	229	229	229
q3	2982	2909	2858	2858
q4	1809	1666	1608	1608
q5	5188	5271	5346	5271
q6	191	112	115	112
q7	2139	1747	1814	1747
q8	3129	3266	3242	3242
q9	8388	8360	8268	8268
q10	5924	3619	3578	3578
q11	549	445	476	445
q12	761	557	585	557
q13	12935	3071	3098	3071
q14	277	255	263	255
q15	533	488	494	488
q16	512	487	478	478
q17	1863	1667	1681	1667
q18	8091	7718	7578	7578
q19	10924	1523	1497	1497
q20	2116	1904	1884	1884
q21	4825	4597	4585	4585
q22	566	450	470	450
Total cold run time: 78418 ms
Total hot run time: 54245 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 174095 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 9e7180a8d2e52d3c934f98f3001aadf9317a96d1, data reload: false

query1	927	353	336	336
query2	6540	1966	1842	1842
query3	6706	210	205	205
query4	31654	22139	22089	22089
query5	4296	355	350	350
query6	255	167	169	167
query7	4603	298	280	280
query8	255	173	172	172
query9	9034	2309	2280	2280
query10	418	208	213	208
query11	18414	15409	15361	15361
query12	129	79	78	78
query13	1623	416	412	412
query14	10119	6732	6855	6732
query15	254	174	176	174
query16	8154	267	259	259
query17	1889	514	495	495
query18	2109	307	258	258
query19	355	136	139	136
query20	82	76	77	76
query21	198	128	122	122
query22	4690	4562	4539	4539
query23	31342	29988	30028	29988
query24	7973	2778	2784	2778
query25	540	336	347	336
query26	718	146	149	146
query27	2145	300	302	300
query28	5554	1871	1840	1840
query29	851	616	597	597
query30	280	131	138	131
query31	921	701	722	701
query32	93	55	49	49
query33	524	214	213	213
query34	841	458	462	458
query35	870	764	762	762
query36	962	954	943	943
query37	97	60	56	56
query38	3194	3126	3137	3126
query39	1319	1246	1244	1244
query40	187	96	90	90
query41	43	36	34	34
query42	100	93	93	93
query43	495	456	486	456
query44	1094	699	712	699
query45	196	176	171	171
query46	1031	670	649	649
query47	1595	1530	1541	1530
query48	421	348	349	348
query49	1045	289	280	280
query50	783	392	386	386
query51	5269	5172	5184	5172
query52	95	88	85	85
query53	332	267	283	267
query54	249	213	222	213
query55	91	80	75	75
query56	217	192	193	192
query57	976	940	913	913
query58	196	177	183	177
query59	2283	2321	2416	2321
query60	245	215	213	213
query61	90	86	85	85
query62	598	353	356	353
query63	304	272	274	272
query64	4804	3437	3715	3437
query65	3255	3219	3231	3219
query66	827	308	304	304
query67	14633	14100	14302	14100
query68	5515	553	534	534
query69	496	336	320	320
query70	1300	1222	1138	1138
query71	346	267	241	241
query72	6219	2842	2682	2682
query73	690	333	326	326
query74	6621	6203	6174	6174
query75	3124	2406	2322	2322
query76	2991	934	952	934
query77	370	238	230	230
query78	9284	8778	8517	8517
query79	3515	491	495	491
query80	1767	359	344	344
query81	530	199	198	198
query82	1017	88	79	79
query83	249	121	123	121
query84	291	82	89	82
query85	1961	333	323	323
query86	493	299	286	286
query87	3374	3175	3137	3137
query88	4310	2320	2321	2320
query89	455	367	369	367
query90	1977	163	162	162
query91	155	123	123	123
query92	52	43	39	39
query93	5373	462	450	450
query94	1236	180	171	171
query95	8065	7828	358	358
query96	589	278	271	271
query97	4249	4110	4129	4110
query98	217	203	192	192
query99	1190	767	680	680
Total cold run time: 290403 ms
Total hot run time: 174095 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 31.97 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 9e7180a8d2e52d3c934f98f3001aadf9317a96d1, data reload: false

query1	0.04	0.03	0.03
query2	0.06	0.02	0.02
query3	0.23	0.06	0.06
query4	1.67	0.10	0.10
query5	0.53	0.52	0.51
query6	1.21	0.65	0.63
query7	0.02	0.02	0.01
query8	0.03	0.03	0.02
query9	0.55	0.49	0.50
query10	0.56	0.55	0.57
query11	0.12	0.08	0.08
query12	0.11	0.09	0.09
query13	0.60	0.63	0.60
query14	0.79	0.78	0.82
query15	0.80	0.78	0.77
query16	0.38	0.38	0.38
query17	1.04	1.01	0.97
query18	0.25	0.25	0.26
query19	1.89	1.77	1.79
query20	0.02	0.01	0.01
query21	15.40	0.61	0.60
query22	2.64	3.16	2.69
query23	17.39	0.98	0.70
query24	2.48	1.20	1.28
query25	0.29	0.24	0.07
query26	0.68	0.14	0.14
query27	0.06	0.04	0.05
query28	11.01	0.86	0.85
query29	12.50	3.10	3.26
query30	0.65	0.54	0.56
query31	2.78	0.34	0.36
query32	3.37	0.47	0.49
query33	3.21	3.23	3.22
query34	15.76	4.30	4.28
query35	4.25	4.30	4.32
query36	1.10	1.07	1.05
query37	0.07	0.05	0.05
query38	0.03	0.02	0.03
query39	0.02	0.01	0.02
query40	0.16	0.16	0.13
query41	0.07	0.02	0.01
query42	0.02	0.02	0.02
query43	0.03	0.02	0.02
Total cold run time: 104.87 s
Total hot run time: 31.97 s

@doris-robot
Copy link

TPC-H: Total hot run time: 38067 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 2c6eb24f0f6122ae99f01dc6260a35d5f65855dd, data reload: false

------ Round 1 ----------------------------------
q1	17635	5368	4735	4735
q2	2037	140	138	138
q3	10591	977	927	927
q4	4654	775	721	721
q5	7658	2967	2880	2880
q6	201	123	123	123
q7	1239	739	726	726
q8	9321	2207	2225	2207
q9	7492	6566	6546	6546
q10	8158	2435	2384	2384
q11	423	206	216	206
q12	747	274	274	274
q13	18014	3317	3323	3317
q14	285	264	260	260
q15	544	502	480	480
q16	485	405	415	405
q17	997	608	544	544
q18	6852	6056	5854	5854
q19	1750	1673	1522	1522
q20	649	330	352	330
q21	7194	3198	3307	3198
q22	825	310	290	290
Total cold run time: 107751 ms
Total hot run time: 38067 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4849	4803	4703	4703
q2	358	239	233	233
q3	3091	2956	2918	2918
q4	1886	1764	1666	1666
q5	5392	5393	5335	5335
q6	214	118	118	118
q7	2202	1784	1781	1781
q8	3299	3477	3528	3477
q9	8464	8448	8487	8448
q10	5934	3678	3683	3678
q11	575	481	485	481
q12	771	563	580	563
q13	11664	3109	3075	3075
q14	282	259	255	255
q15	542	503	487	487
q16	542	472	497	472
q17	1978	1747	1824	1747
q18	8200	7838	7633	7633
q19	9333	1700	1746	1700
q20	2192	1906	1919	1906
q21	4910	4729	4678	4678
q22	569	462	489	462
Total cold run time: 77247 ms
Total hot run time: 55816 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 173925 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 2c6eb24f0f6122ae99f01dc6260a35d5f65855dd, data reload: false

query1	928	340	338	338
query2	6532	2093	1839	1839
query3	6715	205	198	198
query4	31881	21979	21982	21979
query5	4344	428	443	428
query6	269	181	166	166
query7	4603	289	280	280
query8	243	177	171	171
query9	8895	2260	2243	2243
query10	420	208	212	208
query11	18921	15302	15392	15302
query12	129	74	76	74
query13	1633	434	411	411
query14	9005	7215	6735	6735
query15	289	180	181	180
query16	8148	265	258	258
query17	2071	528	499	499
query18	2100	265	262	262
query19	361	136	135	135
query20	82	82	83	82
query21	188	120	133	120
query22	4837	4585	4681	4585
query23	30838	30112	30152	30112
query24	10394	2799	2742	2742
query25	530	338	334	334
query26	714	147	145	145
query27	2193	304	302	302
query28	5713	1847	1833	1833
query29	879	605	608	605
query30	279	134	138	134
query31	917	698	724	698
query32	89	55	48	48
query33	618	218	216	216
query34	846	448	461	448
query35	865	788	756	756
query36	1048	986	930	930
query37	94	56	61	56
query38	3258	3142	3106	3106
query39	1301	1279	1247	1247
query40	186	97	96	96
query41	39	39	32	32
query42	102	87	88	87
query43	515	476	463	463
query44	1063	681	691	681
query45	197	183	172	172
query46	1064	651	652	651
query47	1562	1527	1444	1444
query48	440	363	363	363
query49	1043	286	291	286
query50	763	371	382	371
query51	5294	5073	5202	5073
query52	101	84	84	84
query53	328	272	282	272
query54	283	213	211	211
query55	80	78	73	73
query56	210	203	195	195
query57	925	887	915	887
query58	205	180	184	180
query59	2399	2392	2255	2255
query60	235	216	211	211
query61	81	81	79	79
query62	661	348	352	348
query63	300	260	278	260
query64	4863	3710	3388	3388
query65	3268	3239	3224	3224
query66	808	304	298	298
query67	14661	14113	14239	14113
query68	5249	564	548	548
query69	475	345	341	341
query70	1316	1217	1247	1217
query71	327	242	249	242
query72	6382	2830	2654	2654
query73	706	335	328	328
query74	6540	6211	6233	6211
query75	3086	2383	2329	2329
query76	3355	955	929	929
query77	348	237	223	223
query78	9062	9013	8490	8490
query79	3307	503	497	497
query80	2078	366	336	336
query81	545	198	198	198
query82	873	83	80	80
query83	259	128	122	122
query84	286	86	82	82
query85	2197	327	324	324
query86	483	292	269	269
query87	3362	3167	3243	3167
query88	4618	2371	2384	2371
query89	459	353	371	353
query90	1938	166	164	164
query91	179	121	138	121
query92	58	47	44	44
query93	5254	486	467	467
query94	1289	175	174	174
query95	7937	361	7888	361
query96	613	286	272	272
query97	4224	4148	4123	4123
query98	220	212	228	212
query99	1132	708	674	674
Total cold run time: 293003 ms
Total hot run time: 173925 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 30.26 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 2c6eb24f0f6122ae99f01dc6260a35d5f65855dd, data reload: false

query1	0.04	0.04	0.03
query2	0.06	0.02	0.02
query3	0.23	0.06	0.06
query4	1.65	0.11	0.10
query5	0.53	0.52	0.52
query6	1.21	0.67	0.65
query7	0.02	0.02	0.01
query8	0.04	0.02	0.02
query9	0.55	0.50	0.49
query10	0.57	0.55	0.56
query11	0.12	0.08	0.08
query12	0.11	0.09	0.09
query13	0.60	0.60	0.62
query14	0.79	0.82	0.81
query15	0.80	0.79	0.78
query16	0.38	0.40	0.39
query17	1.03	1.01	1.00
query18	0.24	0.27	0.25
query19	1.89	1.75	1.81
query20	0.01	0.01	0.02
query21	15.41	0.58	0.57
query22	2.52	3.70	1.58
query23	17.19	0.80	0.86
query24	2.64	0.35	0.43
query25	0.29	0.14	0.14
query26	0.43	0.14	0.15
query27	0.06	0.06	0.06
query28	12.76	0.83	0.87
query29	12.53	3.15	3.21
query30	0.63	0.54	0.54
query31	2.79	0.35	0.36
query32	3.31	0.50	0.47
query33	3.26	3.21	3.27
query34	15.78	4.33	4.26
query35	4.30	4.34	4.27
query36	1.10	1.06	1.06
query37	0.07	0.05	0.06
query38	0.03	0.03	0.03
query39	0.02	0.01	0.02
query40	0.16	0.13	0.13
query41	0.07	0.02	0.01
query42	0.03	0.02	0.02
query43	0.02	0.02	0.02
Total cold run time: 106.27 s
Total hot run time: 30.26 s

@doris-robot
Copy link

Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit 2c6eb24f0f6122ae99f01dc6260a35d5f65855dd with default session variables
Stream load json:         18 seconds loaded 2358488459 Bytes, about 124 MB/s
Stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
Stream load parquet:      31 seconds loaded 861443392 Bytes, about 26 MB/s
Insert into select:       13.7 seconds inserted 10000000 Rows, about 729K ops/s

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.19% (8637/23864)
Line Coverage: 28.20% (70623/250395)
Region Coverage: 27.21% (36441/133946)
Branch Coverage: 23.98% (18664/77830)
Coverage Report: http://coverage.selectdb-in.cc/coverage/2c6eb24f0f6122ae99f01dc6260a35d5f65855dd_2c6eb24f0f6122ae99f01dc6260a35d5f65855dd/report/index.html

@BePPPower
Copy link
Contributor Author

run p0

@BePPPower
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 37057 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 2c6eb24f0f6122ae99f01dc6260a35d5f65855dd, data reload: false

------ Round 1 ----------------------------------
q1	17662	4521	4395	4395
q2	2042	149	136	136
q3	10779	953	958	953
q4	4755	702	740	702
q5	7676	2773	2885	2773
q6	182	120	120	120
q7	1156	733	736	733
q8	9526	2035	2063	2035
q9	7874	6364	6359	6359
q10	8105	2433	2421	2421
q11	408	208	204	204
q12	757	277	290	277
q13	18024	3329	3285	3285
q14	277	245	258	245
q15	530	500	490	490
q16	478	405	419	405
q17	942	558	509	509
q18	6776	6022	6029	6022
q19	1578	1362	1310	1310
q20	646	361	332	332
q21	6539	3054	3163	3054
q22	803	297	304	297
Total cold run time: 107515 ms
Total hot run time: 37057 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4381	4431	4464	4431
q2	327	231	230	230
q3	2961	2866	2819	2819
q4	1824	1652	1605	1605
q5	5176	5199	5255	5199
q6	190	112	115	112
q7	2146	1831	1766	1766
q8	3118	3222	3250	3222
q9	8308	8345	8286	8286
q10	5813	3543	3584	3543
q11	549	461	488	461
q12	762	543	571	543
q13	8442	3097	3061	3061
q14	271	250	263	250
q15	533	487	489	487
q16	514	455	473	455
q17	1836	1690	1671	1671
q18	7911	7730	7606	7606
q19	6949	1519	1515	1515
q20	2126	1908	1916	1908
q21	4701	4530	4583	4530
q22	534	499	458	458
Total cold run time: 69372 ms
Total hot run time: 54158 ms

@doris-robot
Copy link

TeamCity be ut coverage result:
Function Coverage: 36.17% (8638/23883)
Line Coverage: 28.19% (70628/250502)
Region Coverage: 27.18% (36440/134063)
Branch Coverage: 23.96% (18660/77868)
Coverage Report: http://coverage.selectdb-in.cc/coverage/2c6eb24f0f6122ae99f01dc6260a35d5f65855dd_2c6eb24f0f6122ae99f01dc6260a35d5f65855dd/report/index.html

@doris-robot
Copy link

TPC-DS: Total hot run time: 174290 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 2c6eb24f0f6122ae99f01dc6260a35d5f65855dd, data reload: false

query1	915	347	335	335
query2	6525	1938	1922	1922
query3	6738	209	204	204
query4	31914	22049	22035	22035
query5	4262	354	422	354
query6	256	166	165	165
query7	4608	286	283	283
query8	244	187	171	171
query9	8736	2241	2225	2225
query10	414	206	207	206
query11	18587	15411	15324	15324
query12	132	77	73	73
query13	1614	410	411	410
query14	9322	7167	6981	6981
query15	312	176	189	176
query16	8146	263	266	263
query17	2072	537	495	495
query18	2126	262	259	259
query19	346	139	141	139
query20	79	81	77	77
query21	204	121	125	121
query22	4773	4673	4620	4620
query23	31573	30080	30067	30067
query24	10527	2758	2775	2758
query25	573	353	339	339
query26	1424	144	148	144
query27	2933	305	315	305
query28	7514	1822	1806	1806
query29	903	627	601	601
query30	283	130	140	130
query31	901	708	723	708
query32	90	58	53	53
query33	736	229	218	218
query34	1094	449	460	449
query35	859	754	761	754
query36	1027	957	912	912
query37	121	55	58	55
query38	3214	3146	3097	3097
query39	1290	1261	1258	1258
query40	266	95	91	91
query41	37	36	34	34
query42	97	88	84	84
query43	523	490	489	489
query44	1087	669	705	669
query45	193	185	179	179
query46	1038	662	653	653
query47	1559	1513	1477	1477
query48	418	373	362	362
query49	1198	288	278	278
query50	759	383	381	381
query51	5241	5114	5159	5114
query52	106	77	84	77
query53	327	266	273	266
query54	278	207	222	207
query55	77	76	72	72
query56	220	198	196	196
query57	952	910	913	910
query58	206	180	177	177
query59	2469	2371	2345	2345
query60	249	218	209	209
query61	85	87	81	81
query62	674	368	361	361
query63	297	270	264	264
query64	5972	3541	3717	3541
query65	3271	3228	3222	3222
query66	1058	308	313	308
query67	14574	13784	14243	13784
query68	4068	557	541	541
query69	454	321	339	321
query70	1251	1237	1215	1215
query71	326	238	247	238
query72	6034	2867	2656	2656
query73	700	335	315	315
query74	6581	6255	6266	6255
query75	2993	2385	2320	2320
query76	2525	958	932	932
query77	325	226	221	221
query78	9227	8731	8541	8541
query79	3041	494	482	482
query80	2056	348	341	341
query81	520	195	193	193
query82	844	83	77	77
query83	245	129	123	123
query84	283	75	85	75
query85	2239	344	326	326
query86	478	312	317	312
query87	3412	3255	3174	3174
query88	3982	2379	2361	2361
query89	441	366	360	360
query90	2030	161	163	161
query91	150	120	120	120
query92	56	41	46	41
query93	3207	487	472	472
query94	1293	170	172	170
query95	455	355	354	354
query96	596	281	280	280
query97	4271	4112	4114	4112
query98	220	206	193	193
query99	1111	685	694	685
Total cold run time: 286032 ms
Total hot run time: 174290 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 31.26 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 2c6eb24f0f6122ae99f01dc6260a35d5f65855dd, data reload: false

query1	0.04	0.03	0.03
query2	0.06	0.03	0.02
query3	0.22	0.07	0.06
query4	1.67	0.10	0.10
query5	0.53	0.51	0.53
query6	1.29	0.62	0.65
query7	0.02	0.01	0.02
query8	0.04	0.03	0.03
query9	0.55	0.49	0.51
query10	0.56	0.55	0.58
query11	0.12	0.08	0.09
query12	0.10	0.09	0.09
query13	0.62	0.61	0.62
query14	0.79	0.80	0.80
query15	0.80	0.78	0.79
query16	0.40	0.38	0.38
query17	1.03	1.01	1.03
query18	0.24	0.20	0.28
query19	1.88	1.78	1.78
query20	0.01	0.01	0.01
query21	15.48	0.58	0.56
query22	2.38	2.82	2.20
query23	17.44	0.80	0.93
query24	2.47	1.61	0.90
query25	0.40	0.24	0.07
query26	0.42	0.14	0.14
query27	0.04	0.05	0.06
query28	10.89	0.88	0.82
query29	12.59	3.16	3.20
query30	0.60	0.55	0.56
query31	2.80	0.34	0.36
query32	3.35	0.48	0.48
query33	3.23	3.24	3.26
query34	15.84	4.26	4.21
query35	4.27	4.27	4.29
query36	1.08	1.04	1.07
query37	0.06	0.05	0.05
query38	0.04	0.03	0.03
query39	0.02	0.01	0.02
query40	0.17	0.13	0.13
query41	0.07	0.02	0.02
query42	0.03	0.02	0.01
query43	0.02	0.02	0.02
Total cold run time: 104.66 s
Total hot run time: 31.26 s

@doris-robot
Copy link

Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit 2c6eb24f0f6122ae99f01dc6260a35d5f65855dd with default session variables
Stream load json:         19 seconds loaded 2358488459 Bytes, about 118 MB/s
Stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
Stream load parquet:      31 seconds loaded 861443392 Bytes, about 26 MB/s
Insert into select:       14.6 seconds inserted 10000000 Rows, about 684K ops/s

@BePPPower
Copy link
Contributor Author

run feut

Copy link
Contributor

@morningman morningman left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Feb 9, 2024
@github-actions
Copy link
Contributor

github-actions bot commented Feb 9, 2024

PR approved by at least one committer and no changes requested.

@github-actions
Copy link
Contributor

github-actions bot commented Feb 9, 2024

PR approved by anyone and no changes requested.

@morningman morningman merged commit 210b0ec into apache:master Feb 9, 2024
yiguolei pushed a commit that referenced this pull request Feb 16, 2024
…#30533)

The UTF8 format of the Windows system has BOM. 

We add a new user property to `Outfile/Export`。Therefore, when exporting Doris data, users can choose whether to bring BOM on the beginning of the CSV file.

**Usage:**
```sql
-- outfile:
select * from demo.student
into outfile "file:///xxx/export/exp_"
format as csv
properties(
    "column_separator" = ",",
    "with_bom" = "true"
);

-- Export:
EXPORT TABLE student TO "file:///xx/tmpdata/export/exp_"
PROPERTIES(
    "format" = "csv",
    "with_bom" = "true"
);
```
mymeiyi pushed a commit to mymeiyi/doris that referenced this pull request Feb 19, 2024
…apache#30533)

The UTF8 format of the Windows system has BOM. 

We add a new user property to `Outfile/Export`。Therefore, when exporting Doris data, users can choose whether to bring BOM on the beginning of the CSV file.

**Usage:**
```sql
-- outfile:
select * from demo.student
into outfile "file:///xxx/export/exp_"
format as csv
properties(
    "column_separator" = ",",
    "with_bom" = "true"
);

-- Export:
EXPORT TABLE student TO "file:///xx/tmpdata/export/exp_"
PROPERTIES(
    "format" = "csv",
    "with_bom" = "true"
);
```
BePPPower added a commit to BePPPower/doris that referenced this pull request Feb 20, 2024
…apache#30533)

The UTF8 format of the Windows system has BOM.

We add a new user property to `Outfile/Export`。Therefore, when exporting Doris data, users can choose whether to bring BOM on the beginning of the CSV file.

**Usage:**
```sql
-- outfile:
select * from demo.student
into outfile "file:///xxx/export/exp_"
format as csv
properties(
    "column_separator" = ",",
    "with_bom" = "true"
);

-- Export:
EXPORT TABLE student TO "file:///xx/tmpdata/export/exp_"
PROPERTIES(
    "format" = "csv",
    "with_bom" = "true"
);
```
morningman pushed a commit that referenced this pull request Feb 22, 2024
@morningman morningman added dev/2.0.5 usercase Important user case type label dev/2.0.5-merged and removed dev/2.0.5 labels Feb 22, 2024
mongo360 pushed a commit to mongo360/doris that referenced this pull request Aug 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by one committer. dev/2.0.5-merged meta-change reviewed usercase Important user case type label

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants