Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[improvement](statistics)Support identical column name in different index. #32792

Merged
merged 1 commit into from
Mar 28, 2024

Conversation

Jibing-Li
Copy link
Contributor

@Jibing-Li Jibing-Li commented Mar 25, 2024

For each column stats meta in tableStatsMeta, add the index name as key along with column name, so we can tell columns with identical names, and trigger auto analyze after a new mv created.

Before, we keep this map in memory for each table:
ColumnName -> ColumnStatistics

But different index in table may have identical column name. So we change the map to:
Pair<ColumnName, IndexName> -> ColumnStatistics

Further comments

If this is a relatively large or complex change, kick off the discussion at dev@doris.apache.org by explaining why you chose the solution you did and what alternatives you considered, etc...

@doris-robot
Copy link

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

@Jibing-Li Jibing-Li marked this pull request as ready for review March 25, 2024 11:47
@Jibing-Li
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 37917 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit b53a3ed0a0895f8a493d47cd75902f2eafdc0918, data reload: false

------ Round 1 ----------------------------------
q1	16089	4251	4130	4130
q2	2112	155	151	151
q3	10594	1110	1180	1110
q4	10254	755	773	755
q5	7482	3018	3015	3015
q6	205	127	123	123
q7	1029	598	564	564
q8	9387	2037	1967	1967
q9	7257	6624	6547	6547
q10	8476	3468	3564	3468
q11	437	220	222	220
q12	417	199	195	195
q13	17807	2905	2853	2853
q14	244	206	205	205
q15	507	476	465	465
q16	499	373	371	371
q17	957	593	533	533
q18	7325	6537	6445	6445
q19	3505	1424	1448	1424
q20	554	261	248	248
q21	3543	2842	3055	2842
q22	349	301	286	286
Total cold run time: 109029 ms
Total hot run time: 37917 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4116	4132	4063	4063
q2	328	229	235	229
q3	2958	2849	2877	2849
q4	1876	1600	1549	1549
q5	5281	5356	5328	5328
q6	196	116	117	116
q7	2252	1886	1847	1847
q8	3166	3295	3287	3287
q9	8711	8713	8685	8685
q10	3774	3883	4015	3883
q11	564	469	461	461
q12	748	563	568	563
q13	16933	3089	2972	2972
q14	301	279	280	279
q15	506	467	489	467
q16	505	472	458	458
q17	1794	1539	1537	1537
q18	8210	7667	7591	7591
q19	3777	1578	1554	1554
q20	2057	1831	1852	1831
q21	5010	4883	5015	4883
q22	539	448	441	441
Total cold run time: 73602 ms
Total hot run time: 54873 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 182206 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit b53a3ed0a0895f8a493d47cd75902f2eafdc0918, data reload: false

query1	926	357	364	357
query2	6367	2028	1960	1960
query3	6709	211	210	210
query4	31613	21379	21272	21272
query5	4279	403	396	396
query6	266	181	191	181
query7	4632	282	290	282
query8	235	204	166	166
query9	9268	2293	2268	2268
query10	421	243	267	243
query11	17384	14292	14251	14251
query12	139	89	85	85
query13	1643	404	426	404
query14	9516	7756	7461	7461
query15	229	195	195	195
query16	7918	262	258	258
query17	1958	593	548	548
query18	1935	299	274	274
query19	220	154	154	154
query20	94	85	87	85
query21	201	129	128	128
query22	5008	4747	4809	4747
query23	33290	32698	32862	32698
query24	12326	2932	2962	2932
query25	680	396	400	396
query26	1602	163	159	159
query27	3320	381	367	367
query28	7994	1909	1862	1862
query29	1050	708	666	666
query30	321	155	151	151
query31	1005	763	755	755
query32	96	62	59	59
query33	773	252	257	252
query34	1358	500	511	500
query35	935	717	737	717
query36	1067	892	932	892
query37	271	75	68	68
query38	3727	3697	3660	3660
query39	1645	1620	1588	1588
query40	246	108	116	108
query41	49	50	47	47
query42	106	96	99	96
query43	499	460	456	456
query44	1162	743	725	725
query45	294	272	258	258
query46	1156	703	701	701
query47	2048	1912	1924	1912
query48	437	388	356	356
query49	1023	345	349	345
query50	823	391	405	391
query51	6909	6720	6843	6720
query52	105	93	92	92
query53	362	276	280	276
query54	291	238	233	233
query55	87	76	79	76
query56	246	239	242	239
query57	1303	1190	1195	1190
query58	232	208	234	208
query59	2922	2661	2639	2639
query60	247	233	228	228
query61	94	122	93	93
query62	616	442	450	442
query63	297	273	276	273
query64	6189	4063	4039	4039
query65	3061	3027	3049	3027
query66	1249	361	345	345
query67	15583	15120	14770	14770
query68	9932	525	528	525
query69	674	366	376	366
query70	1339	1146	1149	1146
query71	530	267	258	258
query72	6799	2758	2559	2559
query73	1620	324	313	313
query74	7019	6409	6481	6409
query75	4519	2250	2204	2204
query76	5884	913	933	913
query77	599	256	257	256
query78	11023	10165	10103	10103
query79	11748	534	507	507
query80	1947	361	364	361
query81	501	215	214	214
query82	231	81	85	81
query83	210	143	149	143
query84	286	74	78	74
query85	1000	335	303	303
query86	360	296	287	287
query87	3725	3511	3525	3511
query88	5345	2270	2265	2265
query89	482	365	365	365
query90	2057	171	180	171
query91	167	132	135	132
query92	57	47	44	44
query93	6117	481	496	481
query94	1248	182	175	175
query95	403	307	299	299
query96	637	275	266	266
query97	2664	2462	2514	2462
query98	236	213	212	212
query99	1101	869	888	869
Total cold run time: 320517 ms
Total hot run time: 182206 ms

@doris-robot
Copy link

Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit b53a3ed0a0895f8a493d47cd75902f2eafdc0918 with default session variables
Stream load json:         19 seconds loaded 2358488459 Bytes, about 118 MB/s
Stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
Stream load parquet:      33 seconds loaded 861443392 Bytes, about 24 MB/s
Insert into select:       20.4 seconds inserted 10000000 Rows, about 490K ops/s

@Jibing-Li Jibing-Li force-pushed the mvtrigger branch 4 times, most recently from b5b40f3 to 9e9e54b Compare March 26, 2024 04:47
@Jibing-Li
Copy link
Contributor Author

run buildall

@Jibing-Li
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 37614 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit bd148846b7d3a5b97ec5e74f334a14cf6f4ba6b7, data reload: false

------ Round 1 ----------------------------------
q1	17682	4243	4039	4039
q2	2099	160	148	148
q3	10582	1083	1176	1083
q4	10227	756	769	756
q5	7463	3019	2979	2979
q6	200	124	122	122
q7	1028	574	564	564
q8	9333	1999	1947	1947
q9	7132	6568	6546	6546
q10	8432	3459	3544	3459
q11	439	221	214	214
q12	407	199	192	192
q13	17795	2841	2845	2841
q14	241	193	202	193
q15	503	466	463	463
q16	485	375	377	375
q17	937	612	567	567
q18	7041	6411	6352	6352
q19	1523	1405	1453	1405
q20	554	269	258	258
q21	3520	2920	2810	2810
q22	339	301	305	301
Total cold run time: 107962 ms
Total hot run time: 37614 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4106	4049	4016	4016
q2	328	228	234	228
q3	2946	2807	2854	2807
q4	1791	1566	1513	1513
q5	5265	5322	5318	5318
q6	194	117	119	117
q7	2215	1853	1837	1837
q8	3138	3294	3282	3282
q9	8627	8695	8648	8648
q10	3771	3824	3967	3824
q11	554	470	464	464
q12	762	598	597	597
q13	17100	3015	3079	3015
q14	298	266	297	266
q15	526	489	479	479
q16	500	459	465	459
q17	1775	1546	1503	1503
q18	7986	7831	7499	7499
q19	3961	1554	1540	1540
q20	2003	1836	1713	1713
q21	4929	4767	4798	4767
q22	559	466	458	458
Total cold run time: 73334 ms
Total hot run time: 54350 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 181833 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit bd148846b7d3a5b97ec5e74f334a14cf6f4ba6b7, data reload: false

query1	922	356	346	346
query2	6334	2006	1778	1778
query3	6698	207	210	207
query4	31494	21277	21392	21277
query5	4291	402	398	398
query6	285	191	179	179
query7	4644	292	293	292
query8	237	176	172	172
query9	9343	2370	2352	2352
query10	440	262	260	260
query11	15473	14159	14230	14159
query12	140	89	89	89
query13	1633	419	427	419
query14	9865	8052	7423	7423
query15	285	203	198	198
query16	8273	273	266	266
query17	1968	558	532	532
query18	2096	288	281	281
query19	336	150	153	150
query20	92	85	88	85
query21	195	123	123	123
query22	5235	4936	4925	4925
query23	33277	32904	32933	32904
query24	10541	2947	2998	2947
query25	583	396	374	374
query26	693	155	160	155
query27	2308	343	358	343
query28	6217	1894	1930	1894
query29	884	640	666	640
query30	316	169	159	159
query31	997	766	772	766
query32	100	61	55	55
query33	669	251	279	251
query34	1020	487	491	487
query35	916	725	745	725
query36	1058	927	902	902
query37	103	66	68	66
query38	3789	3695	3633	3633
query39	1113	1045	1037	1037
query40	171	110	117	110
query41	54	50	46	46
query42	109	98	95	95
query43	487	452	464	452
query44	1109	744	741	741
query45	295	277	267	267
query46	1127	719	715	715
query47	2085	1987	1935	1935
query48	474	373	375	373
query49	856	354	329	329
query50	807	392	390	390
query51	7010	6891	6970	6891
query52	102	102	91	91
query53	363	278	272	272
query54	320	247	246	246
query55	101	81	78	78
query56	285	248	246	246
query57	1304	1237	1203	1203
query58	265	245	220	220
query59	2891	2646	2608	2608
query60	272	256	239	239
query61	94	98	91	91
query62	597	456	451	451
query63	299	270	266	266
query64	4844	3955	3909	3909
query65	3073	2998	3019	2998
query66	782	352	345	345
query67	15734	14850	14690	14690
query68	9910	536	550	536
query69	657	382	372	372
query70	1394	1084	1087	1084
query71	475	263	261	261
query72	6780	2691	2570	2570
query73	1577	329	324	324
query74	8160	6433	6434	6433
query75	4152	2197	2199	2197
query76	5196	900	828	828
query77	587	258	251	251
query78	10863	10082	10075	10075
query79	10480	540	519	519
query80	2029	373	370	370
query81	489	217	215	215
query82	239	81	83	81
query83	203	144	143	143
query84	287	75	80	75
query85	904	318	298	298
query86	338	318	325	318
query87	3739	3504	3555	3504
query88	5026	2360	2357	2357
query89	475	364	361	361
query90	1881	174	170	170
query91	167	134	134	134
query92	60	46	51	46
query93	6033	496	486	486
query94	1222	178	182	178
query95	410	309	308	308
query96	618	269	265	265
query97	2685	2461	2494	2461
query98	225	217	211	211
query99	1064	856	886	856
Total cold run time: 309098 ms
Total hot run time: 181833 ms

@doris-robot
Copy link

Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit bd148846b7d3a5b97ec5e74f334a14cf6f4ba6b7 with default session variables
Stream load json:         18 seconds loaded 2358488459 Bytes, about 124 MB/s
Stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
Stream load parquet:      33 seconds loaded 861443392 Bytes, about 24 MB/s
Insert into select:       21.7 seconds inserted 10000000 Rows, about 460K ops/s

Comment on lines +174 to +168
public void convertDeprecatedColStatsToNewVersion() {
deprecatedColNameToColStatsMeta = null;
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not impl?

Copy link
Contributor Author

@Jibing-Li Jibing-Li Mar 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't feel it's worth to implement this. The older version only have table id in memory, so it's not easy to find the table during replay metadata, we need to go through all catalogs->dbs to find it.
The downside of not implementing this is auto analyze need to collect stats again for all tables. But the old stats already collected are still available.

@Jibing-Li Jibing-Li force-pushed the mvtrigger branch 2 times, most recently from a7e57dd to 01763fa Compare March 27, 2024 03:14
@Jibing-Li
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 38103 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit f39d9c3c223e765583656d221866abb6b84b75d2, data reload: false

------ Round 1 ----------------------------------
q1	17648	4465	4197	4197
q2	2111	165	156	156
q3	10571	1144	1179	1144
q4	10240	789	798	789
q5	7484	3004	2934	2934
q6	205	124	125	124
q7	1030	576	561	561
q8	9348	2033	1966	1966
q9	7284	6725	6642	6642
q10	8439	3483	3554	3483
q11	449	235	225	225
q12	412	204	191	191
q13	17800	2883	2834	2834
q14	237	207	206	206
q15	527	474	472	472
q16	505	379	375	375
q17	962	589	567	567
q18	7166	6624	6343	6343
q19	3065	1429	1464	1429
q20	549	250	238	238
q21	3636	2941	2930	2930
q22	345	297	313	297
Total cold run time: 110013 ms
Total hot run time: 38103 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4223	4107	4057	4057
q2	326	240	231	231
q3	2989	2904	2876	2876
q4	1835	1524	1503	1503
q5	5332	5351	5331	5331
q6	194	117	121	117
q7	2286	1839	1877	1839
q8	3157	3275	3273	3273
q9	8709	8781	8760	8760
q10	3845	4011	3966	3966
q11	574	492	466	466
q12	777	597	639	597
q13	17212	3150	3035	3035
q14	294	271	266	266
q15	530	478	487	478
q16	518	439	449	439
q17	1846	1570	1568	1568
q18	8074	7596	7554	7554
q19	1679	1585	1541	1541
q20	1968	1741	1826	1741
q21	5138	4956	4899	4899
q22	567	475	464	464
Total cold run time: 72073 ms
Total hot run time: 55001 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 183089 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit f39d9c3c223e765583656d221866abb6b84b75d2, data reload: false

query1	936	379	366	366
query2	6265	2118	1969	1969
query3	6699	212	214	212
query4	31637	21450	21460	21450
query5	4329	403	398	398
query6	281	204	190	190
query7	4645	303	296	296
query8	233	178	168	168
query9	9349	2278	2304	2278
query10	438	250	260	250
query11	17389	14221	14195	14195
query12	133	86	90	86
query13	1624	419	421	419
query14	9851	8024	7277	7277
query15	267	188	197	188
query16	8179	269	272	269
query17	1944	586	568	568
query18	2141	320	309	309
query19	298	160	168	160
query20	101	95	93	93
query21	203	142	142	142
query22	5203	5074	4998	4998
query23	33600	32790	33131	32790
query24	11128	2963	2979	2963
query25	598	411	402	402
query26	715	167	164	164
query27	2327	381	387	381
query28	5982	1925	1962	1925
query29	948	708	663	663
query30	324	166	166	166
query31	1014	793	789	789
query32	97	61	59	59
query33	724	254	259	254
query34	1153	518	510	510
query35	912	701	738	701
query36	1054	909	891	891
query37	103	68	69	68
query38	3790	3690	3744	3690
query39	1704	1606	1613	1606
query40	178	115	123	115
query41	49	50	51	50
query42	115	109	105	105
query43	497	460	445	445
query44	1187	779	768	768
query45	267	265	251	251
query46	1155	722	738	722
query47	2048	1963	1947	1947
query48	489	370	376	370
query49	883	352	376	352
query50	798	394	390	390
query51	6991	6751	6791	6751
query52	115	96	99	96
query53	360	285	287	285
query54	313	245	231	231
query55	92	83	78	78
query56	252	244	244	244
query57	1332	1190	1168	1168
query58	247	225	217	217
query59	2931	2414	2461	2414
query60	255	242	232	232
query61	92	90	92	90
query62	619	444	441	441
query63	297	270	271	270
query64	4919	4206	3693	3693
query65	3103	3027	3042	3027
query66	796	346	343	343
query67	15565	14800	14839	14800
query68	9610	529	552	529
query69	667	377	373	373
query70	1317	1123	1173	1123
query71	535	260	263	260
query72	6796	2714	2572	2572
query73	1623	335	325	325
query74	6949	6570	6508	6508
query75	4525	2236	2216	2216
query76	5842	929	972	929
query77	597	259	253	253
query78	11141	10279	10374	10279
query79	11844	562	545	545
query80	1627	371	377	371
query81	524	228	225	225
query82	251	86	87	86
query83	200	150	148	148
query84	291	78	82	78
query85	929	325	300	300
query86	340	280	291	280
query87	3778	3558	3594	3558
query88	5322	2371	2366	2366
query89	491	382	376	376
query90	1997	179	175	175
query91	169	135	131	131
query92	60	45	47	45
query93	6047	498	497	497
query94	1287	176	175	175
query95	418	317	320	317
query96	621	275	277	275
query97	2725	2492	2501	2492
query98	225	218	219	218
query99	1071	866	874	866
Total cold run time: 314082 ms
Total hot run time: 183089 ms

@doris-robot
Copy link

Load test result on machine: 'aliyun_ecs.c7a.8xlarge_32C64G'

Load test result on commit f39d9c3c223e765583656d221866abb6b84b75d2 with default session variables
Stream load json:         18 seconds loaded 2358488459 Bytes, about 124 MB/s
Stream load orc:          58 seconds loaded 1101869774 Bytes, about 18 MB/s
Stream load parquet:      32 seconds loaded 861443392 Bytes, about 25 MB/s
Insert into select:       13.9 seconds inserted 10000000 Rows, about 719K ops/s

@Jibing-Li
Copy link
Contributor Author

run external

@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Mar 28, 2024
Copy link
Contributor

PR approved by at least one committer and no changes requested.

Copy link
Contributor

PR approved by anyone and no changes requested.

@Jibing-Li Jibing-Li merged commit 4c7885b into apache:master Mar 28, 2024
27 of 30 checks passed
Jibing-Li added a commit that referenced this pull request Mar 29, 2024
* [fix](merge cloud) Fix cloud be set be tag map (#32864)

* [chore] Add gavinchou to collaborators (#32881)

* [chore](show) support statement to show views from table (#32358)

MySQL [test]> show views;
+----------------+
| Tables_in_test |
+----------------+
| t1_view        |
| t2_view        |
+----------------+
2 rows in set (0.00 sec)

MySQL [test]> show views like '%t1%';
+----------------+
| Tables_in_test |
+----------------+
| t1_view        |
+----------------+
1 row in set (0.01 sec)

MySQL [test]> show views where create_time > '2024-03-18';
+----------------+
| Tables_in_test |
+----------------+
| t2_view        |
+----------------+
1 row in set (0.02 sec)

* [Enhancement](ranger) Disable some permission operations when Ranger or LDAP are enabled (#32538)

Disable some permission operations when Ranger or LDAP are enabled.

* [chore](ci) exclude unstable trino_connector case (#32892)

Co-authored-by: stephen <hello-stephen@qq.com>

* [fix](Nereids) NPE when create table with implicit index type (#32893)

* [improvement](mtmv) Support more join types for query rewriting by materialized view (#32685)

This pattern of rewriting is supported for multi-table joins and supported join types is as following:

INNER JOIN
LEFT OUTER JOIN
RIGHT OUTER JOIN
FULL OUTER JOIN
LEFT SEMI JOIN
RIGHT SEMI JOIN
LEFT ANTI JOIN
RIGHT ANTI JOIN

* [Serde](Variant) support arrow serialization for varint type (#32780)

* [fix](multicatalog) fix no data error when read hive table on cosn (#32815)

Currently, when reading a hive on cosn table, doris return empty result, but the table has data.
iceberg on cosn is ok.
The reason is misuse of cosn's file sytem. according to cosn's doc, its fs.cosn.impl should be org.apache.hadoop.fs.CosFileSystem

* [fix](nereids)EliminateGroupByConstant should replace agg's output after removing constant group by keys (#32878)

* [Fix](executor)Fix regression test for test_active_queries/test_backend_active_tasks #32899

* [fix](iceberg) fix iceberg catalog bug and p2 test cases (#32898)

1. Fix iceberg catalog bug

    This PR #30198 change the logic of `IcebergHMSExternalCatalog.java`,
    to get locationUrl by calling hive metastore's `getCatalog()` method.
    But this method only exists in hive 3+. So it will fail if we using hive 2.x.

    I temporary remove this logic, because this logic is only used from iceberg table writing.
    Which is still under development. We will rethink this logic later.

2. Fix test cases

    Some of P2 test cases missed `order_qt`. And because the output format of the floating point
    type is changed, some result in `out` files need to be regenerated.

* [revert](jni) revert part of #32455 (#32904)

* [fix](spill) Avoid releasing resources while spill tasks are executing (#32783)

* [chore](log) print query id before logging profile in be.INFO (#32922)

* [fix](grace-exit) Stop incorrectly of reportwork cause heap use after free #32929

* [improvement](decommission be) decommission check replica num (#32748)

* [fix](arrow-flight) Fix reach limit of connections error (#32911)

Fix Reach limit of connections error
in fe.conf , arrow_flight_token_cache_size is mandatory less than qe_max_connection/2. arrow flight sql is a stateless protocol, connection is usually not actively disconnected, bearer token is evict from the cache will unregister ConnectContext.

Fix ConnectContext.command not be reset to COM_SLEEP in time, this will result in frequent kill connection after query timeout.

Fix bearer token evict log and exception.

TODO: use arrow flight session: https://mail.google.com/mail/u/0/#inbox/FMfcgzGxRdxBLQLTcvvtRpqsvmhrHpdH

* [bugfix](cloud) few variable not initialized (#32868)

../../cloud/src/recycler/meta_checker.cpp
can cause uninitialised memory read.

* [fix](arrow-flight) Fix arrow flight sql compatible with JDK 17 and upgrade arrow 15.0.2 (#32796)

--add-opens=java.base/java.nio=ALL-UNNAMED, see: https://arrow.apache.org/docs/java/install.html#java-compatibility
groovy use flight sql connection to execute query SUM(MAX(c1) OVER (PARTITION BY)) report error: AGGREGATE clause must not contain analytic expressions, but no problem in Java execute it with jdbc::arrow-flight-sql.
groovy not support print arrow array type, throw IndexOutOfBoundsException.
"arrow_flight_sql" not support two phase read
./run-regression-test.sh --run --clean -g arrow_flight_sql

* [fix](spill) SpillStream's writer maybe may not have been finalized (#32931)

* [improvement](spill) Disable DistinctStreamingAgg when spill is enabled (#32932)

* [Improve](inverted_index) update clucene and improve array inverted index writer  (#32436)

* [Performance](exec) replace SipHash in function by XXHash (#32919)

* [feature](agg) add aggregate function sum0 (#32541)

* [improvement](mtmv) Support to get tables in materialized view when collecting table in plan (#32797)

Support to get tables in materialized view when collecting table in plan

table scehma as fllowing:

create materialized view mv1
BUILD IMMEDIATE REFRESH COMPLETE ON MANUAL
DISTRIBUTED BY RANDOM BUCKETS 1 
PROPERTIES ('replication_num' = '1')
 as 
select 
  t1.c1, 
  t3.c2 
from 
  table1 t1 
  inner join table3 t3 on t1.c1 = t3.c2

if get table from the plan as follwoing, we can get [table1, table3, table2], the mv1 is expanded to get base tables;

SELECT 
  mv1.*, 
  uuid() 
FROM 
  mv1 LEFT SEMI 
  JOIN table2 ON mv1.c1 = table2.c1 
WHERE 
  mv1.c1 IN (
    SELECT 
      c1 
    FROM 
      table2
  ) 
  OR mv1.c1 < 10

* [enhance](mtmv)support olap table partition column is null (#32698)

* [enhancement](cloud) add table version to cloud (#32738)

Add table version to cloud.

In Fe:
Get: If Fe is cloud mode, get table version from meta service.
Update: Op drop/replace temp partition, commit transaction.

In meta service:
Add: create Index. init value is 1.
Remove: by recycler.
Update: commit/drop partition rpc, commit txn rpc. Atomic++.

* [fix](cloud) schema change from not null to null (#32913)

1. Use equals instead of == for type comparing
2. null bitmap size is reisze by size of ref column.

* [feature](Nereids): add ColumnPruningPostProcessor. (#32800)

* [case](rowpolicy)fix row policy has been exist (#32880)

* [fix](pipeline) fix use error row desc when origin block clear (#32803)

* [fix](Nereids) support variant column with index when create table (#32948)

* [opt](Nereids) support create table with variant type (#32953)

* [test](insert-overwrite) Add insert overwrite auto detect concurrency cases (#32935)

* [fix](compile) fe cannot compile in idea (#32955)

* [enhancement](plsql) Support select * from routines (#32866)

Support show of plsql procedure using select * from routines.

* [fix](trino-connector) fix `NoClassDefFoundError` of hudi `Utils` class (#32846)

Due to the change of this PR #32455 , the `trino-connector-scanner` package cannot access the `hudi_scanner` package, so the exception NoclassDeffounderror will appear.

We need to write a separate Utils class.

* [exec](column) change some complex column move to noexcept (#32954)

* [Enhancement](data skew) extends show data skew (#32732)

* [chore](test) let suite compatible with Nereids (#32964)

* Support identical column name in different index. (#32792)

* Limit the max string length to 1024 while collecting column stats to control BE memory usage. (#32470)

* [fix](merge-iterator) fix NOT_IMPLEMENTED_ERROR when read next block view (#32961)

* [improvement](executor)Add tag property for workload group #32874

* [fix](auth)unified workload and resource permission logic (#32907)

- `Grant resource` can no longer grant global `usage_priv`
-  `grant resource %` instead of `grant resource *`

before change:
```
grant usage_priv on resource * to f;
show grants for f\G
*************************** 1. row ***************************
      UserIdentity: 'f'@'%'
           Comment: 
          Password: No
             Roles: 
       GlobalPrivs: Usage_priv 
      CatalogPrivs: NULL
     DatabasePrivs: internal.information_schema: Select_priv ; internal.mysql: Select_priv 
        TablePrivs: NULL
          ColPrivs: NULL
     ResourcePrivs: NULL
 CloudClusterPrivs: NULL
WorkloadGroupPrivs: normal: Usage_priv 
```
after change
```
grant usage_priv on resource '%' to f;
show grants for f\G
*************************** 1. row ***************************
      UserIdentity: 'f'@'%'
           Comment: 
          Password: No
             Roles: 
       GlobalPrivs: NULL
      CatalogPrivs: NULL
     DatabasePrivs: internal.information_schema: Select_priv ; internal.mysql: Select_priv 
        TablePrivs: NULL
          ColPrivs: NULL
     ResourcePrivs: %: Usage_priv 
 CloudClusterPrivs: NULL
WorkloadGroupPrivs: normal: Usage_priv 

```

---------

Co-authored-by: yujun <yu.jun.reach@gmail.com>
Co-authored-by: Gavin Chou <gavineaglechou@gmail.com>
Co-authored-by: xy720 <22125576+xy720@users.noreply.github.com>
Co-authored-by: yongjinhou <109586248+yongjinhou@users.noreply.github.com>
Co-authored-by: Dongyang Li <hello_stephen@qq.com>
Co-authored-by: stephen <hello-stephen@qq.com>
Co-authored-by: morrySnow <101034200+morrySnow@users.noreply.github.com>
Co-authored-by: seawinde <149132972+seawinde@users.noreply.github.com>
Co-authored-by: lihangyu <15605149486@163.com>
Co-authored-by: Yulei-Yang <yulei.yang0699@gmail.com>
Co-authored-by: starocean999 <40539150+starocean999@users.noreply.github.com>
Co-authored-by: wangbo <wangbo@apache.org>
Co-authored-by: Mingyu Chen <morningman@163.com>
Co-authored-by: Jerry Hu <mrhhsg@gmail.com>
Co-authored-by: zhiqiang <seuhezhiqiang@163.com>
Co-authored-by: Xinyi Zou <zouxinyi02@gmail.com>
Co-authored-by: Vallish Pai <vallishpai@gmail.com>
Co-authored-by: amory <wangqiannan@selectdb.com>
Co-authored-by: HappenLee <happenlee@hotmail.com>
Co-authored-by: Jensen <czjourney@163.com>
Co-authored-by: zhangdong <493738387@qq.com>
Co-authored-by: Yongqiang YANG <98214048+dataroaring@users.noreply.github.com>
Co-authored-by: jakevin <jakevingoo@gmail.com>
Co-authored-by: Mryange <59914473+Mryange@users.noreply.github.com>
Co-authored-by: zclllyybb <zhaochangle@selectdb.com>
Co-authored-by: Tiewei Fang <43782773+BePPPower@users.noreply.github.com>
Co-authored-by: Xin Liao <liaoxinbit@126.com>
@Jibing-Li Jibing-Li deleted the mvtrigger branch April 1, 2024 02:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by one committer. reviewed
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants