Skip to content

Conversation

@jacktengg
Copy link
Contributor

@jacktengg jacktengg commented Jul 2, 2024

Add logs to debug spill hash join bugs:

*** Query id: d7f1126be4e948c6-87f1a80ed3cbd69e ***
*** is nereids: 0 ***
*** tablet id: 0 ***
*** Aborted at 1719291313 (unix time) try "date -d @1719291313" if you are using GNU date ***
*** Current BE git commitID: 5f5262a885 ***
*** SIGSEGV address not mapped to object (@0x8) received by PID 1419021 (TID 1421288 OR 0x7f0212b43640) from PID 8; stack trace: ***
 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /home/zcp/repo_center/doris_master/doris/be/src/common/signal_handler.h:421
 1# PosixSignals::chained_handler(int, siginfo*, void*) [clone .part.0] in /usr/lib/jvm/java-17-openjdk-amd64/lib/server/libjvm.so
 2# JVM_handle_linux_signal in /usr/lib/jvm/java-17-openjdk-amd64/lib/server/libjvm.so
 3# 0x00007F06BD506520 in /lib/x86_64-linux-gnu/libc.so.6
 4# doris::vectorized::SpillReader::read(doris::vectorized::Block*, bool*) at /home/zcp/repo_center/doris_master/doris/be/src/vec/spill/spill_reader.cpp:96
 5# doris::vectorized::SpillStream::read_next_block_sync(doris::vectorized::Block*, bool*) in /mnt/disk1/STRESS_ENV/be/lib/doris_be
 6# std::_Function_handler<void (), doris::pipeline::PartitionedHashJoinProbeLocalState::recovery_build_blocks_from_disk(doris::RuntimeState*, unsigned int, bool&)::$_1>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
 7# doris::ThreadPool::dispatch_thread() in /mnt/disk1/STRESS_ENV/be/lib/doris_be
 8# doris::Thread::supervise_thread(void*) at /home/zcp/repo_center/doris_master/doris/be/src/util/thread.cpp:499
 9# start_thread at ./nptl/pthread_create.c:442
10# 0x00007F06BD5EA850 at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:83

@doris-robot
Copy link

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

clang-tidy made some suggestions

return Status::OK();
}

Status PartitionedHashJoinProbeLocalState::recovery_build_blocks_from_disk(RuntimeState* state,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

warning: function 'recovery_build_blocks_from_disk' has cognitive complexity of 57 (threshold 50) [readability-function-cognitive-complexity]

Status PartitionedHashJoinProbeLocalState::recovery_build_blocks_from_disk(RuntimeState* state,
                                           ^
Additional context

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:265: +1, including nesting penalty of 0, nesting level increased to 1

    if (!spilled_stream) {
    ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:270: +1, including nesting penalty of 0, nesting level increased to 1

    if (!mutable_block) {
    ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:284: nesting level increased to 1

    auto read_func = [this, query_id, state, spilled_stream = spilled_stream, &mutable_block,
                     ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:287: +2, including nesting penalty of 1, nesting level increased to 2

        if (!shared_state_sptr || state->is_cancelled()) {
        ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:287: +1

        if (!shared_state_sptr || state->is_cancelled()) {
                               ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:300: +2, including nesting penalty of 1, nesting level increased to 2

        while (!eos) {
        ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:303: +3, including nesting penalty of 2, nesting level increased to 3

            DBUG_EXECUTE_IF("fault_inject::partitioned_hash_join_probe::recover_build_blocks", {
            ^

be/src/util/debug_points.h:36: expanded from macro 'DBUG_EXECUTE_IF'

    if (UNLIKELY(config::enable_debug_points)) {                              \
    ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:303: +4, including nesting penalty of 3, nesting level increased to 4

            DBUG_EXECUTE_IF("fault_inject::partitioned_hash_join_probe::recover_build_blocks", {
            ^

be/src/util/debug_points.h:38: expanded from macro 'DBUG_EXECUTE_IF'

        if (dp) {                                                             \
        ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:307: +3, including nesting penalty of 2, nesting level increased to 3

            if (st.ok()) {
            ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:310: +3, including nesting penalty of 2, nesting level increased to 3

            if (!st.ok()) {
            ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:318: +3, including nesting penalty of 2, nesting level increased to 3

            if (block.empty()) {
            ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:322: +3, including nesting penalty of 2, nesting level increased to 3

            if (UNLIKELY(state->is_cancelled())) {
            ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:327: +3, including nesting penalty of 2, nesting level increased to 3

            if (mutable_block->empty()) {
            ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:329: +1, nesting level increased to 3

            } else {
              ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:332: +4, including nesting penalty of 3, nesting level increased to 4

                if (!st.ok()) {
                ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:348: nesting level increased to 1

    auto exception_catch_func = [read_func, query_id, this]() {
                                ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:349: +2, including nesting penalty of 1, nesting level increased to 2

        DBUG_EXECUTE_IF("fault_inject::partitioned_hash_join_probe::recover_build_blocks_cancel", {
        ^

be/src/util/debug_points.h:36: expanded from macro 'DBUG_EXECUTE_IF'

    if (UNLIKELY(config::enable_debug_points)) {                              \
    ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:349: +3, including nesting penalty of 2, nesting level increased to 3

        DBUG_EXECUTE_IF("fault_inject::partitioned_hash_join_probe::recover_build_blocks_cancel", {
        ^

be/src/util/debug_points.h:38: expanded from macro 'DBUG_EXECUTE_IF'

        if (dp) {                                                             \
        ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:356: nesting level increased to 2

        auto status = [&]() {
                      ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:357: +3, including nesting penalty of 2, nesting level increased to 3

            RETURN_IF_CATCH_EXCEPTION(read_func());
            ^

be/src/common/exception.h:89: expanded from macro 'RETURN_IF_CATCH_EXCEPTION'

    do {                                                                                         \
    ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:357: +4, including nesting penalty of 3, nesting level increased to 4

            RETURN_IF_CATCH_EXCEPTION(read_func());
            ^

be/src/common/exception.h:94: expanded from macro 'RETURN_IF_CATCH_EXCEPTION'

        } catch (const doris::Exception& e) {                                                    \
          ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:357: +5, including nesting penalty of 4, nesting level increased to 5

            RETURN_IF_CATCH_EXCEPTION(read_func());
            ^

be/src/common/exception.h:95: expanded from macro 'RETURN_IF_CATCH_EXCEPTION'

            if (e.code() == doris::ErrorCode::MEM_ALLOC_FAILED) {                                \
            ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:361: +2, including nesting penalty of 1, nesting level increased to 2

        if (!status.ok()) {
        ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:372: +1, including nesting penalty of 0, nesting level increased to 1

        if (pipeline_task) {
        ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:381: +1, including nesting penalty of 0, nesting level increased to 1

    DBUG_EXECUTE_IF("fault_inject::partitioned_hash_join_probe::recovery_build_blocks_submit_func",
    ^

be/src/util/debug_points.h:36: expanded from macro 'DBUG_EXECUTE_IF'

    if (UNLIKELY(config::enable_debug_points)) {                              \
    ^

be/src/pipeline/exec/partitioned_hash_join_probe_operator.cpp:381: +2, including nesting penalty of 1, nesting level increased to 2

    DBUG_EXECUTE_IF("fault_inject::partitioned_hash_join_probe::recovery_build_blocks_submit_func",
    ^

be/src/util/debug_points.h:38: expanded from macro 'DBUG_EXECUTE_IF'

        if (dp) {                                                             \
        ^

@jacktengg jacktengg force-pushed the fix-0628 branch 2 times, most recently from 71c8689 to eddabd8 Compare July 2, 2024 09:15
@jacktengg
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 39771 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit eddabd8080bd075ccfd6cce010768136e3bc5a8e, data reload: false

------ Round 1 ----------------------------------
q1	17615	4479	4287	4287
q2	2031	192	189	189
q3	10433	1225	1090	1090
q4	10187	828	793	793
q5	7504	2685	2624	2624
q6	219	139	137	137
q7	979	594	609	594
q8	9238	2112	2076	2076
q9	9001	6515	6517	6515
q10	8980	3723	3696	3696
q11	451	236	253	236
q12	469	224	227	224
q13	19006	2978	2937	2937
q14	268	239	218	218
q15	519	486	489	486
q16	513	369	378	369
q17	979	707	721	707
q18	8180	7436	7412	7412
q19	5893	1504	1395	1395
q20	661	320	321	320
q21	4915	3145	3858	3145
q22	387	321	339	321
Total cold run time: 118428 ms
Total hot run time: 39771 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4371	4261	4275	4261
q2	381	273	248	248
q3	3091	3047	2984	2984
q4	1986	1720	1746	1720
q5	5604	5599	5476	5476
q6	238	131	128	128
q7	2204	1851	1868	1851
q8	3288	3439	3762	3439
q9	8773	8723	8786	8723
q10	4120	3777	3838	3777
q11	596	484	522	484
q12	812	665	635	635
q13	17111	3131	3190	3131
q14	325	285	291	285
q15	538	484	493	484
q16	511	438	426	426
q17	1814	1526	1536	1526
q18	8246	7922	7856	7856
q19	5331	1678	1707	1678
q20	2165	1867	1829	1829
q21	10779	4894	4720	4720
q22	649	560	572	560
Total cold run time: 82933 ms
Total hot run time: 56221 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 172806 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit eddabd8080bd075ccfd6cce010768136e3bc5a8e, data reload: false

query1	920	386	374	374
query2	6438	2498	2336	2336
query3	6626	215	235	215
query4	19509	17164	17220	17164
query5	3771	476	486	476
query6	268	178	170	170
query7	4598	291	291	291
query8	325	277	270	270
query9	8529	2353	2336	2336
query10	545	285	289	285
query11	10725	10077	9965	9965
query12	117	81	83	81
query13	1633	364	350	350
query14	9461	7672	6865	6865
query15	237	186	184	184
query16	7744	267	260	260
query17	1780	546	518	518
query18	1943	265	267	265
query19	190	145	146	145
query20	86	84	84	84
query21	207	128	126	126
query22	4191	4069	3905	3905
query23	33533	33463	33475	33463
query24	10915	2905	2805	2805
query25	579	369	362	362
query26	730	152	150	150
query27	2316	324	319	319
query28	5786	2115	2115	2115
query29	884	630	633	630
query30	241	153	161	153
query31	964	791	748	748
query32	91	53	55	53
query33	727	295	291	291
query34	1029	479	481	479
query35	752	649	629	629
query36	1093	963	942	942
query37	148	85	83	83
query38	2932	2820	2890	2820
query39	930	858	835	835
query40	220	130	147	130
query41	65	81	52	52
query42	110	109	110	109
query43	625	563	529	529
query44	1165	744	711	711
query45	189	165	168	165
query46	1079	730	733	730
query47	1871	1779	1767	1767
query48	365	292	291	291
query49	831	398	412	398
query50	756	375	382	375
query51	6881	6871	6667	6667
query52	103	86	89	86
query53	371	292	283	283
query54	910	437	435	435
query55	73	78	72	72
query56	277	254	266	254
query57	1105	1059	1046	1046
query58	253	237	252	237
query59	3404	3376	3245	3245
query60	319	279	290	279
query61	113	108	110	108
query62	597	447	468	447
query63	318	282	292	282
query64	8655	2245	1725	1725
query65	3185	3094	3105	3094
query66	737	321	336	321
query67	15383	15015	14998	14998
query68	4538	542	537	537
query69	449	309	297	297
query70	1185	1100	1115	1100
query71	393	277	275	275
query72	7042	5322	5859	5322
query73	740	324	325	324
query74	5953	5542	5429	5429
query75	3513	2686	2687	2686
query76	2117	890	968	890
query77	558	296	292	292
query78	10443	9835	9745	9745
query79	2716	509	506	506
query80	1124	520	467	467
query81	596	217	218	217
query82	1054	101	101	101
query83	296	167	160	160
query84	251	82	87	82
query85	1384	277	267	267
query86	475	312	314	312
query87	3305	3139	3086	3086
query88	3108	2348	2348	2348
query89	452	368	375	368
query90	1864	183	186	183
query91	128	99	100	99
query92	60	49	49	49
query93	1985	498	495	495
query94	1292	182	185	182
query95	399	309	306	306
query96	596	264	263	263
query97	3240	3041	3012	3012
query98	222	200	193	193
query99	1148	835	849	835
Total cold run time: 266079 ms
Total hot run time: 172806 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 30.78 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit eddabd8080bd075ccfd6cce010768136e3bc5a8e, data reload: false

query1	0.04	0.03	0.03
query2	0.09	0.04	0.04
query3	0.23	0.06	0.05
query4	1.71	0.10	0.10
query5	0.50	0.48	0.49
query6	1.13	0.72	0.72
query7	0.02	0.01	0.02
query8	0.05	0.05	0.04
query9	0.56	0.49	0.49
query10	0.53	0.54	0.53
query11	0.16	0.11	0.11
query12	0.15	0.12	0.12
query13	0.60	0.58	0.58
query14	0.77	0.76	0.80
query15	0.86	0.82	0.82
query16	0.34	0.36	0.37
query17	1.01	0.96	0.97
query18	0.22	0.24	0.25
query19	1.83	1.72	1.68
query20	0.01	0.01	0.01
query21	15.46	0.83	0.66
query22	4.15	7.25	2.23
query23	18.27	1.34	1.23
query24	2.14	0.23	0.21
query25	0.16	0.09	0.08
query26	0.27	0.19	0.17
query27	0.07	0.07	0.08
query28	13.25	1.00	0.99
query29	12.64	3.32	3.32
query30	0.25	0.06	0.05
query31	2.86	0.39	0.39
query32	3.29	0.47	0.47
query33	2.85	2.95	2.85
query34	17.10	4.46	4.44
query35	4.50	4.47	4.43
query36	0.65	0.48	0.47
query37	0.19	0.15	0.16
query38	0.14	0.14	0.15
query39	0.05	0.03	0.04
query40	0.17	0.15	0.14
query41	0.09	0.05	0.05
query42	0.05	0.05	0.05
query43	0.04	0.03	0.04
Total cold run time: 109.45 s
Total hot run time: 30.78 s

@jacktengg
Copy link
Contributor Author

run p1

@jacktengg
Copy link
Contributor Author

run external

@jacktengg jacktengg changed the title [test](spill) add logs to debug spill bugs [spill](logs) add logs to debug spill bugs Jul 2, 2024
@jacktengg
Copy link
Contributor Author

run external

@jacktengg
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 39643 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 4727f4e729bdaecbcd8d9d2564d07142eace882b, data reload: false

------ Round 1 ----------------------------------
q1	17604	4432	4306	4306
q2	2020	200	187	187
q3	10879	1224	1070	1070
q4	10574	906	787	787
q5	7934	2699	2579	2579
q6	222	138	136	136
q7	964	602	613	602
q8	9526	2099	2106	2099
q9	9063	6559	6600	6559
q10	9366	3678	3705	3678
q11	458	237	234	234
q12	444	231	228	228
q13	18771	2987	2973	2973
q14	259	218	212	212
q15	522	474	484	474
q16	527	382	373	373
q17	972	630	671	630
q18	8139	7543	7334	7334
q19	4175	1461	1318	1318
q20	678	321	339	321
q21	5035	3207	4066	3207
q22	398	336	342	336
Total cold run time: 118530 ms
Total hot run time: 39643 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4360	4259	4474	4259
q2	376	267	269	267
q3	2958	2744	2718	2718
q4	1841	1603	1586	1586
q5	5270	5278	5279	5278
q6	221	128	128	128
q7	2114	1771	1755	1755
q8	3189	3380	3331	3331
q9	8287	8333	8292	8292
q10	3890	3628	3668	3628
q11	595	481	476	476
q12	787	608	624	608
q13	16258	2978	2988	2978
q14	288	263	251	251
q15	529	482	483	482
q16	469	410	414	410
q17	1756	1476	1448	1448
q18	7801	7521	7448	7448
q19	2080	1613	1535	1535
q20	1974	1775	1776	1775
q21	5001	4843	4778	4778
q22	637	551	559	551
Total cold run time: 70681 ms
Total hot run time: 53982 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 171860 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 4727f4e729bdaecbcd8d9d2564d07142eace882b, data reload: false

query1	922	380	377	377
query2	6466	2382	2339	2339
query3	6656	207	209	207
query4	20284	17636	17322	17322
query5	4191	498	497	497
query6	277	179	153	153
query7	4589	288	287	287
query8	326	287	296	287
query9	8399	2374	2348	2348
query10	598	310	276	276
query11	10442	9828	10004	9828
query12	134	83	83	83
query13	1645	367	356	356
query14	10116	7060	7728	7060
query15	245	190	186	186
query16	7747	261	268	261
query17	1884	540	517	517
query18	1768	262	270	262
query19	192	154	160	154
query20	91	83	83	83
query21	217	124	135	124
query22	4597	4156	4039	4039
query23	33603	32921	32939	32921
query24	12010	2784	2753	2753
query25	666	351	361	351
query26	1758	150	149	149
query27	2950	306	322	306
query28	7323	2070	2076	2070
query29	1094	629	607	607
query30	285	150	149	149
query31	930	737	720	720
query32	95	53	51	51
query33	779	286	282	282
query34	1026	454	467	454
query35	743	638	614	614
query36	1089	917	938	917
query37	291	82	75	75
query38	2898	2719	2789	2719
query39	868	793	780	780
query40	272	127	126	126
query41	60	54	53	53
query42	119	104	98	98
query43	594	540	542	540
query44	1240	759	724	724
query45	196	168	159	159
query46	1069	694	678	678
query47	1864	1747	1743	1743
query48	368	302	307	302
query49	1208	405	412	405
query50	764	380	387	380
query51	7013	6918	6709	6709
query52	107	94	96	94
query53	353	290	287	287
query54	1023	447	448	447
query55	73	76	72	72
query56	289	266	274	266
query57	1136	1016	1042	1016
query58	247	234	243	234
query59	3409	3134	3332	3134
query60	314	269	265	265
query61	93	93	92	92
query62	634	456	440	440
query63	322	294	291	291
query64	9806	2217	1822	1822
query65	3146	3085	3092	3085
query66	1366	336	329	329
query67	15539	14849	15157	14849
query68	4542	538	547	538
query69	452	306	306	306
query70	1149	1028	1082	1028
query71	372	278	270	270
query72	7148	5187	5468	5187
query73	734	322	321	321
query74	5952	5482	5419	5419
query75	3381	2693	2696	2693
query76	2743	910	881	881
query77	455	292	310	292
query78	10280	10047	9691	9691
query79	2553	511	528	511
query80	2163	472	455	455
query81	590	222	216	216
query82	769	108	106	106
query83	299	172	170	170
query84	268	86	92	86
query85	1980	275	272	272
query86	499	330	328	328
query87	3277	3105	3052	3052
query88	4315	2437	2441	2437
query89	472	376	395	376
query90	1838	188	190	188
query91	125	98	99	98
query92	59	48	47	47
query93	1934	512	513	512
query94	1231	185	187	185
query95	403	321	311	311
query96	589	274	268	268
query97	3216	2987	2985	2985
query98	213	206	194	194
query99	1270	833	870	833
Total cold run time: 278089 ms
Total hot run time: 171860 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 30.87 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 4727f4e729bdaecbcd8d9d2564d07142eace882b, data reload: false

query1	0.04	0.03	0.03
query2	0.09	0.04	0.04
query3	0.23	0.05	0.05
query4	1.67	0.08	0.09
query5	0.50	0.46	0.47
query6	1.13	0.72	0.72
query7	0.02	0.01	0.01
query8	0.06	0.04	0.04
query9	0.55	0.51	0.49
query10	0.53	0.54	0.53
query11	0.15	0.11	0.12
query12	0.15	0.13	0.12
query13	0.58	0.59	0.58
query14	0.74	0.77	0.77
query15	0.84	0.81	0.80
query16	0.37	0.38	0.37
query17	1.00	1.04	0.95
query18	0.21	0.26	0.25
query19	1.81	1.74	1.79
query20	0.01	0.01	0.02
query21	15.48	0.75	0.66
query22	4.43	6.43	2.31
query23	18.29	1.38	1.20
query24	2.18	0.22	0.23
query25	0.16	0.09	0.08
query26	0.26	0.17	0.17
query27	0.08	0.07	0.08
query28	13.23	1.01	1.00
query29	12.62	3.29	3.27
query30	0.26	0.05	0.05
query31	2.91	0.40	0.37
query32	3.24	0.48	0.48
query33	2.86	2.96	2.86
query34	17.24	4.44	4.47
query35	4.49	4.47	4.48
query36	0.65	0.45	0.46
query37	0.19	0.16	0.15
query38	0.16	0.15	0.16
query39	0.05	0.04	0.04
query40	0.16	0.15	0.14
query41	0.09	0.05	0.05
query42	0.05	0.05	0.05
query43	0.04	0.04	0.04
Total cold run time: 109.8 s
Total hot run time: 30.87 s

@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Jul 4, 2024
@github-actions
Copy link
Contributor

github-actions bot commented Jul 4, 2024

PR approved by at least one committer and no changes requested.

@github-actions
Copy link
Contributor

github-actions bot commented Jul 4, 2024

PR approved by anyone and no changes requested.

Copy link
Member

@mrhhsg mrhhsg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

Copy link
Contributor

@HappenLee HappenLee left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@jacktengg jacktengg merged commit fa77151 into apache:master Jul 4, 2024
dataroaring pushed a commit that referenced this pull request Jul 17, 2024
Add logs to debug spill hash join bugs:
```
*** Query id: d7f1126be4e948c6-87f1a80ed3cbd69e ***
*** is nereids: 0 ***
*** tablet id: 0 ***
*** Aborted at 1719291313 (unix time) try "date -d @1719291313" if you are using GNU date ***
*** Current BE git commitID: 5f5262a ***
*** SIGSEGV address not mapped to object (@0x8) received by PID 1419021 (TID 1421288 OR 0x7f0212b43640) from PID 8; stack trace: ***
 0# doris::signal::(anonymous namespace)::FailureSignalHandler(int, siginfo_t*, void*) at /home/zcp/repo_center/doris_master/doris/be/src/common/signal_handler.h:421
 1# PosixSignals::chained_handler(int, siginfo*, void*) [clone .part.0] in /usr/lib/jvm/java-17-openjdk-amd64/lib/server/libjvm.so
 2# JVM_handle_linux_signal in /usr/lib/jvm/java-17-openjdk-amd64/lib/server/libjvm.so
 3# 0x00007F06BD506520 in /lib/x86_64-linux-gnu/libc.so.6
 4# doris::vectorized::SpillReader::read(doris::vectorized::Block*, bool*) at /home/zcp/repo_center/doris_master/doris/be/src/vec/spill/spill_reader.cpp:96
 5# doris::vectorized::SpillStream::read_next_block_sync(doris::vectorized::Block*, bool*) in /mnt/disk1/STRESS_ENV/be/lib/doris_be
 6# std::_Function_handler<void (), doris::pipeline::PartitionedHashJoinProbeLocalState::recovery_build_blocks_from_disk(doris::RuntimeState*, unsigned int, bool&)::$_1>::_M_invoke(std::_Any_data const&) at /var/local/ldb-toolchain/bin/../lib/gcc/x86_64-linux-gnu/11/../../../../include/c++/11/bits/std_function.h:291
 7# doris::ThreadPool::dispatch_thread() in /mnt/disk1/STRESS_ENV/be/lib/doris_be
 8# doris::Thread::supervise_thread(void*) at /home/zcp/repo_center/doris_master/doris/be/src/util/thread.cpp:499
 9# start_thread at ./nptl/pthread_create.c:442
10# 0x00007F06BD5EA850 at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:83
```
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by one committer. dev/3.0.1-merged reviewed

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants