sanitycheck log mixing between tests #25719

ABOSTM · 2020-05-28T17:00:44Z

Describe the bug
When running sanitycheck on all tests of directory tests\arch\arm, there is some mixing in logs.
This mixing causes, from time to time, in test PASSED, but reported Failed.
All run doesn't give always the same result (not always test failed , not always the same mixing)
But the failed test is always tests/arch/arm/arm_irq_vector_table (even if I think it is just a combination of circumstances). And it is not always failed on the same board.
Note: tests is launched on a bunch of STM32 board:
nucleo_f207zg, nucleo_f429zi, nucleo_f746zg, nucleo_l152re, nucleo_l4r5zi, nucleo_wb55rg, stm32f3_disco
Thoses tests run everyday on bench and failure occurs at least on 1 board about every day or every 2 days.

To Reproduce
Command to reproduce (require all boards to be connected):
sanitycheck --ninja --warnings-as-errors --runtime-artifact-cleanup -p nucleo_f207zg -p nucleo_f429zi -p nucleo_f746zg -p nucleo_l152re -p nucleo_l4r5zi -p nucleo_wb55rg -p stm32f3_disco -T tests/arch/arm/ --hardware-map ../map.yaml --device-testing --outdir /local/mcu/zephyrproject/logs/200528/log_job_master_d8560f698b_1503 [log_job_master_d8560f698b_1503.zip](https://github.com/zephyrproject-rtos/zephyr/files/4697102/log_job_master_d8560f698b_1503.zip) [log_job_master_d8560f698b_1503.zip](https://github.com/zephyrproject-rtos/zephyr/files/4697103/log_job_master_d8560f698b_1503.zip)

Log mixing
Hereafter some example of mixing (see zip file):
log_job_master_d8560f698b_1503\nucleo_f746zg\tests\arch\arm\arm_interrupt\arch.interrupt.arm\handler.log
--> "Running test suite arm_thread_swap"

log_job_master_d8560f698b_1503\nucleo_l4r5zi\tests\arch\arm\arm_irq_vector_table\arch.interrupt.arm.irq_vector_table\handler.log
--> "Running test suite arm_thread_swap"

log_job_master_d8560f698b_1503\nucleo_f746zg\tests\arch\arm\arm_ramfunc\arch.arm.ramfunc\handler.log
--> "Running test suite arm_thread_swap"

log_job_master_d8560f698b_1503\nucleo_f746zg\tests\arch\arm\arm_interrupt\arch.interrupt.arm\handler.log
--> "Running test suite arm_thread_swap"

c:\tmps\log_zephyr\log_job_master_d8560f698b_1503\nucleo_f429zi\tests\arch\arm\arm_interrupt\arch.interrupt.no_optimizations\handler.log
--> "Running test suite arm_thread_swap"

c:\tmps\log_zephyr\log_job_master_d8560f698b_1503\nucleo_l4r5zi\tests\arch\arm\arm_interrupt\arch.interrupt.no_optimizations\handler.log
--> "Running test suite vector_table"

(And more, I do not mention all of mixing)

The stranger mixing corresponding to failed case:
log_job_master_d8560f698b_1503\nucleo_f429zi\tests\arch\arm\arm_irq_vector_table\arch.interrupt.arm.irq_vector_table\handler.log
It starts with a test "arm_iunterrupt" and finishes with another test "vector_table"
--> "Running test suite arm_interrupt"
--> "Test suite vector_table succeeded"

So it looks like a sanitycheck issue, parsing/splitting console logs.

I attache a zip file containing all logs in there respective directories.

The text was updated successfully, but these errors were encountered:

ABOSTM · 2020-05-28T17:01:03Z

log_job_master_d8560f698b_1503.zip

ABOSTM · 2020-06-02T15:17:00Z

I found similar mixing when running:

sanitycheck -N --device-testing --hardware-map ../map.yaml -c -p nucleo_f746zg -T samples/drivers/watchdog/ -T samples/drivers/can/
sanitycheck -N --device-testing --hardware-map ../map.yaml -c-p nucleo_f303re -T tests/kernel/mem_slab

carlescufi · 2020-06-02T17:49:57Z

@nashif could you take a look?
CC @PerMac

github-actions · 2020-08-02T00:47:59Z

This issue has been marked as stale because it has been open (more than) 60 days with no activity. Remove the stale label or add a comment saying that you would like to have the label removed otherwise this issue will automatically be closed in 14 days. Note, that you can always re-open a closed issue at any time.

PerMac · 2020-09-17T11:23:05Z

I observed the same (similar?) issue when running on-target tests in our setup. Sometimes a test case is reported as failed in the platform.xml report but the failure message indicates something went wrong since the logs are from different test. E.g.:

<testcase classname="sample.kernel" name="sample.kernel.philosopher.stacks" time="5.862062">
<failure message="failed" type="failure">
time_nmi =================================================================== Test suite arm_runtime_nmi_fn succeeded =================================================================== PROJECT EXECUTION SUCCESSFUL
</failure>
</testcase>

PerMac · 2020-10-07T07:33:07Z

This looks like a higher priority than low (probably medium). Currently, this is the issue that prevents us from having green builds. During most of the runs, it happens that there is one false positive due to this error. It also seems that a test which failed due to this is not repeated with --retry-failed. @nashif I can try making a patch that will fix retrying of such cases for now, but it won't be the solution for log/results mixing.

PerMac · 2020-11-24T07:47:39Z

FYI: Most of the time I see this output leaking into other tests and causing them to fail:

arm_runtime_nmi
===================================================================
Test suite arm_runtime_nmi_fn succeeded
===================================================================
PROJECT EXECUTION SUCCESSFUL

nashif · 2021-03-05T03:03:54Z

fixed by #32784

ABOSTM added the bug The issue is a bug, or the PR is fixing a bug label May 28, 2020

ABOSTM added the area: Sanitycheck Sanitycheck has been renamed to Twister label May 28, 2020

carlescufi assigned nashif May 28, 2020

carlescufi added the priority: low Low impact/importance bug label Jun 2, 2020

github-actions bot added the Stale label Aug 2, 2020

ABOSTM removed the Stale label Aug 3, 2020

nashif added the In progress For PRs: is work in progress and should not be merged yet. For issues: Is being worked on label Aug 31, 2020

nashif changed the title ~~tests: arch: arm: sanitycheck log mixing between tests~~ sanitycheck log mixing between tests Sep 11, 2020

nashif added the priority: high High impact/importance bug label Oct 26, 2020

nashif removed the priority: low Low impact/importance bug label Dec 15, 2020

nashif added area: Twister Twister and removed area: Sanitycheck Sanitycheck has been renamed to Twister labels Jan 11, 2021

nashif added priority: medium Medium impact/importance bug and removed priority: high High impact/importance bug labels Jan 22, 2021

PerMac mentioned this issue Feb 19, 2021

twister: Twister cannot properly handle runners errors (flashing) #32478

Closed

nashif mentioned this issue Mar 3, 2021

scripts/twister: Fix race with device-testing #32784

Merged

nashif closed this as completed Mar 5, 2021

ABOSTM mentioned this issue May 12, 2021

twister log mixing between tests #35229

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sanitycheck log mixing between tests #25719

sanitycheck log mixing between tests #25719

ABOSTM commented May 28, 2020

ABOSTM commented May 28, 2020

ABOSTM commented Jun 2, 2020

carlescufi commented Jun 2, 2020

github-actions bot commented Aug 2, 2020

PerMac commented Sep 17, 2020

PerMac commented Oct 7, 2020 •

edited

Loading

PerMac commented Nov 24, 2020

nashif commented Mar 5, 2021

sanitycheck log mixing between tests #25719

sanitycheck log mixing between tests #25719

Comments

ABOSTM commented May 28, 2020

ABOSTM commented May 28, 2020

ABOSTM commented Jun 2, 2020

carlescufi commented Jun 2, 2020

github-actions bot commented Aug 2, 2020

PerMac commented Sep 17, 2020

PerMac commented Oct 7, 2020 • edited Loading

PerMac commented Nov 24, 2020

nashif commented Mar 5, 2021

PerMac commented Oct 7, 2020 •

edited

Loading