Fixing test flakiness #570

sleepy-owl · 2021-02-24T04:11:02Z

The test test_ground_truth_separated_modes sometimes fails non-deterministically. This PR addresses this issue.

To find a solution, I collected samples of value of statistic in the assertion from several test executions and computed the tail distribution. I computed the extreme percentiles to check how high can the values be. The computed percentiles are as follows:

0.9:: 0.09
0.99 :: 0.16
0.999:: 0.20
0.9999 :: 0.30

For this fix, i chose the 99.9th percentile. I think setting the bound using the statistical evaluation might be a good way to ensure the test is not flaky.

Do you guys think this makes sense? Please let me know if this looks good or if you have any other suggestions. Also, here I assume there are no bugs in the code under test.

yannikschaelte · 2021-02-24T08:18:18Z

Hi @sleepy-owl , thanks for this contribution! I think checking the test percentiles is the way to go indeed (unless we set the RNG, which we however rather don't want to atm). The Kolmogorov-Smirnov test is unfortunately rather unstable, s.t. the 20% may be the best we can do here I guess.

sleepy-owl · 2021-02-24T17:12:40Z

Thanks! Can you please merge this if this looks good?

sleepy-owl · 2021-02-25T19:29:37Z

@yannikschaelte gentle ping! Is there anything else that I should include in the PR? if not, can you please merge the PR?

yannikschaelte · 2021-02-25T19:52:05Z

Was waiting for another PR, but will merge soon, thanks!

sleepy-owl · 2021-02-26T01:01:53Z

Thanks!

Fixing test flakiness

dc88b1f

yannikschaelte assigned sleepy-owl Feb 24, 2021

yannikschaelte approved these changes Feb 24, 2021

View reviewed changes

yannikschaelte changed the base branch from master to develop February 24, 2021 08:18

Merge branch 'develop' into testfix

71ac7cd

Merge branch 'develop' into testfix

fc491fe

FFroehlich and others added 2 commits February 25, 2021 15:38

Merge branch 'develop' into testfix

94cbf1d

Merge branch 'develop' into testfix

261fc9b

yannikschaelte merged commit 3dc7259 into ICB-DCM:develop Feb 26, 2021

yannikschaelte mentioned this pull request Mar 17, 2021

Release 0.2.4 #596

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing test flakiness #570

Fixing test flakiness #570

sleepy-owl commented Feb 24, 2021

yannikschaelte commented Feb 24, 2021

sleepy-owl commented Feb 24, 2021

sleepy-owl commented Feb 25, 2021

yannikschaelte commented Feb 25, 2021

sleepy-owl commented Feb 26, 2021

Fixing test flakiness #570

Fixing test flakiness #570

Conversation

sleepy-owl commented Feb 24, 2021

yannikschaelte commented Feb 24, 2021

sleepy-owl commented Feb 24, 2021

sleepy-owl commented Feb 25, 2021

yannikschaelte commented Feb 25, 2021

sleepy-owl commented Feb 26, 2021