
[MNT] remove coverage reporting and pytest-cov from PR CI and setup.cfg #6363

Merged
merged 5 commits into main from remove-pytest-cov on Jun 16, 2024

Conversation

@fkiraly (Collaborator) commented Apr 29, 2024

This PR removes the generation of coverage reports and the installation and use of pytest-cov from standard CI. It also removes the (unreliable) coverage badge from the README.

Reasons:

@fkiraly added labels on Apr 29, 2024: maintenance (Continuous integration, unit testing & package distribution), module:tests (test framework functionality - only framework, excl specific tests)
@fkiraly (Collaborator, Author) commented May 1, 2024

Anecdotal, but it looks like this leads to substantial runtime improvements.

Before: [screenshot of CI job runtimes]

After: [screenshot of CI job runtimes]

@yarnabrina (Member) commented:

@fkiraly I want to be cautious about this. Can we test these:

  1. What happens if, instead of removing coverage completely, we only skip the XML and HTML reports? I believe a base report (.coverage) is always generated, and those two are produced separately.
  2. If it is removed completely (assuming only for CI runs in PRs), how does the coverage report appear in the README (after merge to main)? If it shows missing or 0% or similar, that will be misleading. To test, maybe you can edit the README link in this branch without actually merging.

My caution is mainly because it's highly counter-intuitive to me that coverage would affect timing this much. It's more than a 3-4x difference in your screenshots, and if that were the general effect of pytest-cov, users would have detected it long ago. It's very popular and standard, so I am really wondering whether we are missing something else (though I don't have any alternative ideas yet).
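(For reference, a minimal sketch of the distinction being discussed, using standard pytest-cov flags; the target paths are illustrative, not sktime's actual CI invocation.)

```bash
# coverage data collection only: writes the .coverage data file and a terminal summary
pytest --cov=sktime sktime/tests/

# additionally render XML and HTML reports (the comparatively expensive part)
pytest --cov=sktime --cov-report=xml --cov-report=html sktime/tests/
```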

@fkiraly
Copy link
Collaborator Author

fkiraly commented May 1, 2024

What happens if, instead of removing coverage completely, we only skip the XML and HTML reports? I believe a base report (.coverage) is always generated, and those two are produced separately.

According to the profiler, these parts do indeed create the overhead.

How would you turn these off separately? Can you help? Would that mean removing the --cov-report html flags etc., but not --cov?

how does the coverage report appear in the README

This PR also removes the badge from the readme, because it is misleading anyway, with or without this PR.

We should find a way to display genuine coverage in the README. I would consider that a separate issue (namely, #5090), which would then include adding the correct coverage display to the README.

@fkiraly (Collaborator, Author) commented May 1, 2024

so I am really wondering whether we are missing something else (though I don't have any alternative ideas yet).

So am I.
A wild guess is that we have some runaway import chains in the style of #6355, which are causing the long runtimes.

Or perhaps cause and effect are hard to disentangle in general?

@yarnabrina (Member) commented:

How would you turn these off separately? Can you help? Would that mean removing the --cov-report html flags etc., but not --cov?

Yes, that only. Let's see what happens.

This PR also removes the badge from the readme, because it is misleading anyway, with or without this PR.

I think if the README shows 0% or similar, it may give potential users/contributors the negative impression that this framework is untested (e.g., I know I would feel the same about a new tool).

@fkiraly (Collaborator, Author) commented May 2, 2024

OK - I've added it back in the test-nosoftdeps-full job now, and in pyproject.toml, to see what happens.
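(For illustration only, a hedged sketch of what "keep data collection, drop report rendering" could look like in a pytest configuration; whether the options live in pyproject.toml or setup.cfg, and the exact flags sktime uses, may differ from this.)

```toml
# hypothetical pytest section in pyproject.toml
[tool.pytest.ini_options]
# --cov alone collects coverage data (.coverage) and prints a terminal summary;
# the xml/html report flags are intentionally omitted
addopts = "--cov=sktime"
```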

@yarnabrina (Member) commented:

Only 5 jobs got triggered, not a single testing job! How did it run everything earlier?

@fkiraly (Collaborator, Author) commented May 2, 2024

Only 5 jobs got triggered, not a single testing job! How did it run everything earlier?

I see - I think I understand the reason for the difference.

Previously, pytest-cov was removed from pyproject.toml, and that triggered "test all". Now that we have added it back, pyproject.toml is no longer modified, so there is no trigger for testing anything anymore.
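(As a purely illustrative sketch of this kind of change-based gating, not sktime's actual run_test_for_class utility; the function and file names below are made up.)

```python
def should_run_tests_for(cls, changed_files: list[str]) -> bool:
    """Illustrative change-based test gating, not sktime's real implementation.

    Runs everything if packaging/config files changed; otherwise runs tests
    only if the module defining cls, or its tests folder, was touched.
    """
    # a change to packaging/config files triggers the full test suite
    if any(f in ("pyproject.toml", "setup.cfg") for f in changed_files):
        return True
    # otherwise, run only if the defining module or a sibling tests/ folder changed
    module_path = cls.__module__.replace(".", "/") + ".py"
    tests_dir = module_path.rsplit("/", 1)[0] + "/tests/"
    return any(f == module_path or f.startswith(tests_dir) for f in changed_files)
```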

@yarnabrina (Member) commented:

https://github.com/sktime/sktime/actions/runs/8940323391

I triggered a manual test all workflow on this branch for debugging.

@fkiraly (Collaborator, Author) commented May 3, 2024

Thanks.

Something is taking hours again - how do we find out which estimator it gets stuck on?

@yarnabrina (Member) commented:

I am not aware of a better solution than going into verbose mode.
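(A few standard pytest options that help pin down a hanging or slow test; paths are illustrative, and --timeout requires the separate pytest-timeout plugin.)

```bash
# verbose mode: one line per test, so the last line printed before a hang points at the culprit
pytest -v sktime/forecasting/

# after a completed run, report the 25 slowest tests
pytest --durations=25 sktime/forecasting/

# with pytest-timeout installed, fail (and name) any test running longer than 10 minutes
pytest --timeout=600 sktime/forecasting/
```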

By the way, have you seen the failures? It seems every single "other" run failed with this:

FAILED sktime/tests/tests/test_test_utils.py::test_run_test_for_class - AssertionError: assert 'True_run_always' in ['True_pyproject_change', 'True_changed_class', 'True_changed_tests']

@fkiraly (Collaborator, Author) commented May 3, 2024

By the way, have you seen the failures? It seems every single "other" run failed with this:

Thanks for pointing this out - this is a bug in a test I added to make sure we test the run_test_for_class utility.

The bug surfaces only in the test_all workflow, which has a certain combination of conditions. Fix here: #6383

@yarnabrina (Member) commented:

I checked the other jobs, and so far there is no timeout failure. Only one module job failed, and it's for forecasting:

FAILED sktime/forecasting/model_evaluation/tests/test_evaluate.py::test_evaluate_common_configs[backend8-scoring1-refit-1-10-fh5-ExpandingWindowSplitter] - OverflowError: Python int too large to convert to C long

Ref. https://github.com/sktime/sktime/actions/runs/8940323391/job/24558260007#step:3:6594

Any idea whether it's sporadic? We'll probably know from the random seed diagnostic.

(FYI @benHeid )

@fkiraly (Collaborator, Author) commented May 3, 2024

FAILED sktime/forecasting/model_evaluation/tests/test_evaluate.py::test_evaluate_common_configs[backend8-scoring1-refit-1-10-fh5-ExpandingWindowSplitter] - OverflowError: Python int too large to convert to C long

This is definitely a new one - I have not seen it before.

However, there have been failures in test_evaluate_common_configs in ancient times, but those seem unrelated? See #1194.

We'll probably know from the random seed diagnostic.

Probably not, as that diagnostic adds random seeds only in TestAllForecasters, and test_evaluate_common_configs lives elsewhere.

@fkiraly changed the title from "[MNT] remove coverage reporting and pytest-cov from PR CI" to "[MNT] remove coverage reporting and pytest-cov from PR CI and setup.cfg" on Jun 10, 2024
@Abhay-Lejith (Contributor) commented:

@fkiraly @yarnabrina, mentioning this here as it might be related.
Until now I was not able to make breakpoints work in my Python debugger in VS Code.
I've managed to fix it by essentially disabling pytest-cov (or something to that effect) by modifying my launch.json and adding the following field:
"env": {"PYTEST_ADDOPTS": "--no-cov"}
It appears that pytest-cov somehow interferes with the debugger. More on this in the "Note" section here: https://code.visualstudio.com/docs/python/testing#_pytest-configuration-settings
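(A minimal sketch of the launch.json change described above; everything except the "env" entry is a generic VS Code debug configuration and may differ from your setup.)

```jsonc
// .vscode/launch.json (illustrative)
{
    "version": "0.2.0",
    "configurations": [
        {
            "name": "Debug tests without coverage",
            "type": "debugpy",
            "request": "launch",
            "purpose": ["debug-test"],
            "console": "integratedTerminal",
            "justMyCode": false,
            // disable pytest-cov so breakpoints are not bypassed
            "env": {"PYTEST_ADDOPTS": "--no-cov"}
        }
    ]
}
```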

@yarnabrina (Member) commented:

@fkiraly I think you use VS Code with the integrated debugging? Did you ever face the issue @Abhay-Lejith mentioned?

I definitely face a lot of issues with our current setup.cfg, so I have a local patch to ignore that file altogether (essentially doing what @Abhay-Lejith has done with VS Code settings), so I never faced this issue myself. If this is indeed an issue, as the documentation seems to suggest, I expect it to be a very common thing, and I am wondering why no one ever reported it before? 😕

@fkiraly (Collaborator, Author) commented Jun 10, 2024

@fkiraly I think you use VS Code with the integrated debugging? Did you ever face the issue @Abhay-Lejith mentioned?

Yes, the GUI integrated breakpoint debugging never worked, so I ended up adopting a more manual workflow and also ignoring the file in practice.

I applaud @Abhay-Lejith for having finally identified the reason.

I expect it to be a very common thing, and I am wondering why no one ever reported it before?

Perhaps groupthink bias, i.e., everyone assumes it works for everyone else and that they would be considered stupid if they raised it in public? Nothing could be further from the truth, but sometimes that is how the mind works.

Shall we remove the flags then, since they seem to cause problems systematically?

@fkiraly marked this pull request as ready for review on June 10, 2024 22:57
@yarnabrina (Member) left a review comment:


If we are disabling coverage everywhere, should we drop it from the test dependencies too?

@fkiraly (Collaborator, Author) commented Jun 11, 2024

We should probably have some replacement plan in mind, e.g., where and when we run coverage. On the full test run?
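(One possible shape for such a replacement plan, as a hedged sketch only: run coverage in a main-branch or scheduled workflow instead of per-PR. Workflow names, the install step, and the upload step are illustrative, not sktime's actual CI.)

```yaml
# illustrative workflow: coverage only on merges to main plus a weekly run
name: coverage
on:
  push:
    branches: [main]
  schedule:
    - cron: "0 3 * * 0"  # weekly, Sunday 03:00 UTC
jobs:
  coverage:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      # install the package plus pytest-cov; dependency extras omitted for brevity
      - run: python -m pip install -e . pytest pytest-cov
      - run: python -m pytest --cov=sktime --cov-report=xml
      # upload the XML report; the upload target here is a placeholder choice
      - uses: codecov/codecov-action@v4
        with:
          files: coverage.xml
```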

@fkiraly merged commit bd2f0e3 into main on Jun 16, 2024
177 of 183 checks passed
@fkiraly deleted the remove-pytest-cov branch on June 16, 2024 11:27