Coverage on unit tests fails when tracking child processes #6887

alalazo · 2018-01-10T16:31:13Z

As of 596d463 using concurrency=multiprocess to run:

$ coverage run bin/spack test

results in a few .coverage* files being corrupted. This may compromise the execution of:

$ coverage combine
$ codecov --required

and in the end the coverage report from codecov (see data from January 2018, where it dropped from ~75% to ~50%).

#6872 has been merged as a work-around to get more stable measurements, but the underlying issue with pytest + multiprocessing + coverage needs to be investigated further.

@adamjstewart @tgamblin @scheibelp @becker33

The text was updated successfully, but these errors were encountered:

alalazo · 2018-01-10T17:24:27Z

For the records, this seems to be the first commit behaving in a strange way on codecov, but probably this is Travis fault (it was an automatic trigger of a commit that already went good before).

The next commit that's passed on Travis is c3b003e, the build is here and the codecov report here. Both of the linux unit tests were not sent to codecov (so 8 reports instead of 10). It dates 2018-01-02T19:59:34Z.

Commits before those two seem to end in a good way. Next step: what changed on 2018-01-02 ?

alalazo · 2018-01-10T17:43:34Z

Commit 088c193 scores 74.45% on codecov (last stable number). It has all 10 reports on codecov, but still the log of Travis shows:

Coverage.py warning: Couldn't read data from '/home/travis/build/spack/spack/.coverage.travis-job-spack-spack-323456025.travisci.net.9618.332776': CoverageException: Doesn't seem to be a coverage.py data file
Coverage.py warning: Couldn't read data from '/home/travis/build/spack/spack/.coverage.travis-job-spack-spack-323456025.travisci.net.9686.644486': CoverageException: Doesn't seem to be a coverage.py data file
Coverage.py warning: Couldn't read data from '/home/travis/build/spack/spack/.coverage.travis-job-spack-spack-323456025.travisci.net.9719.277474': CoverageException: Doesn't seem to be a coverage.py data file

codecov version is 2.0.10. On January 2 version 2.0.11 went out.

At this point I start to think that we had this bug with pytest + coverage since quite some time, but never noticed it because codecov was able to deal nicely with the merged coverage file (despite the failures above) until version 2.0.11.

alalazo · 2018-01-10T18:30:05Z

I think I finally got the the bottom of this twisted issue. Between 2.0.10 and 2.0.11 codecov/codecov-python#112 was merged. This means that we started calling:

$ coverage combine

twice.

Now, doing this:

$ coverage run --concurrency=multiprocessing bin/spack test
$ coverage combine
Coverage.py warning: Couldn't read data from '/home/mculpo/PycharmProjects/spack/.coverage.nuvolari.1043.077127': CoverageException: Doesn't seem to be a coverage.py data file

will result very likely in some error. If you check your .coverage now you'll find it has data in it. The corrupted files will instead be empty and are not deleted like the good ones by the command.

Running again:

$ coverage combine

(which is what effectively codecov 2.0.11 does for us) will account only for the corrupted files and leave you with an empty report! The content is:

!coverage.py: This is a private format, don't read it directly!{}

alalazo · 2018-01-10T18:36:31Z

Bottomline: yes, we have a buglet in our tests somewhere related to multiprocessing, coverage has a bug (because it doesn't realize that a .coverage file is already there before deleting it) , Travis has numerous unrelated bugs that cause random failures of jobs and codecov had a bug that once fixed broke our CI 😆

adamjstewart · 2018-01-10T18:42:51Z

I would seriously like to get to the bottom of these Travis network connection problems. I've never seen so many failing jobs before. But there's not much we can do on our end for that one. Good work!

alalazo · 2018-01-11T08:21:13Z

Reported in this issue the behavior of coverage.

nedbat · 2018-01-11T12:58:44Z

@alalazo Thanks for the report. I asked on the bitbucket issue for a reproducible case, but I see the issue there is anonymous. Or we can talk about it more here :)

alalazo · 2018-01-11T17:27:09Z

@nedbat Sure. Concerning a simple case to reproduce what I see with coverage:

$ coverage run -p $(which flake8) 
$ ls -a
.  ..  .coverage.nuvolari.4834.430541
$ touch .coverage.nuvolari.4677.764578
$ coverage combine
Coverage.py warning: Couldn't read data from '/home/mculpo/tmp/covbug/.coverage.nuvolari.4677.764578': CoverageException: Doesn't seem to be a coverage.py data file

At this point .coverage has data in it. When I do again:

$ coverage combine
Coverage.py warning: Couldn't read data from '/home/mculpo/tmp/covbug/.coverage.nuvolari.4677.764578': CoverageException: Doesn't seem to be a coverage.py data file

the .coverage file is empty.

nedbat · 2018-01-14T23:54:08Z

The problem here is two things working together:

Files that can't be read as data files during combine are left in place. Empty files can't be read as data files.
"coverage combine" deletes the .coverage file, and then combines all the ".coverage.*" files it can find. The second "coverage combine" deletes the data file, and then tries again to read the empty files, and cannot, so there is no data left.

If you use "coverage combine -a" instead, then it won't delete the data file during the combine step.

alalazo · 2018-01-15T06:23:21Z

@nedbat You're completely correct, and what you say was clear to me before - basically that's what I got tracking the issue we had. What I was wondering is if point 2. above:

"coverage combine" deletes the .coverage file, and then combines all the ".coverage.*" files it can find.

is to be considered a bug or not for your application, as it may delete without warning a valid .coverage file.

In our case it was not trivial to understand this, as some command down the chain (codecov) started calling coverage combine "for us", at some point, and that resulted in a highly fluctuating coverage. To arrive at this conclusion we had to track:

when the coverage started to behave weirdly
what changed on that date + get the relevant PR of codecov
reproduce what coverage combine called twice does in presence of empty .coverage.* files

What I was suggesting is that maybe coverage combine should check upfront for the presence of a .coverage file, and fail with a meaningful message if it is present. For instance:

Error: cannot combine coverage files when .coverage is already present.

If you instead think coverage combine should behave like it does, feel free to close the issue I opened on bitbucket (and apologies for the noise).

According to what was discovered in spack#6887, one of the problems is calling 'coverage combine' twice without the '-a' flag. This removes the first call within our test scripts.

* Revert "Travis: use --concurrency=multiprocessing only on build tests (#6872)" This reverts commit 596d463. * Removing 'coverage combine' in test script According to what was discovered in #6887, one of the problems is calling 'coverage combine' twice without the '-a' flag. This removes the first call within our test scripts.

nedbat · 2018-01-21T21:12:31Z

I think combine does this to parallel the behavior of run: it always generates a new data file, and both have a -a flag to append to the existing file. This might be too literal a design.

What about this: a key point in your scenario is that the second combine step doesn't find any usable data files. Perhaps we should avoid saving the new data file if there wasn't any actual combining that got done?

nedbat · 2018-01-21T22:17:25Z

Or perhaps: if after reading all the combinable files we could, there is no data at all, then don't write over the coverage data file?

nedbat · 2018-01-21T23:05:22Z

I just fixed this by raising an error if combine can't read any of the files it found.

alalazo · 2018-01-22T06:46:37Z

Thanks @nedbat. Just read your messages above. It seems to me a sensible behavior.

* Revert "Travis: use --concurrency=multiprocessing only on build tests (spack#6872)" This reverts commit 596d463. * Removing 'coverage combine' in test script According to what was discovered in spack#6887, one of the problems is calling 'coverage combine' twice without the '-a' flag. This removes the first call within our test scripts.

alalazo · 2019-02-14T09:42:50Z

This issue has been solved a year ago, forgot to close

alalazo added bug Something isn't working tests General test capability(ies) labels Jan 10, 2018

alalazo self-assigned this Jan 10, 2018

alalazo mentioned this issue Jan 16, 2018

Restore multiprocessing on unit tests #6948

Closed

This was referenced Jan 16, 2018

Restore multiprocessing in unit tests #6949

Merged

coverage combine does not delete .coverage file if present codecov/codecov-python#134

Merged

nedbat mentioned this issue Jun 23, 2018

Multiple use of combine leads to empty .coverage nedbat/coveragepy#629

Closed

benclifford mentioned this issue Jan 2, 2019

Measure test coverage during CI Parsl/parsl#724

Merged

alalazo closed this as completed Feb 14, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Coverage on unit tests fails when tracking child processes #6887

Coverage on unit tests fails when tracking child processes #6887

alalazo commented Jan 10, 2018

alalazo commented Jan 10, 2018

alalazo commented Jan 10, 2018

alalazo commented Jan 10, 2018

alalazo commented Jan 10, 2018

adamjstewart commented Jan 10, 2018

alalazo commented Jan 11, 2018

nedbat commented Jan 11, 2018

alalazo commented Jan 11, 2018

nedbat commented Jan 14, 2018

alalazo commented Jan 15, 2018

nedbat commented Jan 21, 2018

nedbat commented Jan 21, 2018

nedbat commented Jan 21, 2018

alalazo commented Jan 22, 2018

alalazo commented Feb 14, 2019

Coverage on unit tests fails when tracking child processes #6887

Coverage on unit tests fails when tracking child processes #6887

Comments

alalazo commented Jan 10, 2018

alalazo commented Jan 10, 2018

alalazo commented Jan 10, 2018

alalazo commented Jan 10, 2018

alalazo commented Jan 10, 2018

adamjstewart commented Jan 10, 2018

alalazo commented Jan 11, 2018

nedbat commented Jan 11, 2018

alalazo commented Jan 11, 2018

nedbat commented Jan 14, 2018

alalazo commented Jan 15, 2018

nedbat commented Jan 21, 2018

nedbat commented Jan 21, 2018

nedbat commented Jan 21, 2018

alalazo commented Jan 22, 2018

alalazo commented Feb 14, 2019