
criteria.md: remove "code coverage" requirements. #1

Closed

Conversation


gregkh commented Aug 10, 2015

Code "coverage" is really just "test coverage" and is not a good metric for any sort of "assurance" other than someone wrote a bunch of tests. See http://martinfowler.com/bliki/TestCoverage.html for more details as to why I wouldn't even put this on a list of something to try to achieve toward.

Code "coverage" is really just "test coverage" and is not a good metric for any sort of "assurance" other than someone wrote a bunch of tests.  See http://martinfowler.com/bliki/TestCoverage.html for more details as to why I wouldn't even put this on a list of something to try to achieve toward.

dankohn commented Aug 11, 2015

Fowler says:

If you are testing thoughtfully and well, I would expect a coverage percentage in the upper 80s or 90s.

The criteria you're suggesting to delete are not for the initial badge, but for a potential future version with more stringent criteria. I think the point could be clarified that 100% test coverage probably isn't a good criterion even for a hypothetical platinum badge, for the reasons that Fowler describes.

But code coverage does have a reasonable correlation with the chance that an unrelated change will break basic functionality elsewhere in the app (i.e., it serves as a smoke test). While this isn't particularly relevant to security bugs, it does matter to the reliability of a project and its resilience to updates, especially for projects that don't have lots of eyeballs. For example, it is useful for gems and npm modules to have enough test coverage to tell whether a new pull request breaks any core functionality (which Travis verifies automatically and displays in the GitHub user interface). That lets a maintainer merge after a code review without having to manually QA (quality-assure) the library first.
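To make that concrete, here is a minimal sketch of the kind of smoke test being described; the file name, parse_entry(), and the build/report commands are illustrative assumptions, not anything from this project:

```c
/* smoke_test.c - a hypothetical minimal smoke test; parse_entry() and
 * its semantics are invented for this illustration.
 * Build with coverage instrumentation and report with gcov:
 *   gcc --coverage -c smoke_test.c
 *   gcc --coverage -o smoke_test smoke_test.o
 *   ./smoke_test && gcov smoke_test.c   # prints % of lines executed
 */
#include <assert.h>
#include <string.h>

/* Toy "core functionality" under test. */
static int parse_entry(const char *s)
{
    return (s != NULL && strlen(s) > 0) ? 0 : -1;
}

int main(void)
{
    /* Exercise the basic success and failure paths; a CI job running
     * this on every pull request gives the automatic check described
     * above. */
    assert(parse_entry("name=value") == 0);
    assert(parse_entry("") == -1);
    assert(parse_entry(NULL) == -1);
    return 0;
}
```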

(Oh, and thanks for making pull request #1!)


gregkh commented Aug 11, 2015

I agree this isn't for the "initial" badge; it's just that a blind "100%" or "greater than X%" threshold might not be all that relevant a metric on its own.

Feel free to keep it in the list, though; what we want is people talking about this stuff. And feel free to close this pull request if you don't want to change it.

david-a-wheeler commented

I think for the moment it's better to leave the coverage idea in. That way, we can get people talking. It's not a required criterion, and it may never be required for any badge.

There's debate about the value of coverage measures. SQLite, for example, strongly emphasizes the value of its 100% branch and 100% MC/DC coverage, per https://www.sqlite.org/testing.html (they do other things, but that is a key part of what they do). In contrast, the Linux kernel has to deal with a lot of weird hardware with undocumented behavior, and no one has all of it... frankly, it's not clear how the Linux kernel drivers (for example) could get very high coverage numbers, or what value those numbers would have.
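For readers unfamiliar with the distinction, here is a small invented example (not SQLite code) of why MC/DC is a stricter bar than plain branch coverage; the function and test values are made up for illustration:

```c
/* Branch coverage of the `if` below needs only two tests (the whole
 * condition true once, false once). MC/DC additionally requires tests
 * showing that each of a, b, c independently flips the outcome. */
#include <assert.h>

static int permit(int a, int b, int c)
{
    if (a && (b || c))   /* one decision, three conditions */
        return 1;
    return 0;
}

int main(void)
{
    /* Enough for 100% branch coverage: one true, one false. */
    assert(permit(1, 1, 0) == 1);
    assert(permit(0, 0, 0) == 0);

    /* Extra cases MC/DC demands: toggling a single condition
     * changes the result. */
    assert(permit(0, 1, 0) == 0);   /* vs (1,1,0): a matters */
    assert(permit(1, 0, 0) == 0);   /* vs (1,1,0): b matters */
    assert(permit(1, 0, 1) == 1);   /* vs (1,0,0): c matters */
    return 0;
}
```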

Even Fowler notes that "if you are testing thoughtfully and well, I would expect a coverage percentage in the upper 80s or 90s... Certainly low coverage numbers, say below half, are a sign of trouble." So while high coverage is not by itself a measure of good testing, a very low measure is a sign of bad testing. If there's a way we can determine whether testing is really good - perhaps by pairing coverage with other measures - I am all ears. Also, high coverage may not be good for finding current defects, but it can be really helpful when refactoring code. I find that having lots of tests (with good coverage) makes me much more willing to refactor code.

That said, this is pull request #1... and that is AWESOME. Thank you.

Please keep the comments coming; I'd love to hear them!


gregkh commented Aug 11, 2015

For the kernel, about 15 years ago, we did have people doing code-coverage analysis; I think the build option to enable it is still in there if you want to use it. It turns out no one ever cared - it was just a "number on a spreadsheet" that someone wanted to track. We do have options to 'fail' core calls (like memory allocation) a percentage of the time to exercise error paths, but again, no one seems to ever use them, because no one really cares.
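A rough user-space sketch of that fault-injection idea (the kernel's real mechanism is its fault-injection framework, e.g. CONFIG_FAULT_INJECTION/failslab; this standalone wrapper and its FAIL_PERCENT knob are invented for illustration):

```c
#include <stdio.h>
#include <stdlib.h>

#define FAIL_PERCENT 10  /* fail roughly 10% of allocations */

/* Wrapper that makes malloc() fail on purpose some of the time,
 * forcing callers' error paths to actually run under test. */
static void *failing_malloc(size_t size)
{
    if (rand() % 100 < FAIL_PERCENT)
        return NULL;             /* simulated allocation failure */
    return malloc(size);
}

int main(void)
{
    srand(42);                   /* fixed seed: reproducible test runs */
    int failures = 0;

    for (int i = 0; i < 1000; i++) {
        void *p = failing_malloc(64);
        if (!p) {
            failures++;          /* this is the error path we want hit */
            continue;
        }
        free(p);
    }
    printf("%d of 1000 allocations failed on purpose\n", failures);
    return 0;
}
```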

Perhaps this could be rewritten as something like:

Tests for major feature additions

since without some kind of test present, it's hard to verify whether something new even works. This is now the "unofficial" rule for the kernel, as we have been burned in the past by feature additions that never even worked.

I'll close this one out for now; keeping what you have is fine at the moment. Thanks.
