Code coverage runtime #20539

myrrc · 2021-02-15T18:30:16Z

I hereby agree to the terms of the CLA available at: https://yandex.ru/legal/cla/?lang=en

Changelog category (leave one):

Build/Testing/Packaging Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Code coverage runtime for functional tests (replaces coverage CI check).
Uses clang SanitizerCoverage callbacks to store data on a per-test basis and dump them after testing.
Introduces critical edges coverage metric.
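As a rough illustration of the callback approach described above, here is a minimal sketch built on clang's documented `trace-pc-guard` interface (`__sanitizer_cov_trace_pc_guard_init` and `__sanitizer_cov_trace_pc_guard`). It is not the code from this PR: the fixed-size `edge_hit` array, the `MAX_EDGES` bound and the `finishTestAndCountHits` helper are assumptions made for the example, and the PR itself may rely on a different SanitizerCoverage mode.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Assumption: a fixed upper bound on the number of instrumented edges.
// A plain static array needs no constructor, so it is usable even if the
// init callback runs before global constructors.
constexpr uint32_t MAX_EDGES = 1u << 20;
static uint8_t edge_hit[MAX_EDGES]; // one flag per edge, index 0 unused
static uint32_t edge_count = 0;

// Called by instrumented code once per module: give every guard a unique id.
extern "C" void __sanitizer_cov_trace_pc_guard_init(uint32_t * start, uint32_t * stop)
{
    if (start == stop || *start != 0)
        return; // empty module or already initialised
    for (uint32_t * guard = start; guard < stop && edge_count + 1 < MAX_EDGES; ++guard)
        *guard = ++edge_count;
}

// Called by instrumented code on every edge: remember that the edge was hit.
extern "C" void __sanitizer_cov_trace_pc_guard(uint32_t * guard)
{
    if (*guard == 0)
        return; // guard without an id (over the MAX_EDGES bound)
    assert(*guard < MAX_EDGES);
    edge_hit[*guard] = 1;
}

// Hypothetical helper: called when a test ends. Returns the number of edges
// hit by that test and resets the flags for the next one. A real runtime
// would write the hit set to a report here instead of only counting.
std::size_t finishTestAndCountHits()
{
    std::size_t hit = 0;
    for (uint32_t i = 1; i <= edge_count; ++i)
        hit += edge_hit[i];
    std::memset(edge_hit, 0, sizeof(edge_hit));
    return hit;
}
```

In a real build the two callbacks are invoked by code compiled with `-fsanitize-coverage=trace-pc-guard`; the sketch only shows how hits could be separated per test.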

myrrc · 2021-02-16T17:36:09Z

Looks like it's the harmful contrib that was causing the bugs, removed it from the coverage builds as we won't need it anyway.

myrrc · 2021-04-07T16:29:54Z

A word about the IPC between the tester script and the server. Currently we have three main options:

1. Use the tester's --testname option, which executes a SELECT testname query before running the test. The issue here is that we need to interfere with the query processing pipeline: either write a hook or handle the special coverage case with a macro.
2. Set an option, e.g. SET coverage_test_name = name; this has the same issue with the pipeline.
3. Send a signal from the tester to the server. This option is "cheaper" in a way; the only issue is how to attach the test_id to the signal. The server installs a POSIX signal handler with SA_SIGINFO, and the tester loads libc and calls the sigqueue function.
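The signal-based option can be sketched as follows. This is only an illustration under stated assumptions, not the PR's code: the names `installTestIdHandler`, `sendTestId`, `currentTestId` and the global `current_test_id` are hypothetical, and both sides are written in C++ here, whereas the tester in this PR is a script that calls `sigqueue` through libc.

```cpp
#include <atomic>
#include <cassert>
#include <signal.h>
#include <unistd.h>

// Hypothetical global: id of the test currently running (-1 means "no test").
static std::atomic<int> current_test_id{-1};

// SA_SIGINFO handler: the integer attached by sigqueue() arrives in si_value.
static void onTestChange(int /*signo*/, siginfo_t * info, void * /*context*/)
{
    current_test_id.store(info->si_value.sival_int, std::memory_order_relaxed);
}

// Server side: install the handler for the chosen signal.
bool installTestIdHandler(int signo)
{
    struct sigaction sa{};
    sa.sa_sigaction = onTestChange;
    sa.sa_flags = SA_SIGINFO;
    sigemptyset(&sa.sa_mask);
    return sigaction(signo, &sa, nullptr) == 0;
}

// Tester side: send the signal with the test id as its payload.
bool sendTestId(pid_t pid, int signo, int test_id)
{
    assert(test_id >= 0); // assumption: real tests use non-negative ids
    sigval value{};
    value.sival_int = test_id;
    return sigqueue(pid, signo, value) == 0;
}

int currentTestId()
{
    return current_test_id.load(std::memory_order_relaxed);
}
```

The server would call `installTestIdHandler(SIGUSR1)` at startup, and the tester would call the equivalent of `sendTestId(server_pid, SIGUSR1, id)` before each test; the choice of `SIGUSR1` is also an assumption.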

myrrc · 2021-08-05T14:40:13Z

Depends on #27228

myrrc · 2021-08-06T19:55:52Z

Depends on #27361

myrrc · 2021-08-10T16:55:42Z

I think this PR won't be done in near future, so I'm closing it.

If you operate entirely on the information provided by the binary itself (with or without the PC-table), all you can get is the addresses that were hit (or the indices of the basic blocks that were hit). You can symbolize this data and get the source file and line, but the problem is that you can't really do anything with this information:

- Multiple basic blocks can correspond to the same lines (e.g. function template instantiations).
- Basic blocks can correspond to lines that do not belong to functions (e.g. macro expansions such as the settings traits implementation).
- If you instrument the code at the critical-edge level, you lose access to the basic block structure; the edge addresses might as well be generated randomly, as there's nothing to gain from using them.
- You won't be able to get the line ranges that were hit, as C++ is hard to parse and you need a compiler for that.

If you involve the compiler (.gcno), things don't get better. clang does not enumerate its basic blocks, so there's no way to match a basic block that was hit (for example, in the boolean counters array that accompanies the PC-table) with a basic block parsed from a .gcno file. You get stuck at the function level. Sorting or transforming the basic blocks' line ranges still does not solve the problem (moreover, .gcno produces strange results, like a line belonging to a basic block that's empty). gcno parsers like lcov use Perl scripts containing thousands of lines to turn all this information into a human-readable format.

And if one day you want to try the source-based code coverage, you'll also fail. There's no way to tweak what data are stored and tracked (unless you patch the compiler which obviously is not a good idea here), so you just collect everything (and it takes >= 9 hours currently for a general run without per-test data). All you can do is use builtins provided to write the report to some other location, but it won't make everything substantially faster.

You may wonder: why can't you simply collect addresses that were hit and display them? Well, for CH purposes it's a) useless, as one needs to get the coverage percentage, and b) useless, as you reinvent the sancov wheel.

I doubt there's a good way to resolve this PR.

alexey-milovidov · 2023-11-21T19:50:15Z

> You may wonder: why can't you simply collect addresses that were hit and display them?

I'm going to do exactly this in #56102

> useless, as one needs to get the coverage percentage

The percentage can be calculated as the percentage of all instrumented edges, the percentage of instrumented symbols in the binary, functions, or source files. The percentage of lines can be calculated roughly if you take the assumption that every basic block spans from the line corresponding to its address to the next instrumented line.
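The rough line-percentage estimate described above could look like the sketch below. It is a sketch under assumptions rather than the implementation from #56102: the function name `approxLineCoverage` and the `end_line` parameter (the line just past the end of the file, needed to close the last block) are hypothetical, and the input is assumed to be the symbolized start lines of instrumented basic blocks within one source file.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <set>
#include <vector>

// instrumented: lines at which an instrumented basic block starts (one file).
// hit:          the subset of those lines whose block was executed.
// end_line:     first line past the end of the file (closes the last block).
// Returns the approximate percentage of covered lines, assuming each block
// spans from its own line up to the next instrumented line.
double approxLineCoverage(std::vector<int> instrumented, const std::set<int> & hit, int end_line)
{
    std::sort(instrumented.begin(), instrumented.end());
    instrumented.erase(std::unique(instrumented.begin(), instrumented.end()), instrumented.end());
    if (instrumented.empty())
        return 0.0;
    assert(end_line > instrumented.back());

    long total = 0;
    long covered = 0;
    for (std::size_t i = 0; i < instrumented.size(); ++i)
    {
        const int begin = instrumented[i];
        const int end = (i + 1 < instrumented.size()) ? instrumented[i + 1] : end_line;
        total += end - begin;
        if (hit.count(begin))
            covered += end - begin;
    }
    return 100.0 * covered / total;
}
```

For example, with blocks starting at lines 10, 14 and 20, a file ending before line 26, and hits on the blocks at 10 and 20, the spans are 4, 6 and 6 lines, so 10 of 16 lines count as covered.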

myrrc and others added 6 commits February 15, 2021 02:48

Initial: added the sanitizer callbacks

2982237

moved coverage to base/common temp add Small bugfixes regarding old-style casts and variable renames Removed old include Removed old declaration Added the initial dump callback Added the definition to hide the sanitizer callbacks

Merge remote-tracking branch 'upstream/master' into ba-thesis

1822d40

Initial solution that writes every call to a single file with test_id -1

0ad508c

Exiting in case of file open error

4bdbe2b

Status message when building with coverage with clang

a258466

Trying to fix the segfaults

5b20ba6

myrrc added 4 commits February 15, 2021 21:47

Removing the coverage.cpp source include if WITH_COVERAGE=0

72216e7

Adding the split declaration version

36f500c

Some other attempts

919a5d2

Removed harmful library from WITH_COVERAGE build,

483599d

maybe it causes such bugs. fix amend finalizing fix

myrrc force-pushed the ba-thesis branch from 6946446 to 483599d Compare February 16, 2021 17:35

myrrc force-pushed the ba-thesis branch from 7dbec2a to 4ae5de1 Compare February 16, 2021 18:48

Multiple attempts to determine which way of inclusion

c20d19f

is better for build with and without code coverage

myrrc force-pushed the ba-thesis branch from ec4a309 to c20d19f Compare April 5, 2021 21:26

myrrc added 3 commits April 6, 2021 00:27

Merge remote-tracking branch 'upstream/master' into ba-thesis

103177b

Possible solution: include files only if coverage is on, hide under m…

80b69f4

…acro

Macro replacement to hide the error

f3e3142

filimonov added altinity and removed altinity labels Apr 6, 2021

robot-ch-test-poll2 added the submodule changed label Apr 7, 2021

Changed the sanitize options into a per-cmake-target basis

b1f5658

as clang's sanitizer blacklist options are broken

myrrc force-pushed the ba-thesis branch from da0ee2f to b1f5658 Compare April 7, 2021 17:39

Working proto -- casted BB addesses are stored in text format in cove…

87606ec

…rage/ folder

myrrc force-pushed the ba-thesis branch from 4c2f042 to 87606ec Compare April 7, 2021 22:29

Added test symbolizer

1be592f

Exceptions -> asserts

b5aa80c

myrrc force-pushed the ba-thesis branch from b343713 to b5aa80c Compare July 2, 2021 18:03

myrrc added 6 commits July 3, 2021 15:00

Merge remote-tracking branch 'upstream/master' into ba-thesis

dfb3b8b

Split coverage scripts

2b79d16

Re-adding task queue

90667d9

Simplification

4106651

More low-level report writing with mmap

f31e1ce

Removing unused

b999b27

myrrc force-pushed the ba-thesis branch from d6a8aeb to b999b27 Compare July 5, 2021 20:01

Added ch-test support for signals

329c1ab

myrrc force-pushed the ba-thesis branch from 548f2d1 to 329c1ab Compare July 8, 2021 14:44

myrrc added 3 commits August 5, 2021 16:01

Resizing report at testing end

e902e0d

Making coverage Writer constexpr

4f8c48f

Merge remote-tracking branch 'upstream/master' into ba-thesis

548218e

myrrc force-pushed the ba-thesis branch from 8429503 to 548218e Compare August 5, 2021 14:27

myrrc added 3 commits August 5, 2021 17:43

Changing tester to reflect separated PR

e440314

Merge branch 'improvement/tester-changes', remote-tracking branch 'up…

0dfc998

…stream/master' into ba-thesis

Merge branch 'bugfix/error-parsing-proc-meminfo', remote-tracking bra…

8135c26

…nch 'upstream/master' into ba-thesis

myrrc added 2 commits August 8, 2021 00:41

Removing advices to kernel as they slow down everything

ff8f002

Merge remote-tracking branch 'upstream/master' into ba-thesis

7545b7c

myrrc force-pushed the ba-thesis branch from 782f813 to 7545b7c Compare August 7, 2021 21:41

myrrc closed this Aug 10, 2021

alexey-milovidov mentioned this pull request Jan 15, 2022

sql mutation testing tool #33640

Closed

alexey-milovidov mentioned this pull request Oct 29, 2023

Granular code coverage with introspection #56102

Merged

alexey-milovidov assigned alexey-milovidov and unassigned kitaisreal Nov 21, 2023
