Run only the tests which contains some mutation #50

gnieto · 2018-03-08T23:50:50Z

Attempt to fix issue: #28

I'd like to get some feedback before going any further with this.

As a first step, all the mutations are instrumented to call
::mutagen::report_coverage(). If an environment variable
(MUTAGEN_COVERAGE) is set, the first time this method is called it
writes to a specific file to record that some mutation has been hit.
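
A minimal sketch of this "first call writes a marker file" behaviour, not the code in this PR. It assumes that MUTAGEN_COVERAGE holds the path of the marker file. That assumption, the `report_with` helper and the `hit` line written to the file are all illustrative.

```rust
use std::env;
use std::fs::OpenOptions;
use std::io::Write;
use std::path::PathBuf;
use std::sync::Once;

// Assumption: the env var carries the marker file path.
const COVERAGE_ENV: &str = "MUTAGEN_COVERAGE";

// Guards the slow path so it runs at most once per process.
static REPORTED: Once = Once::new();

/// Hypothetical helper: `marker` is only evaluated on the first call.
fn report_with<F: FnOnce() -> Option<PathBuf>>(marker: F) {
    REPORTED.call_once(|| {
        // No path means we are not in a coverage run: do nothing.
        if let Some(path) = marker() {
            if let Ok(mut f) = OpenOptions::new().create(true).append(true).open(&path) {
                let _ = writeln!(f, "hit");
            }
        }
    });
}

/// What instrumented code would call; cheap after the first call.
pub fn report_coverage() {
    report_with(|| env::var_os(COVERAGE_ENV).map(PathBuf::from));
}

fn main() {
    // Run as: MUTAGEN_COVERAGE=/tmp/marker ./demo
    report_coverage();
    report_coverage(); // no-op: the Once has already fired
}
```

With `Once`, every later call reduces to a check of the already-completed state, so the env var lookup and the file write happen at most once per test process.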

On the other side, cargo-mutagen lists all the tests that the binary
contains and executes each of them individually with that env var set.
After executing each test, we check whether the file has been created.
If it exists, we know that the executed test runs code that is mutated.
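
The detection loop could look roughly like the sketch below. The function names are hypothetical. `run_test` stands in for spawning the test binary, and `spawn_single_test` shows one way to do that with the standard test harness's `--exact` filter, again assuming the env var carries the marker path.

```rust
use std::fs;
use std::path::Path;
use std::process::{Command, Stdio};

/// Runs every test on its own and returns those that left the marker behind.
pub fn tests_hitting_mutations<F>(tests: &[&str], marker: &Path, mut run_test: F) -> Vec<String>
where
    F: FnMut(&str, &Path),
{
    let mut covered = Vec::new();
    for &name in tests {
        let _ = fs::remove_file(marker); // start each test from a clean state
        run_test(name, marker);
        if marker.exists() {
            covered.push(name.to_string());
        }
    }
    let _ = fs::remove_file(marker);
    covered
}

/// One process per test: run exactly `name` from the compiled test binary.
#[allow(dead_code)]
pub fn spawn_single_test(test_bin: &Path, name: &str, marker: &Path) {
    let _ = Command::new(test_bin)
        .arg(name)
        .arg("--exact")
        .env("MUTAGEN_COVERAGE", marker)
        .stdout(Stdio::null())
        .stderr(Stdio::null())
        .status();
}

fn main() {
    let marker = std::env::temp_dir().join(format!("mutagen_marker_{}", std::process::id()));
    let tests = ["parses_input", "adds_numbers"];
    // Fake runner: pretend only `adds_numbers` reaches instrumented code.
    let covered = tests_hitting_mutations(&tests, &marker, |name, marker| {
        if name == "adds_numbers" {
            fs::write(marker, b"hit").unwrap();
        }
    });
    println!("{:?}", covered);
}
```

In a real run the closure would be `|name, marker| spawn_single_test(&test_bin, name, marker)`, with `test_bin` pointing at the compiled test executable.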

Finally, when running mutations, we run only the tests that execute
mutated code (unfortunately, we need to run the tests one by one, which
means that we spawn a process per test instead of a process per test
suite). As a result, the coverage flag is only useful when the ratio of
tests that execute mutated code is low.

To test with coverage, after updating to the new version of the plugin
on the target project, and after reinstalling the runner:

cargo mutagen -- --coverage

Pending tasks would be (maybe new issues can be opened):

- Improve the "ping" system with something better than a file
- Maybe we can pass the mutation number to the report_coverage method, in order to know which mutations took place in a given test. If we do, we can filter more accurately which tests should be executed for each mutation
- Maybe we can think about using clap to parse arguments, add subcommands, help, ...
- ~~Document the cargo mutagen command~~

llogiq · 2018-03-09T00:04:41Z

plugin/src/lib.rs

@@ -187,6 +187,8 @@ impl<'a, 'cx> MutatorPlugin<'a, 'cx> {

                    mut_expression = quote_expr!(self.cx,
                                    {
+                                        ::mutagen::report_coverage();


Why don't we add $n as an argument? Or even $n, $n + x (for some value of x if we have multiple mutations)? Because this way, we can not only detect whether mutations have been active at all, but also which mutations are covered, allowing us to restrict the set of tests to run per mutation even further.

Added on: f9e543c
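
To illustrate what passing the mutation number buys us, here is a small sketch (not the code from the commit above). Once each test reports the set of mutation ids it hit, the runner can invert that into a map from mutation id to the tests worth running. The function name and the data shape are assumptions.

```rust
use std::collections::{BTreeMap, BTreeSet};

/// `coverage` holds, for each test, the mutation ids reported while it ran.
/// Returns, for each mutation id, the tests that need to run for it.
pub fn tests_per_mutation(coverage: &[(&str, BTreeSet<usize>)]) -> BTreeMap<usize, Vec<String>> {
    let mut by_mutation: BTreeMap<usize, Vec<String>> = BTreeMap::new();
    for (test, ids) in coverage {
        for &id in ids {
            by_mutation.entry(id).or_default().push((*test).to_string());
        }
    }
    by_mutation
}

fn main() {
    let coverage = vec![
        ("test_add", BTreeSet::from([1, 3])),
        ("test_sub", BTreeSet::from([3])),
        ("test_fmt", BTreeSet::new()), // never reaches mutated code
    ];
    for (id, tests) in tests_per_mutation(&coverage) {
        println!("mutation {}: {:?}", id, tests);
    }
}
```

A mutation id that is missing from the map is not exercised by any test, so no test needs to be run for it.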

llogiq · 2018-03-09T00:12:10Z

Perhaps we should insert a static _mutation_$n: AtomicBool to restrict writing to the file to the first time around or something. clap is cool, but I'm personally a big fan of structopt. If the process per test is too costly, we may want to look into replacing the test runner with our own (sort of like stainless or whatever RFC #2318 will afford us).

llogiq

All in all this is great work!

llogiq · 2018-03-10T06:55:19Z

src/coverage.rs

+        return
+    }
+
+    match COVERAGE_ALREDY_CHECKED.lock() {


This is probably funny, but I find this to be a bit heavy. 😄

We could generate one static AtomicBool per mutation and use that.

Thanks for the feedback, addressed in: 648be7c

I needed an extra env var to be able to create a vector that is long enough. If there's an alternative, let me know.
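
For readers following along, this is roughly the shape of the approach, as a sketch rather than the code in that commit. The env var name MUTAGEN_COUNT and the helper names are made up for illustration.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::OnceLock;

// One flag per mutation, allocated lazily on first use.
static HITS: OnceLock<Vec<AtomicBool>> = OnceLock::new();

/// Hypothetical helper: `count` is only evaluated when the vector is created.
fn hits_with(count: impl FnOnce() -> usize) -> &'static [AtomicBool] {
    HITS.get_or_init(|| (0..count()).map(|_| AtomicBool::new(false)).collect())
}

/// Assumption: the runner exports the total number of mutations.
fn mutation_count_from_env() -> usize {
    std::env::var("MUTAGEN_COUNT")
        .ok()
        .and_then(|v| v.parse().ok())
        .unwrap_or(0)
}

/// Returns true only the first time mutation `n` is reported.
pub fn first_hit(n: usize) -> bool {
    match hits_with(mutation_count_from_env).get(n) {
        // `swap` returns the previous value, so only the first caller sees `false`.
        Some(flag) => !flag.swap(true, Ordering::Relaxed),
        None => false, // out of range: treat as already handled
    }
}

fn main() {
    // Run as: MUTAGEN_COUNT=8 ./demo
    println!("{}", first_hit(0));
    println!("{}", first_hit(0));
}
```

The vector replaces a Mutex plus HashMap: after the one-time allocation, each call is a bounds check and a single atomic swap.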

llogiq · 2018-03-11T07:07:40Z

There are two other options:

- At the start of first_block, store the current mutation count and a generated symbol for the function. Then, at the end of first_block, take the difference between the current count and the stored count, and prepend a statement to the block declaring a static AtomicBool array of size current_count - initial_count. You can then use this array from within the block (indexing at current_count - initial_count).
- Generate one symbol per mutation and prepend the static AtomicBools at the end of first_block.

Both options get rid of the allocation and the env var.
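
A hand-written sketch of what the first option's generated code might look like for a function containing two mutations. All names (`MUTAGEN_HIT`, `FIRST_ID`, `record`) and the ids 7 and 8 are invented for illustration; the real plugin would use gensymed identifiers.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::Mutex;

// Stand-in for the slow reporting path (file write, socket, ...).
static REPORTED: Mutex<Vec<usize>> = Mutex::new(Vec::new());

fn record(id: usize) {
    REPORTED.lock().unwrap().push(id);
}

pub fn add_one(x: usize) -> usize {
    // Prepended by the plugin: one flag per mutation found in this body.
    const UNSEEN: AtomicBool = AtomicBool::new(false);
    static MUTAGEN_HIT: [AtomicBool; 2] = [UNSEEN; 2];
    // Mutation count when first_block started on this function.
    const FIRST_ID: usize = 7;

    // Mutation 7: index is current_count - initial_count = 0.
    if !MUTAGEN_HIT[7 - FIRST_ID].swap(true, Ordering::Relaxed) {
        record(7);
    }
    let y = x + 1;
    // Mutation 8: index 1.
    if !MUTAGEN_HIT[8 - FIRST_ID].swap(true, Ordering::Relaxed) {
        record(8);
    }
    y
}

fn main() {
    for i in 0..3 {
        add_one(i);
    }
    println!("{:?}", REPORTED.lock().unwrap());
}
```

The array lives in the binary's data section, so there is no allocation and no need to know the global mutation count at run time.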

gnieto · 2018-03-11T19:49:09Z

@llogiq If I understand correctly, you suggest adding static variables in the scope of the mutated code (so, added via first_block, which, as far as I can see, is called with the body of any function/method/... that is mutated).

Then, you suggest adding static local variables in that scope. But does it make sense? If I'm not wrong, the generated code will be something like:

pub fn test() -> usize {
    static COVERAGE_CHECKED: AtomicBool = AtomicBool::new(false);
    <original_code>
}

But if the test function is called more than once, it will create and drop the value every time it is called, as far as I can see. In other words: does the static annotation make sense if it's not added at the global scope? I think I'm missing something :/

Thanks!

llogiq · 2018-03-11T20:19:13Z

Statics are embedded directly in the binary and are not initialized at run time, see an example.

llogiq · 2018-03-11T20:20:19Z

(so in our extended case, we'd have a static $gensymed_arr : [AtomicBool ; $number_of_mutations] = [ATOMIC_BOOL_INIT; $number_of_mutations];)
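
A tiny self-contained illustration of that point (not the linked example): a `static` declared inside a function has only its name scoped to the function, while its storage lives for the whole program, so the value survives across calls.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

pub fn calls_so_far() -> usize {
    // Initialized at compile time, never re-created or dropped per call.
    static CALLS: AtomicUsize = AtomicUsize::new(0);
    CALLS.fetch_add(1, Ordering::Relaxed) + 1
}

fn main() {
    for _ in 0..3 {
        println!("{}", calls_so_far()); // prints 1, then 2, then 3
    }
}
```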

gnieto · 2018-03-11T22:26:38Z

Thanks again for the tip and the info about local static vars!

I've been implementing it (you can check it here: 64faf7a), but I got the impression that the code is getting more complicated and I'm not sure the complexity is worth it.
In fact, on the referenced commit, I'm getting errors when mutating code, as quote_expr!(...) is evaluated before the new quote_stmts (the ones which define the new static variables) have been inserted in fold_first_block.

I was wondering if the code as it is now is enough or if it still needs refinement.

llogiq · 2018-03-12T07:05:33Z

I'm ok with merging now and refining later.

I'd just like to note that the reason I'm so worried about performance of this code is that it will be executed once for every call of the mutated method, and we don't know how often the tests will call it. So I don't worry much about the one-time cost (although I must confess that I've looked into using sockets instead of files), but the code that potentially gets run a lot should be fast.

To quell your errors, I think you could quote the static statement first (perhaps with a ! initializer), and insert the correct array initializer later.

llogiq · 2018-03-12T11:11:57Z

I just merged so we can continue other development without too much hassle – I will try to optimize things later.

llogiq reviewed Mar 9, 2018

gnieto force-pushed the coverage branch from e78870f to f9e543c Compare March 9, 2018 23:51

llogiq approved these changes Mar 10, 2018

gnieto added 4 commits March 11, 2018 23:23

Track mutations hit by each test

6b10640

Instead of tracking whether a test has hit any mutation, we now track the exact mutations that have been hit. Now we can execute ONLY the tests that are affected by a given mutation identifier.

Explain how to execute with coverage on README

6451291

Use a vec of AtomicBool instead of Mutex+HashMap

b4345ce

gnieto force-pushed the coverage branch from 648be7c to b4345ce Compare March 11, 2018 22:24

llogiq merged commit d3b000b into llogiq:master Mar 12, 2018

llogiq mentioned this pull request Mar 12, 2018

Coverage: Run only tests that actually execute any mutated method #28

Closed

gnieto deleted the coverage branch April 17, 2018 21:00
