proposal and preview for mutagen v0.2 #149

samuelpilz · 2019-07-16T14:04:21Z

As discussed in #142, this is a proposal for a modular architecture, which is the base for mutagen v0.2. I reviewed all parts of the code and tried to move code that works fine to the new system.

This is a minimal implementation to demonstrate the effectiveness of the architecture. The steps towards releasing mutagen v0.2 are:

implement more mutators and document their functionality
generate meaningful reports

Are there any other major tasks for this PR?

…e for mutators

Now, the code about a single mutator is in a single file

samuelpilz · 2019-07-20T21:18:24Z

I have done quite a few changes to the architecture since the first commit. I am quite happy with the current state.

I added a crate mutagen-core that contains most code. This crate is imported from mutagen-transform and mutagen. This makes re-use of code a lot easier and still meets the requirement of a standalone crate for the procedural macro.
The code to transform expressions is defined in the same file with their mutators.
The hack for exhaustive testing has beed replaced by the module mutagen-selftest, which provides the same functionality in a cleaner way.
I added an example-crate with well-known functions that help demonstrate the use of #[mutate] and the functionality of the runner. The runner can be called with cargo run --bin cargo-mutagen --package cargo-mutagen from the examples/simple directory

llogiq · 2019-07-22T12:32:11Z

README.md


-It also will only see the bare AST, no inferred types, no control flow or data flow, unless we analyse them ourselves. But not only that, we want to be *fast*.  This means we want to avoid doing one compile run per mutation, so we try to bake in all mutations into the code once and select them at runtime via a mutation count. This means we must avoid mutations that break the code so it no longer compiles.


Why are we losing this information? I feel it is a crucial piece to understand the design decisions for mutagen, which haven't really changed with your redesign.

This information is indeed important. I thought I kept this information somewhere. I will add this paragraph back into the documentation and the readme.

In the newest commits this section is back in the readme. The documents about the architecture and design decisions also contain paragraphs on this topic.

llogiq · 2019-07-22T12:33:26Z

README.md


-You can run `cargo mutagen -- --coverage` in order to reduce the time it takes to run the mutated code. When running on this mode, it runs the testsuite at the beginning of the process and checks which tests are hitting mutated code. Then, for each mutation, instead of running the whole testsuite again, it executes only the tests that are affected by the current mutation. This mode is specially useful when the testsuite is slow or when the mutated code affects a little part of it.
+*Use `mutagen` as `dev-dependency`, unless otherwise necessary.* Compiling `mutagen` is time-intensive and library-users should not have to download `mutagen` as a dependency.


Perhaps we should advise users to use sccache?

I have no experience with that. I wanted to say that if a library uses mutagen, users of that library should not have to compile mutagen since it is only used for internal tests.

For testing purposes, the compile-time is fine. I guess I have to rewrite this section to avoid confusion about this topic.

Updated in the newest commits. I think the new version is the best reason for using mutagen as a dev-dependency: "ensures nothing will land in released code"

mutagen-core/src/mutator/mutator_binop_add.rs

docs/mutators/lit_int.md

samuelpilz · 2019-07-27T17:31:09Z

In the newest commit, I implemented another mutator that mutates comparison operators or PartialOrd into each other. I have some of other mutators planned, but I want to look at the current structure a bit and try to find rough edges in the design before implementing more mutators.

At this stage, it is still possible to change the architecture of everything. Do you have any requests / suggestions?

.travis.yml

mutagen-core/src/lib.rs

llogiq · 2019-08-12T15:22:45Z

mutagen-core/src/optimistic/add_to_sub.rs

+    type Output = <L as Add<R>>::Output;
+
+    default fn may_sub(self, _r: R) -> <L as Add<R>>::Output {
+        panic!("not sub");


Why do we panic here?

As far as I understand:

If the optimistic assumption fails, the mutant should not be considered survived.

To be able to kill the mutant, the test suite has to fail.

To fail the test suite, the code must have some different behavior when activating the mutant.

I believe panicking is the only/easiest way to cause behavior that is different from the non-mutated case. However, I am open to other options.

In the future, I would like to read the output of the test-binary and detect if a certain panic-message has been printed. Based on that, the mutant could be considered "failed" instead of "killed" or "survived".

Ok, that makes sense. I had envisioned a backchannel (an mmap, socket or simple file) to inform the test runner of the ineffective mutation on the no-mutation-run, so the runner could mark the mutation as ineffective. However, there is actually a tri-state at play here, because such a mutation could be within a generic function that monomorphizes to any number of ineffective mutations. Thus, we should be able to both mark an optimistic mutation as (potentially) ineffective and as effective (at least we should do so if it was marked as ineffective before and succeeds nonetheless).

I should add that in-band messaging is inherently problematic and I would oppose misusing panics for that purpose. What if a user has a custom panic message that is inadvertently parsed as a killed mutant?

I see that using panics might not be the cleanest solution. I will work on some backchannel for coverage-analysis. The information about optimistic assumptions can be included there.

However, I think that not necessarily part of a first working release, which I would like to finish in early September.

I did not think about optimistic mutantions in generic functions. I will investigate some issues here.

I'm fine with that. Currently we just write a file on the nonmutated run, but there are likely faster and leaner solutions.

llogiq · 2019-08-12T15:24:08Z

Sorry for letting you wait for so long. I was on vacation. This looks pretty good so far, I only commented on a few questions I had.

I think this should be ready to merge once we reach feature parity with the previous version. Or would you like this to be merged sooner?

generated mutants

samuelpilz · 2019-09-15T20:22:30Z

Your suggestion is very promising. However, the that approach duplicates the input expressions, which is not allowed when the expressions move something out of a variable. I've not been able to use this idea to get a working transformation in all cases.

samuelpilz · 2019-09-22T12:10:06Z

I am quite happy with the current layout of the report and consider this as done for now. For the time being, I am also happy with the mutators implemented so far. I implemented mutators for numeric operations +, -, * and / as well as the bit-operations &, |, ^.

After that, I would like to publish the current state as mutagen-0.2. I added you to the authors in the Cargo.tomls, which was not the case during my mutagen-preview proposal. Is this ok? If necessary, I would write a post about the release. I will also improve my repository mutagen-applied that tries to show how mutagen could be applied to existing libraries.

llogiq · 2019-09-22T15:50:29Z

I'm currently very short on time, but will review soon. I'd also like to see some docs regarding open tasks for the future.

samuelpilz · 2019-10-05T14:13:28Z

I wrote a short list of in beyond mutagen-0.2. These tasks could give a roadmap for the mid-term future of mutagen.

llogiq · 2019-10-19T10:54:03Z

👍

samuelpilz · 2019-10-31T10:09:45Z

During my work for mutagen-applied, I found some more issues when applying mutagen to the crate bytes:

There is a line std::process::abort(); and its output type of ! is required.

The previos version rewrote the statement into if [...] {std::process::abort();}, which has type ().

I introduced a fix by generating the following code: if [...] {std::process::abort();} else {::[...]::stmt_call_to_none()}, which uses similar tricks to other optimistic mutators.

This also fixes the issues discussed above for f(return 1)-like cases. Further, I labelled the mutator for deleting such statements as optimistic.

samuelpilz · 2019-10-31T23:49:55Z

I just realized a major issue. Currently, we suggest adding #[cfg_attr(test, mutate)] to the code under test. Then, we run all test suites found by cargo test. However, the test flag is enabled for the unit tests directly in the lib only. It is not enabled for other tests (e.g. files from tests folder).

As I understand it: It is impossible for the macro mutate to be applied in integration tests while having mutagen in the dev-dependencies section

I see several ways to resolve this. I am also happy to discuss other approaches.

only consider unit-tests as executed by cargo test --lib for mutation coverage and stick with the current suggestions.
suggest that mutagen is feature gated. This implies a few things:
- the crate has to introduce a new feature (e.g. mutationtest, suggested name can be discussed but cannot be named mutagen)
- the dependency mutagen must be a real dependency, not just a dev-dependency and should be optional
- the feature has to imply the dependency mutagen
- the mutate attribute has to be guarded with #[cfg_attr(feature="mutationtest",mutate)]
- the runner must enable the feature for the tests, hence it must know the name of the feature either by convention or configuration
- It is no longer guaranteed by cargo that no mutated code is shipped with production

short example for a possible suggested Cargo.toml configuration

[features]
mutationtest = ["mutagen"]
[dependencies]
mutagen = {optional = true}

I am not sure where to go with this and which direction is the least dangerous or most useful.

llogiq · 2019-11-01T21:28:20Z

I would take the current approach and lobby the rust devs to set the test flag also when running integration tests.

In other news, David Tolnay has documented a way to hack together specialization on stable. I'm currently trying to introduce this to overflower, and it would fit well with mutagen, too.

samuelpilz · 2019-11-02T10:47:00Z

I am fine with the current approach. I am currently refactoring the runner to respect coverage measures of each testsuite individually.

However, I believe that the choice to disable the test flag and dev-dependencies inside the integration tests is based on the fact that they should test the crate "as any outside crate would". If possible, I will not do the lobbing necessary, but am happy with you or anyone else doing it.

I will try David Tolnay's method. However, we still require proc_macro_span feature for getting the filenames and locations for mutations. The tracking issue rust-lang/rust#54725 has made little progress. If we need to be on nightly anyway, I would like to stick to the "official but unstable" approach instead. If it is possible to provide the desired functionality without any unstable features, I will invest the effort to cut them all.

samuelpilz · 2019-11-05T07:31:45Z

I uncovered another issue. (see Post in users.rust-lang). This means that type inference rules can cause compilation errors in code that is not transformed by the mutagen. How should we deal with this?

samuelpilz · 2019-11-17T14:52:04Z

I did some improvements on the printed report and the running mechanism. My tests on mutagen-applied have been successful. Integrating mutagen into an existing project is not completely automatic and requires minimal amount of manual intervention in some cases. However, this effort is far smaller than the analysis of the mutationtest report, which is obviously a manual process.

I would like to merge and publish my proposed version of mutagen as v0.2. Is this possible within November?

llogiq · 2020-01-10T13:59:53Z

I finally found the time to review this. Good job and thank you!

samuelpilz added 7 commits July 16, 2019 15:47

proposal and preview for mutagen v0.2 using a new modular architectur…

d798211

…e for mutators

fix travis script

f15599e

move mutators to core-crate and integration-tests into own crate

295d7bb

add more example-functions

d99076b

add mutator binop_eq for == patterns

9bd3f97

move transformers into core-crate

9afef82

merge files of mutators and transformers

098f3ed

Now, the code about a single mutator is in a single file

llogiq reviewed Jul 22, 2019

View reviewed changes

mutagen-core/src/mutator/mutator_binop_add.rs Outdated Show resolved Hide resolved

add multiple mutations for lit_int mutator

8efd909

llogiq reviewed Jul 23, 2019

View reviewed changes

docs/mutators/lit_int.md Outdated Show resolved Hide resolved

samuelpilz added 4 commits July 25, 2019 13:22

document the implications of the use of macros for mutation testing

f5cbf31

clarify explanation for using mutagen as a dev-dependency

87a4713

fix error message when mutations-file is missing

1687d22

implement mutator binop_cmp and improve tests

379f8a0

refactor transformer code

e68d7f6

samuelpilz commented Aug 8, 2019

View reviewed changes

.travis.yml Outdated Show resolved Hide resolved

add mutation != to ==

0179c1d

llogiq reviewed Aug 12, 2019

View reviewed changes

mutagen-core/src/lib.rs Outdated Show resolved Hide resolved

llogiq reviewed Aug 12, 2019

View reviewed changes

mutagen-core/src/lib.rs Outdated Show resolved Hide resolved

llogiq reviewed Aug 12, 2019

View reviewed changes

samuelpilz added 5 commits August 13, 2019 12:19

refactor argument parsing and include a check for the number of

4f18da7

generated mutants

remove unneeded features

556fba5

fix travis file

f239825

install rustfmt in traivs-ci

c212583

implement mutator binop_bool

a70ab8d

samuelpilz added 2 commits September 21, 2019 22:11

implement mutator binop_add

05e5d14

implement mutator binop_bit

31ca55c

write about future plans

bf66d62

samuelpilz added 3 commits October 18, 2019 15:56

treat missing coverage file as empty

986e4ae

do not mutate static items

c584965

document that static expressions are not mutated

ac3dfbe

samuelpilz added 2 commits October 21, 2019 12:25

avoid move in mutator binop_cmp

55f48a3

fix issues in stmt_call mutator and add documentation

3c5743d

samuelpilz added 4 commits October 31, 2019 16:08

add central function for reporting failed optimistic mutators

80c979a

run rustfmt

5c72b07

improve num-expr detection

ec2ae8d

add example with integration tests

7583c79

try alternative mutagen-setup for example-project

abc51b7

record coverage per testsuite and improve documentation

32cd22c

improve mutation report

0010720

Simplify command suggestion in "test structure" document

963f2e6

llogiq merged commit c151d76 into llogiq:master Jan 10, 2020

samuelpilz mentioned this pull request Jan 13, 2020

Mutagen architecure proposal #142

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

proposal and preview for mutagen v0.2 #149

proposal and preview for mutagen v0.2 #149

samuelpilz commented Jul 16, 2019 •

edited

samuelpilz commented Jul 20, 2019

llogiq Jul 22, 2019

samuelpilz Jul 22, 2019

samuelpilz Jul 25, 2019

llogiq Jul 22, 2019

samuelpilz Jul 22, 2019

samuelpilz Jul 25, 2019 •

edited

samuelpilz commented Jul 27, 2019

llogiq Aug 12, 2019

samuelpilz Aug 12, 2019

llogiq Aug 12, 2019

llogiq Aug 12, 2019

samuelpilz Aug 13, 2019

llogiq Aug 13, 2019

llogiq commented Aug 12, 2019

samuelpilz commented Sep 15, 2019

samuelpilz commented Sep 22, 2019

llogiq commented Sep 22, 2019

samuelpilz commented Oct 5, 2019

llogiq commented Oct 19, 2019

samuelpilz commented Oct 31, 2019

samuelpilz commented Oct 31, 2019

llogiq commented Nov 1, 2019

samuelpilz commented Nov 2, 2019

samuelpilz commented Nov 5, 2019

samuelpilz commented Nov 17, 2019

llogiq commented Jan 10, 2020


		It also will only see the bare AST, no inferred types, no control flow or data flow, unless we analyse them ourselves. But not only that, we want to be fast. This means we want to avoid doing one compile run per mutation, so we try to bake in all mutations into the code once and select them at runtime via a mutation count. This means we must avoid mutations that break the code so it no longer compiles.


		You can run `cargo mutagen -- --coverage` in order to reduce the time it takes to run the mutated code. When running on this mode, it runs the testsuite at the beginning of the process and checks which tests are hitting mutated code. Then, for each mutation, instead of running the whole testsuite again, it executes only the tests that are affected by the current mutation. This mode is specially useful when the testsuite is slow or when the mutated code affects a little part of it.
		Use `mutagen` as `dev-dependency`, unless otherwise necessary. Compiling `mutagen` is time-intensive and library-users should not have to download `mutagen` as a dependency.

proposal and preview for mutagen v0.2 #149

proposal and preview for mutagen v0.2 #149

Conversation

samuelpilz commented Jul 16, 2019 • edited

samuelpilz commented Jul 20, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

samuelpilz Jul 25, 2019 • edited

Choose a reason for hiding this comment

samuelpilz commented Jul 27, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

llogiq commented Aug 12, 2019

samuelpilz commented Sep 15, 2019

samuelpilz commented Sep 22, 2019

llogiq commented Sep 22, 2019

samuelpilz commented Oct 5, 2019

llogiq commented Oct 19, 2019

samuelpilz commented Oct 31, 2019

samuelpilz commented Oct 31, 2019

llogiq commented Nov 1, 2019

samuelpilz commented Nov 2, 2019

samuelpilz commented Nov 5, 2019

samuelpilz commented Nov 17, 2019

llogiq commented Jan 10, 2020

samuelpilz commented Jul 16, 2019 •

edited

samuelpilz Jul 25, 2019 •

edited