Implement Property Based Testing/ Data Generators #850

philsquared · 2017-03-10T17:13:08Z

This has been a long-standing promise/ point of discussion and a number of issues have been raised that amount to this, so I thought I'd write up the definite feature request to direct them all to.

There are two strands to this:

Data generators, or parametrised tests: i.e. you want to re-use the same core set of assertions with a range (possible a wide range) or different inputs - including the cross product of multiple possible inputs. In some cases you want to specifically control the inputs, in others generate it as an increasing range, or randomly. There's a variation where different types are used to parameterise. While strictly speaking that's a different feature I suspect it's all going to get tied up in them same solution, so I'll include that here too. We can always unbundle it later, if necessary.
Building on (1) is the idea of Property-Based Testing. This is a more formally structured approach to working with generated ranges of inputs and also includes features such as shrinking (where failing inputs are automatically reduced to the simplest/ smallest possible failure). The tests themselves are for properties which are usually invariants that should always hold - although sometimes an alternate (e.g. simple or reference) implementation is compared against.

Support for generators was experimentally added in the very early days of Catch, but were never progressed. In fact quite a lot of work was done in support of them, and a few alternate implementations started. The main obstacle was interaction with the SECTION tracking infrastructure. That code was reworked with generators in mind a couple of years ago, and a new proof-of-concept generators implementation was successfully written against it. However by that time the goal was full property-based testing and the path there seemed tortuous in a C++03 constrained context, so the effort was deferred to such a time as we could at least rebase on C++11 (and ideally would leverage C++14 and beyond to allow for things like Ranges v3 to be supported in the generators part). C++11 rebasing was decided for Catch2 - with generators/ PBT being one of the main motivators. At time of this writing a proof-of-concept implementation of Catch2 exists which consists of a mostly rewritten core. That work has paused while the "Catch Classic" backlog is tamed but will resume again soon - with generators and PBT being one of the first big features to be worked on next.

I'll keep this issue open until the feature is ready so others can be closed in favour of it.

The text was updated successfully, but these errors were encountered:

capsocrates · 2017-07-20T21:45:39Z

You asked me this in #558

I believe I have answered this query now (even if not entirely satisfactorily) - and objection to me closing it? (@capsocrates, if you're still watching?)

If you meant that Catch2 is an answer to the query, I suppose that's satisfactory enough. :)

Quincunx271 · 2017-08-17T07:52:50Z

I found myself needing this, so I made a workaround:

template <typename F, typename... Args>
void parameterized_test(F fn, Args&&... args)
{
    // this for_each should do the equivalent of Boost Hana's hana::for_each
    // Alternatively, see implementation after
    detail::for_each(std::forward_as_tuple(std::forward<Args>(args)...), fn);
}

And it is used like so:

parameterized_test(
    [](std::string parameter) {
        // rest of test code here - haven't tried using SECTION, though
    }, "arguments", "that", "are", "passed"
);

This is working because the assertion macros work even inside a function. The error message on where a macro failed is also useful because it shows the file/line of where the lambda is defined, rather than in my utility header. I also know that CAPTURE works.

This might be useful in figuring out how to do parameterized tests.

One drawback is that this doesn't allow multiple parameters, but that can be implemented through some heavier TMP, or worked around with tuples and C++17's bind expressions / std::tie

An implementation of for_each:

    template <typename Tuple, typename F, std::size_t... Is>
    void for_each_impl(Tuple&& tuple, F&& fn, std::index_sequence<Is...>)
    {
        int unused[] = {
            (std::invoke(std::forward<F>(fn), std::get<Is>(std::forward<Tuple>(tuple))), 0)...};
        (void) unused;
    }

    template <typename Tuple, typename F>
    void for_each(Tuple&& tuple, F&& fn)
    {
        for_each_impl(std::forward<Tuple>(tuple), std::forward<F>(fn),
                                    std::make_index_sequence<std::tuple_size_v<Tuple>>{});
    }

Trass3r · 2017-08-19T14:05:10Z

Looking forward to this! Especially now that Catch2 finally surfaces.

ArekPiekarz · 2017-10-03T18:30:09Z

@Quincunx271 Your version doesn't work with SECTIONs, because each section is executed only once.
So if you nest parameterized_test with multiple GIVEN, WHEN, THEN, some of them will be invoked for only the first parameter.

@philsquared Do you have any fuzzy estimation when this feature may be available in Catch2? Is it in a ballpark of a few months or closer to a year?

mlimber · 2017-10-23T15:28:13Z

If you tell me how to do this with the catch2 branch, I will try it out on my project and contribute fixes and documentation if possible.

horenmar · 2017-10-26T12:50:55Z

@ArekPiekarz So far most of our estimates were wildly wrong, so I am going to give "when it is done". Sorry.

@mlimber You can't yet.

ThirtySomething · 2018-02-10T11:13:22Z

Added myself to #850 after closing #1177

johnthagen · 2018-03-04T22:34:02Z

This is really the one remaining feature I'd love to see in Catch, and I think it would round out the framework really well in terms of the Don't Repeat Yourself (DRY) philosophy.

One framework that does parametrizing well is pytest. While Python is a much different beast than C++11 (dynamically typed, monkey patching, etc), the overall API style that they use for parameterization could be helpful to look at.

An example:

import pytest

@pytest.mark.parametrize("test_input,expected", [
    ("3+5", 8),
    ("2+4", 6),
    ("6*9", 42),
])
def test_eval(test_input, expected):
    assert eval(test_input) == expected

Properties of this API I like:

Minimal repetition when adding extra input data
Clearly associating the input data with a test case (in this case through a @property)
The input data can be named in a meaningful way

Leandros · 2018-03-08T11:49:18Z

Any progress here? Might also want to take a look at how GoogleTest does it: https://github.com/google/googletest/blob/master/googletest/docs/AdvancedGuide.md#how-to-write-value-parameterized-tests

greenrobot · 2018-06-10T20:00:35Z

There are so many approaches to this. Not sure if this was discussed yet: Like SECTION, Catch2 could offer something like INIT_SECTION, which would add another section dimension for init (setting up the test). Thus, each INIT_SECTION would run all SECTIONs.

philsquared · 2018-06-21T08:22:44Z

I know it's been a while, but I've started looking at this again.
I've got an initial implementation up and running on a local branch - but I wanted to bikeshed the syntax a bit before going further.

Here's one of my test cases:

TEST_CASE("Generators") {

    auto i = GENERATE( values( { "a", "b", "c" } ) );

    SECTION( "one" ) {
        auto j = GENERATE( range( 8, 11 ) << 2 );
        std::cout << "one: " << i << ", " << j << std::endl;
    }
    SECTION( "two" ) {
        auto j = GENERATE( 3.141 << 1.379 );
        std::cout << "two: " << i << ", " << j << std::endl;
    }
}

particular things to note:

Use of << as a "sequencing operator. It means: after generating the values on the left, continue by generating the values on the right
The syntax for the values generator. It takes an initialiser list - which means you get the double-braced syntax.
Sequencing individual values effectively adds a single value generator. Is this too magic? Or a big convenience for a common case?
Would improving (1) make (3) a complete substitute for (2)?
GENERATE acts similarly to SECTION in that the whole test case is re-entered for each value generated (and the cross-product of all other GENERATEs and SECTIONs) - in fact they run on the same underlying mechanism.
values and range are instances of generators which can be composed (in this case by sequencing with <<)
All generators (including composite, sequenced, generators) allow random ordering and random sub-ranges.
Additional generators can be user supplied (although this may initially be prohibited to allow more implementation flexibility until it beds in).

ArekPiekarz · 2018-06-21T18:27:39Z

@philsquared Could you make it clear what is the output of your proposed solution?

philsquared · 2018-06-22T13:15:27Z

Excellent point, @ArekPiekarz - that would help, wouldn't it - thanks :-)

one: a, 8
one: a, 9
one: a, 10
one: a, 11
one: a, 2
two: a, 3.141
two: a, 1.379
one: b, 8
one: b, 9
one: b, 10
one: b, 11
one: b, 2
two: b, 3.141
two: b, 1.379
one: c, 8
one: c, 9
one: c, 10
one: c, 11
one: c, 2
two: c, 3.141
two: c, 1.379

johnthagen · 2018-06-22T13:53:26Z

@philsquared

Use of << as a "sequencing operator

If the generated values have overloads for the ostream operator (for example, user defined types), does this change the behavior?

philsquared · 2018-06-22T14:02:32Z

No, the << overloads are on a Generator type:

template<typename T>
auto operator << ( Generator<T>&& g1, T const& val ) -> Generator<T> {
    return { std::move(g1), value( val ) };
}

philsquared · 2018-06-26T14:19:23Z

@johnthagen no - no change to the table examples

philsquared · 2018-06-26T14:20:29Z

Should work with any types that are copy/ movable.
You might need to specialise the range generator if your type doesn't support +

ekrieger001 · 2018-06-26T14:22:03Z

how would the report look like for such testcases? What would be the testcasenames?

philsquared · 2018-06-26T14:30:38Z

@ekrieger001 currently there is no change to the test names - but this relates to the open question, above, about capturing variable names.

I'm thinking that generated variable values should appear like section names. If we capture variable names. too, we could report them as "<name> = <value>". Otherwise we could do just "<value>", or something like, "For generated value(s): <first>, <second> ..." - where <first> etc would be the generated values.

One complication is that you might want to use the variable in a section name (as in the Cucumber example, earlier). In that case it would be a shame to have the value reported twice. Not sure there's an easy way around that.

myrgy · 2018-08-17T11:21:21Z

Hi guys,
would it be possible to use generators to test multiple implementations. which has no virtual interface - so I had to specify types.

like

TEST_CASE_T("Templated", std::vector<int>, std::queue<char>) {
  SECTION("push_back") {
    T s;
   s.push_back(1);
   }  
}

Thanks!

The support is to be considered experimental, that is, the interfaces, the first party generators and helper functions can change or be removed at any point in time. Related to #850

Quincunx271 · 2018-10-29T04:54:26Z

There's currently no built-in generator that just takes something like a std::vector<T>. That would be helpful, since currently the only easy way to reuse the values from one GENERATE(...) in another is to extract out a std::initializer_list<T>. Doing so, I've been running into lifetime bugs on clang-6.0 or less.

I'm imagining something like this: GENERATE(each(container.begin(), container.end()))

atomgalaxy · 2019-09-25T13:02:37Z

Phil, are there plans for the "" bits mentioned above?

horenmar · 2019-09-30T15:35:23Z

@Quincunx271 That's a good idea that sadly got lost -- I'll give it a try later.

@atomgalaxy Not sure which bits you mean.

atomgalaxy · 2019-09-30T16:03:39Z

@horenmar Sorry, github seems to have eaten the markup.

I mean the printing the current value idea that @philsquared details above:

@ekrieger001 currently there is no change to the test names - but this relates to the open question, above, about capturing variable names.

I'm thinking that generated variable values should appear like section names. If we capture variable names. too, we could report them as "<name> = <value>". Otherwise we could do just "<value>", or something like, "For generated value(s): <first>, <second> ..." - where <first> etc would be the generated values.

One complication is that you might want to use the variable in a section name (as in the Cucumber example, earlier). In that case it would be a shame to have the value reported twice. Not sure there's an easy way around that.

horenmar · 2019-10-02T17:23:26Z

There have been no changes to how generators are handled in regards to stringification, so they are still anonymous until the user stringifies them manually (e.g. via CAPTURE).

There are some examples on issue catchorg#850 of using this feature, but they are not easily found from the documentation. Adding them here as an example makes them more findable and ensures they keep working if the API changes.

There are some examples on issue #850 of using this feature, but they are not easily found from the documentation. Adding them here as an example makes them more findable and ensures they keep working if the API changes.

matthew-limbinar · 2021-04-08T13:24:05Z

@horenmar: Can you identify what is left on this feature request? Seems like it's potentially closable.

horenmar · 2021-04-11T12:57:43Z

@matthew-limbinar Everything that makes it actually property testing. There is data generation, but no case reduction, and no property testing loop support.

tsondergaard · 2022-11-15T22:34:37Z

@horenmar: Can you identify what is left on this feature request? Seems like it's potentially closable.

You cannot address a particular combination that you would like to run. Lets say that I break a particular case in a large set of generated combinations.

The reporter doesn't report the virtual section the test failure - if it did it would be easier to identify the failure
I cannot specify on the command line the specific test I would like to run. This can be a big disadvantage if running the test across all parameters take much longer than running just the single case that is failing

philsquared added Feature Request Catch2 labels Mar 10, 2017

This was referenced Mar 10, 2017

Run test case N times with N different inputs #558

Closed

Parametrized tests #423

Closed

Multiple-inputs test case macro #358

Closed

Support for templated tests? #46

Closed

ruipacheco mentioned this issue Mar 10, 2017

Environment setup()/teardown() #851

Closed

philsquared added this to the Catch2 milestone Mar 10, 2017

philsquared removed the Catch2 label Mar 10, 2017

This was referenced Aug 11, 2017

Documentation for GENERATE macro and generators? #988

Closed

Callable test case? #950

Closed

horenmar mentioned this issue Feb 8, 2018

Proposal: Dataproviders #1177

Closed

johnthagen mentioned this issue Mar 4, 2018

How to make parameterized tests? #1108

Closed

traversaro mentioned this issue Mar 6, 2018

Using Catch2 instead of our custom unit test system robotology/yarp#1575

Closed

philsquared self-assigned this Jun 21, 2018

JoeyGrajciar mentioned this issue Oct 28, 2018

Type parametrised test cases #1425

Closed

JoeyGrajciar mentioned this issue Nov 8, 2018

Type parametrised test cases v2 #1437

Merged

alibabashack mentioned this issue Mar 11, 2019

Add support for capturing lambdas in GENERATE() #1566

Closed

richardash1981 mentioned this issue Jun 19, 2020

Add an example of using GENERATE(table()) #1963

Merged

horenmar modified the milestones: Catch2, Catch2 + Feb 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Property Based Testing/ Data Generators #850

Implement Property Based Testing/ Data Generators #850

philsquared commented Mar 10, 2017

capsocrates commented Jul 20, 2017

Quincunx271 commented Aug 17, 2017 •

edited

Trass3r commented Aug 19, 2017

ArekPiekarz commented Oct 3, 2017

mlimber commented Oct 23, 2017

horenmar commented Oct 26, 2017

ThirtySomething commented Feb 10, 2018

johnthagen commented Mar 4, 2018 •

edited

Leandros commented Mar 8, 2018 •

edited

greenrobot commented Jun 10, 2018

philsquared commented Jun 21, 2018

ArekPiekarz commented Jun 21, 2018

philsquared commented Jun 22, 2018

johnthagen commented Jun 22, 2018

philsquared commented Jun 22, 2018

philsquared commented Jun 26, 2018

philsquared commented Jun 26, 2018

ekrieger001 commented Jun 26, 2018

philsquared commented Jun 26, 2018 •

edited

myrgy commented Aug 17, 2018

Quincunx271 commented Oct 29, 2018

atomgalaxy commented Sep 25, 2019

horenmar commented Sep 30, 2019

atomgalaxy commented Sep 30, 2019

horenmar commented Oct 2, 2019

matthew-limbinar commented Apr 8, 2021

horenmar commented Apr 11, 2021

tsondergaard commented Nov 15, 2022

Implement Property Based Testing/ Data Generators #850

Implement Property Based Testing/ Data Generators #850

Comments

philsquared commented Mar 10, 2017

capsocrates commented Jul 20, 2017

Quincunx271 commented Aug 17, 2017 • edited

Trass3r commented Aug 19, 2017

ArekPiekarz commented Oct 3, 2017

mlimber commented Oct 23, 2017

horenmar commented Oct 26, 2017

ThirtySomething commented Feb 10, 2018

johnthagen commented Mar 4, 2018 • edited

Leandros commented Mar 8, 2018 • edited

greenrobot commented Jun 10, 2018

philsquared commented Jun 21, 2018

ArekPiekarz commented Jun 21, 2018

philsquared commented Jun 22, 2018

johnthagen commented Jun 22, 2018

philsquared commented Jun 22, 2018

philsquared commented Jun 26, 2018

philsquared commented Jun 26, 2018

ekrieger001 commented Jun 26, 2018

philsquared commented Jun 26, 2018 • edited

myrgy commented Aug 17, 2018

Quincunx271 commented Oct 29, 2018

atomgalaxy commented Sep 25, 2019

horenmar commented Sep 30, 2019

atomgalaxy commented Sep 30, 2019

horenmar commented Oct 2, 2019

matthew-limbinar commented Apr 8, 2021

horenmar commented Apr 11, 2021

tsondergaard commented Nov 15, 2022

Quincunx271 commented Aug 17, 2017 •

edited

johnthagen commented Mar 4, 2018 •

edited

Leandros commented Mar 8, 2018 •

edited

philsquared commented Jun 26, 2018 •

edited