Extend PropEr with parallel execution #263

pablocostass · 2021-03-25T19:12:25Z

This PR extends PropEr with parallel execution. It is based on the work I did for my undergraduate thesis and has some limitations at the moment (e.g., targeted properties). Nonetheless, it should pretty much work (famous last words) and one can find the relevant options to test with parallel PropEr in the documentation of this branch, so please do take a look at it too.

The PR has two extra commits that I considered relevant but might be dropped upon request, those are:

Add verbosity to make test output (df7a7d8)
Adds verbosity when running the tests, as that way it is easier to check at what point parallel PropEr failed in the suite.
Add a workflow to test with parallel PropEr (4c81af9)
This commit adds a new workflow to be ran with Github Actions that uses 2 workers to run the test suite and ignores failures in the workflow; still, the warnings/failures are recorded by GA, so they will not be lost. This way, no PR would be blocked by some flakyness coming from parallel PropEr.

In short, this PR brings four new options to the table, all related to this new kind of execution:

{numworkers, <Non_negative_number>} sets the number of workers to use during testing.
pure (side effect free) or impure (with side effects) tells PropEr the kind of property it is going to test; impure properties start nodes to isolate the workers from the others and avoid possible test clashes.
{strategy_fun, <Strategy_function>} overrides the default function that divides the workload among the total of workers.
{stop_nodes, true | false} tells PropEr whether it should stop the nodes (if any was started) after finishing testing. This one is mainly used for proper:module/1,2, but it will probably be useful for some (e.g., build tools).

codecov-io · 2021-03-25T19:16:52Z

Codecov Report

Merging #263 (6bf55fe) into master (275c218) will increase coverage by 0.30%.
The diff coverage is 94.02%.

@@            Coverage Diff             @@
##           master     #263      +/-   ##
==========================================
+ Coverage   88.65%   88.95%   +0.30%     
==========================================
  Files          14       14              
  Lines        4408     4601     +193     
==========================================
+ Hits         3908     4093     +185     
- Misses        500      508       +8

Impacted Files	Coverage Δ
src/proper.erl	`88.92% <94.02%> (+1.58%)`	⬆️
src/proper_statem.erl	`94.67% <0.00%> (-0.39%)`	⬇️
src/proper_typeserver.erl	`79.90% <0.00%> (+0.11%)`	⬆️
src/proper_types.erl	`95.09% <0.00%> (+0.65%)`	⬆️
src/proper_arith.erl	`92.70% <0.00%> (+1.04%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 275c218...6bf55fe. Read the comment docs.

Before GitHub actions, we separated some lengthy (and flaky) examples tests out of `proper_tests` and placed them in a separate test target in the `Makefile`. For increased coverage, some of the examples were still tested in `proper_tests` as whole modules. Both for consistency and in preparation for using the examples as tests for #263, we now have a _single_ place (GitHub action) where the complete set of examples is tested.

Makefile

src/proper.erl

pablocostass · 2021-04-11T13:12:31Z

@kostis I had forgotten to add the new make target to the Github Action workflow, so I did that after rebasing on top of the latest changes on the main branch.

I see that the Github Action is still failing due to the cache, however this time it is because the current cache is from the main branch and in said branch we do not build the PLT file with tools too, so I guess that in 7 days from now on we will have to trigger another run :/

kostis · 2021-04-12T10:19:57Z

I've pushed a small change to the Makefile that subsumes the addition of tools to the PLT -- so please rebase.

In order to achieve some progress, perhaps it's a good idea to temporarily disable the -Wunknown from the dialyzer call so that testing can continue with the tests.

pablocostass · 2021-04-13T08:35:48Z

Yup, that commit of yours helped @kostis, thanks! I rebased and pushed with the addition of a commit that disables the unknown warning, as you suggested. We can drop it in the future whenever we want :)

codecov-commenter · 2021-04-25T16:32:56Z

Codecov Report

Merging #263 (a2497b2) into master (e3a12c4) will decrease coverage by 3.30%.
The diff coverage is 13.22%.

@@            Coverage Diff             @@
##           master     #263      +/-   ##
==========================================
- Coverage   88.47%   85.17%   -3.31%     
==========================================
  Files          14       14              
  Lines        4408     4586     +178     
==========================================
+ Hits         3900     3906       +6     
- Misses        508      680     +172

Impacted Files	Coverage Δ
src/proper.erl	`70.38% <13.22%> (-17.27%)`	⬇️
src/proper_statem.erl	`92.77% <0.00%> (-1.91%)`	⬇️
src/proper_gen.erl	`86.69% <0.00%> (-0.50%)`	⬇️
src/proper_typeserver.erl	`78.76% <0.00%> (-0.46%)`	⬇️
src/proper_erlang_abstract_code.erl	`94.24% <0.00%> (+0.25%)`	⬆️
src/proper_gen_next.erl	`77.41% <0.00%> (+0.26%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e3a12c4...a2497b2. Read the comment docs.

Modify the `user_opts` type and `opts` record to allow the possibility of spawning multiple processes to perform the tests Update `inner_test` function: each process is assigned a cerain number of tests to run from the total count. The main process aggregates the output received in case of success, otherwise returns early with the error or fail Some type specs were updated to reflect the changes needed to use the `spawn_link_migrate` function

This way, when using workers to distribute tests among processes, PropEr will first create a node for them to be spawned on, avoiding crashing the whole VM if something goes wrong.

…ach instead of lists:map

…sions When testing stateful programs, the node where the tests were being performed would crash because normally stateful programs use gen_servers or unique elements, which made processes modify each other's states. To avoid that now each process is spawned on a different node, which makes using many processes a bit more expensive.

As previously stated, for stateful properties each worker would need a node of his own to avoid clashing with the other workers, whereas on stateless properties all the workers could be performing tests on the same node. This commit fixes last one by starting only one node if it detects a stateless property is being tested, or a new node per worker in the case of a stateful property.

As `lists:zip/2` expects two lists of the same length, it would crash on stateful properties when `numtests` was less than `num_processes`. In that case, we only need to spawn `numtests` processes (since otherwise we would end up having idle workers).

Also did some refactor, mainly changin from *processes* to *workers* (and so `num_processes` to `num_workers`, etc), and added some documentation.

Also enforce a more strict pattern matching when trying to determine if a property is stateless or stateful.

Instead of adding a workflow to run the test suite with parallel PropEr, we should have a target in the Makefile to run the examples with it. So, this commit removes all references to the `num_workers` option in the suite except for the test case that runs the examples, `examples_are_ok_test_/0`, as we do want to test them in parallel and the test case will be removed in the future.

This commit can and will be dropped at a future point in time, and is only being added to help continue testing parallel PropEr in Github Actions.

…ng API

.github/workflows/ci.yml

src/proper.erl

kostis · 2021-05-11T16:47:36Z

src/proper.erl

+-spec parallel_perform(test(), opts()) -> imm_result().
+parallel_perform(Test, #opts{property_type = pure, numtests = NumTests,
+                             numworkers = NumWorkers, strategy_fun = StrategyFun} = Opts) ->
+    TestsPerWorker = StrategyFun(NumTests, NumWorkers),


I would suggest to factor out all lines between

TestsPerWorker = StrategyFun(NumTests, NumWorkers), .... ok = maybe_stop_cover_server([]),

into a separate function and use it both for this clause and the next one (by passing it [] or NodeList).

This will help both for maintainability and for understanding what is common and what is not.

You also need to construct and pass to this function the appropriate SpawnFun, of course.

kostis · 2021-05-11T16:53:22Z

src/proper.erl

+perform(Passed, NumTests, Test, Opts) ->
+    Size = size_at_nth_test(Passed, Opts),
+    put('$size', Size),
+    perform(Passed, NumTests, 3 * ?MAX_TRIES_FACTOR * NumTests, Test, none, none, Opts).


You should probably add a comment where this magic number 3 comes from... what is its role.

src/proper.erl

kostis · 2021-05-11T16:57:12Z

src/proper.erl

+
+%% @private
+-spec update_worker_node_ref({node(), {already_running, boolean()}}) -> list(node()).
+update_worker_node_ref(Node) ->


Node is a confusing variable name here...

src/proper.erl

kostis · 2021-05-11T17:01:02Z

src/proper.erl

+%% @doc Starts multiple (NumNodes) remote nodes.
+-spec start_nodes(non_neg_integer()) -> list(node()).
+start_nodes(NumNodes) ->
+    StartNode =


StartNode -> StartNodeFun

kostis

Do the suggested changes, and then I will merge this.

pablocostass · 2021-05-14T19:20:22Z

@kostis As the commit that had added the get_all_application_env() function also fixed some typos, I rebased to change it to only bring in the latter and also pushed some commits to address your comments.

As I am not really happy with how I did the refactoring of the common code that parallel_perform/2 had, I left that as a different commit. Review it and tell me if you thing something else should be changed.

kostis · 2021-05-15T07:31:22Z

src/proper.erl

+-spec parallel_perform(test(), opts()) -> imm_result().
+parallel_perform(Test, #opts{property_type = pure, numtests = NumTests,
+                             numworkers = NumWorkers, strategy_fun = StrategyFun} = Opts) ->
+    _ = maybe_start_cover_server([]),


This line / call is not needed here; it will be done by spawn_workers_and_get_result. (Or am I missing something?)

Oh no, that was totally my mistake, thanks for pointing it out!

Amended the last commit to remove the line.

pablocostass changed the title ~~Parallel proper~~ Extend PropEr with parallel execution Mar 25, 2021

kostis self-requested a review March 26, 2021 08:43

pablocostass force-pushed the parallel_proper branch from 6bf55fe to c85d408 Compare March 29, 2021 17:40

kostis reviewed Mar 31, 2021

View reviewed changes

Makefile Outdated Show resolved Hide resolved

kostis reviewed Mar 31, 2021

View reviewed changes

src/proper.erl Outdated Show resolved Hide resolved

kostis reviewed Mar 31, 2021

View reviewed changes

src/proper.erl Outdated Show resolved Hide resolved

pablocostass force-pushed the parallel_proper branch from c85d408 to e057d79 Compare March 31, 2021 17:20

pablocostass requested a review from kostis April 1, 2021 08:52

pablocostass force-pushed the parallel_proper branch from e057d79 to 03937b3 Compare April 11, 2021 13:06

pablocostass force-pushed the parallel_proper branch from 03937b3 to 1b8de04 Compare April 13, 2021 06:32

pablocostass and others added 14 commits May 3, 2021 20:42

Fix the case where there are more processes than tests

86838f6

Rename options and variables to be more clear

dd0ae5f

Add the first steps towards a more safe parallelization

1b0138f

Make PropEr use a node when parallelizing

5977671

This way, when using workers to distribute tests among processes, PropEr will first create a node for them to be spawned on, avoiding crashing the whole VM if something goes wrong.

Properly handle the processes termination

0198b75

Clean up the code a bit and fix dialyzer warnings

6228f19

Fix a pattern matching of received test results

c390806

Avoid unneeded pattern matching and an extra line by using lists:fore…

8391168

…ach instead of lists:map

Fix wrong number of tests on property fail

7e80a84

Also did some refactor, mainly changin from *processes* to *workers* (and so `num_processes` to `num_workers`, etc), and added some documentation.

Fix error handling on perform not being sent back

daf3423

Also enforce a more strict pattern matching when trying to determine if a property is stateless or stateful.

pablocostass added 6 commits May 3, 2021 20:47

Stop using nodes with stateless properties

26c246f

Avoid stopping nodes when proper:module was the entry point

ca75a6d

Rename make target to test-parallel and address review comments

84cd4b4

Disable unknown warning in Dialyzer

831ed72

This commit can and will be dropped at a future point in time, and is only being added to help continue testing parallel PropEr in Github Actions.

Rename num_workers to numworkers to be consistent with the existi…

c5ce0cc

…ng API