Hypothesis warnings and speed up some tests #1238

marcharper · 2019-02-05T13:06:34Z

The first commit removes the hypothesis decorator kwarg max_iterations that give deprecation warnings (and are reported as no-ops). I changed hypothesis to be pinned at >= 3.2 instead of ==. [Edit: see below, also limited to <=3.68] See #1214 .

The second commit reduces the run time of some of the longer tests. These changes cut the local total test run time down by 65% (excluding doctests, which take about 60 seconds to run). For example some tests now avoid the Meta strategies (like the filter set tests). Others now only test a random sample of strategies. (We may need to add some random seeding.) There are still tests that run larger tournaments so I think the risk for this change is low.

Also, I marked one test as skippable if the matplotlib version is 3+, and removed the restriction in requirements.txt. Alternatively we could require matplotlib 3+ and update the test. It's just the colorbar position, so I think we can live without the test for a short while. See #1209 .

Closes #1209

marcharper · 2019-02-05T13:31:07Z

OK so we still need to limit the version of hypothesis somewhat, else it reports slow tests as errors.

axelrod/tests/strategies/test_meta.py

axelrod/tests/integration/test_tournament.py

marcharper · 2019-02-05T14:19:54Z

On further investigation, it appears that the colorbar location is more dynamic in mpl 3+. Calling tight _layout, for example, causes it to move. I removed that portion of the test, which removes the need to skip the test and to limit mpl to < 3.

marcharper · 2019-02-07T01:54:01Z

This PR has revealed a tangential issue (causing a test to intermittently fail) that was somewhat tricky to track down. The strategy ShortMem is listed as memory_depth=10 and is failing this memory depth test. Removing the max_iterations hypothesis variable somehow led to discovery of a counterexample seed.

I fixed it and some other tests. Even though the strategy only uses the last 10 rounds, its behavior changes at round 10, therefore it has to know the exact number of rounds played, i.e. the full history.

marcharper · 2019-02-07T02:03:40Z

I also re-pinned hypothesis. I was getting some odd coverage issues we using a newer version. Essentially coverage thought that some of the custom hypothesis decorators were not covered even though they have direct tests.

drvinceknight · 2019-02-07T08:17:15Z

I also re-pinned hypothesis. I was getting some odd coverage issues we using a newer version. Essentially coverage thought that some of the custom hypothesis decorators were not covered even though they have direct tests.

I noticed that: how weird...

I'll have a proper look through everything later today @marcharper, nice work 👍

axelrod/tests/integration/test_filtering.py

axelrod/tests/integration/test_tournament.py

drvinceknight · 2019-02-07T18:50:20Z

axelrod/tests/integration/test_filtering.py

    )
    @example(
        min_memory_depth=float("inf"),
        max_memory_depth=float("inf"),
        memory_depth=float("inf"),
+        strategies=strategy_lists(min_size=20, max_size=20),


This should be an actual list of strategies here for the @example. To do this properly with hypothesis I believe we should remove the example case here, and write a second test OR we could go with just strategies=short_run_time_strategies (the correct import will be needed at the top). I've run this locally and it seems fine without a large speedup.

It's strange because these tests are running fine for me locally but not on Travis, and we didn't explicitly list strategies in this example previously.

Strange that it runs locally... Perhaps a slightly different version of hypothesis has implemented the ability to use strategies inside examples... Would be neat if it did...

Previously I think we were just using all_strategies from the global space.

drvinceknight · 2019-02-07T18:52:41Z

.travis.yml

@@ -24,7 +24,7 @@ script:
  - cd docs; make clean; make html
  # Run the test suit with coverage
  - cd ..
-  - travis_wait 60 coverage run --source=axelrod -m unittest discover
+  - travis_wait 60 coverage run --source=axelrod -m unittest discover -v


Double checking that the verbose tag isn't just something you wanted for debugging @marcharper? I don't necessarily mind either way (it does make scrolling through the travis log a big long but no big deal).

I'll take it out

I'm fine with adding it back in, I did add it just for debugging Travis

drvinceknight · 2019-02-08T10:00:35Z

Very nice work @marcharper, this is a welcome speedup/refactor of the tests 👍

marcharper force-pushed the warnings branch from ffe2bb0 to 8554dbe Compare February 5, 2019 13:13

drvinceknight requested changes Feb 5, 2019

View reviewed changes

axelrod/tests/strategies/test_meta.py Outdated Show resolved Hide resolved

drvinceknight requested changes Feb 5, 2019

View reviewed changes

axelrod/tests/integration/test_tournament.py Outdated Show resolved Hide resolved

marcharper force-pushed the warnings branch 2 times, most recently from e3c1896 to b8f7867 Compare February 6, 2019 05:18

marcharper added 5 commits February 5, 2019 21:42

Fix some deprecation warnings

a498170

Use sampling to reduce run of some long tests, mostly

57cf85c

Update seeding to use hypothesis on test_clone

9a02a68

Hypothesize test_full_tournament into test_big_tournaments

99d2c72

Update payoff plot test for matplotlib 3+

6966a5d

marcharper force-pushed the warnings branch 3 times, most recently from 49eda27 to 8d5318a Compare February 7, 2019 01:43

drvinceknight requested changes Feb 7, 2019

View reviewed changes

axelrod/tests/integration/test_filtering.py Outdated Show resolved Hide resolved

axelrod/tests/integration/test_tournament.py Outdated Show resolved Hide resolved

marcharper force-pushed the warnings branch from 2374e8b to 867c9f6 Compare February 7, 2019 15:58

drvinceknight reviewed Feb 7, 2019

View reviewed changes

marcharper added 4 commits February 7, 2019 17:32

Pin hypothesis back to 3.2

3402207

Fix memory depth bug revealed by test

2897d57

Update some newly modified tests to use hypothesis

d7ce789

Update hypothesis example

dc30880

marcharper force-pushed the warnings branch from 867c9f6 to dc30880 Compare February 8, 2019 01:33

drvinceknight approved these changes Feb 8, 2019

View reviewed changes

drvinceknight added the ready-to-merge label Feb 8, 2019

meatballs merged commit 09b92e5 into master Feb 11, 2019

meatballs deleted the warnings branch February 11, 2019 14:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hypothesis warnings and speed up some tests #1238

Hypothesis warnings and speed up some tests #1238

marcharper commented Feb 5, 2019 •

edited

Loading

marcharper commented Feb 5, 2019

marcharper commented Feb 5, 2019 •

edited

Loading

marcharper commented Feb 7, 2019 •

edited

Loading

marcharper commented Feb 7, 2019

drvinceknight commented Feb 7, 2019

drvinceknight Feb 7, 2019 •

edited

Loading

marcharper Feb 8, 2019

drvinceknight Feb 8, 2019

drvinceknight Feb 7, 2019

marcharper Feb 8, 2019

marcharper Feb 8, 2019

drvinceknight commented Feb 8, 2019

Hypothesis warnings and speed up some tests #1238

Hypothesis warnings and speed up some tests #1238

Conversation

marcharper commented Feb 5, 2019 • edited Loading

marcharper commented Feb 5, 2019

marcharper commented Feb 5, 2019 • edited Loading

marcharper commented Feb 7, 2019 • edited Loading

marcharper commented Feb 7, 2019

drvinceknight commented Feb 7, 2019

drvinceknight Feb 7, 2019 • edited Loading

Choose a reason for hiding this comment

marcharper Feb 8, 2019

Choose a reason for hiding this comment

drvinceknight Feb 8, 2019

Choose a reason for hiding this comment

drvinceknight Feb 7, 2019

Choose a reason for hiding this comment

marcharper Feb 8, 2019

Choose a reason for hiding this comment

marcharper Feb 8, 2019

Choose a reason for hiding this comment

drvinceknight commented Feb 8, 2019

marcharper commented Feb 5, 2019 •

edited

Loading

marcharper commented Feb 5, 2019 •

edited

Loading

marcharper commented Feb 7, 2019 •

edited

Loading

drvinceknight Feb 7, 2019 •

edited

Loading