More combinatorial functions #1026

jeromekelleher · 2020-11-21T19:53:04Z

Adds some more generators for common tree shapes, and makes some overall improvements to the combinatorics code.

Supersedes #944

PR Checklist:

Tests that fully cover new/changed functionality.
Documentation including tutorial content if appropriate.
Changelogs, if there are API changes.

AdminBot-tskit · 2020-11-21T19:54:41Z

📖 Docs for this PR can be previewed here

codecov · 2020-11-21T19:59:07Z

Codecov Report

Merging #1026 (645ae81) into main (e5ef2d5) will increase coverage by 0.02%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main    #1026      +/-   ##
==========================================
+ Coverage   93.69%   93.71%   +0.02%     
==========================================
  Files          26       26              
  Lines       20848    20920      +72     
  Branches      859      875      +16     
==========================================
+ Hits        19533    19606      +73     
  Misses       1277     1277              
+ Partials       38       37       -1

Flag	Coverage Δ
c-tests	`92.49% <ø> (ø)`
lwt-tests	`93.58% <ø> (ø)`
python-c-tests	`94.90% <100.00%> (+0.04%)`	⬆️
python-tests	`98.61% <100.00%> (+0.04%)`	⬆️

Impacted Files	Coverage Δ
python/tskit/combinatorics.py	`99.16% <100.00%> (+0.09%)`	⬆️
python/tskit/trees.py	`97.46% <100.00%> (+0.05%)`	⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

jeromekelleher · 2020-11-22T23:46:00Z

This is ready for review now, I think. The main thing that's added is the generate_balanced function, which can do arbitrary arities, e.g.:

       15             
  ┏━━━━━╋━━━━━━┓      
  ┃     ┃     14      
  ┃     ┃   ┏━┳┻━━┓   
 11    12   ┃ ┃  13   
┏━╋━┓ ┏━╋━┓ ┃ ┃ ┏━╋━━┓
0 1 2 3 4 5 6 7 8 9 10

I've made sure that the generated trees are identical to the trees we'd get from unrank, and documented how we allocate internal nodes there.
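For readers following along, here is a minimal pure-Python sketch of how the leaf splitting and internal-node numbering in the diagram above could work. It does not use tskit, and `split_chunks` and `balanced_parents` are hypothetical names invented for illustration. The splitting rule (leading chunks of size `max(1, n // k)`, with the remainder going to the last chunk) and the postorder numbering are inferred from the diagram and from the diff fragment quoted in the review further down, not copied from the PR.

```python
def split_chunks(items, k):
    """Split items into at most k chunks; the last chunk absorbs the remainder.

    Hypothetical helper: the rule is inferred from the diagram, not from tskit.
    """
    n = len(items)
    size = max(1, n // k)
    chunks = []
    offset = 0
    while offset < n - size and len(chunks) < k - 1:
        chunks.append(items[offset:offset + size])
        offset += size
    chunks.append(items[offset:])
    return chunks


def balanced_parents(num_leaves, arity=2):
    """Return (root, {child: parent}) for a balanced tree.

    Leaves are 0..num_leaves-1; internal nodes are numbered in postorder,
    starting at num_leaves (an assumption read off the diagram).
    """
    if arity < 2:
        raise ValueError("arity must be at least 2")
    parent = {}
    next_id = num_leaves

    def build(leaves):
        nonlocal next_id
        if len(leaves) == 1:
            return leaves[0]
        # Build every child subtree first, then allocate this node's ID.
        children = [build(chunk) for chunk in split_chunks(leaves, arity)]
        node = next_id
        next_id += 1
        for child in children:
            parent[child] = node
        return node

    root = build(list(range(num_leaves)))
    return root, parent


root, parent = balanced_parents(11, arity=3)
print(root, parent)
```

With 11 leaves and arity 3 this gives root 15 with children 11, 12 and 14, node 14 with children 6, 7 and 13, and node 13 with children 8, 9 and 10, matching the diagram. In tskit itself the corresponding call is `tskit.Tree.generate_balanced(11, arity=3)`.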

@daniel-goldstein, would you mind taking a quick look here to make sure I haven't done anything silly?

jeromekelleher · 2020-11-23T09:30:42Z

@benjeffery - any idea how we should do abstract test classes using pytest? It's annoying me that we need to duplicate the code for tests like test_provenance here. I'd have done a Mixin class here in the old days, so that all the test classes inherit the test, but the test isn't executed on the abstract class.

Also, good to get your thoughts on the style that's evolving here for pytest...

benjeffery · 2020-11-23T11:00:45Z

Well you can still use a Mixin if you like! Often I think a plain function at module-level is simpler, but there are times Mixins feel right.

Generally I think the tests here look good. Using the parameterisation decorator is good as it isolates failing cases easily; note that if you want all combinations you can stack the decorator. Also I like that we're grouping tests by class.
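As a small illustration of stacking the decorator, the sketch below applies `pytest.mark.parametrize` twice, so pytest generates one test case per combination of the two argument lists (3 × 3 = 9 cases here). The test name and its body are placeholders made up for this example, not tests from the PR.

```python
import pytest


@pytest.mark.parametrize("arity", [2, 3, 4])
@pytest.mark.parametrize("num_leaves", [2, 5, 11])
def test_divmod_identity(num_leaves, arity):
    # Placeholder assertion; the point is that every (num_leaves, arity)
    # pair is reported as its own test case.
    quotient, remainder = divmod(num_leaves, arity)
    assert quotient * arity + remainder == num_leaves
    assert 0 <= remainder < arity
```

A failing combination then shows up under its own test ID, which is what makes failures easy to isolate.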

jeromekelleher · 2020-11-23T14:31:18Z

@benjeffery, any chance of another look at the tests? I'm quite pleased how this turned out, might be quite a nice pattern to adopt/adapt elsewhere?

benjeffery · 2020-11-23T15:12:25Z

> @benjeffery, any chance of another look at the tests? I'm quite pleased how this turned out, might be quite a nice pattern to adopt/adapt elsewhere?

I've just realised another way to do this as you can parameterise a whole class:

import pytest


@pytest.mark.parametrize("method", ["flip", "catch", "hurl"])
class TestCommonsStuff:
    # Every test in the class runs once per value of `method`.
    def test_foobar(self, method):
        assert ...

    def test_other(self, method):
        assert ...

and then put the specific tests in their own class.

It is a matter of taste as always - I find what you have already very clear.

jeromekelleher · 2020-11-23T16:21:14Z

Good idea @benjeffery! I think I'll stick with the subclasses here as it's nice to be able to tack on extra specific tests as well as the common ones.
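The subclass pattern discussed in this exchange can be sketched roughly as follows. The shape generators and class names are hypothetical stand-ins rather than the PR's actual tests; the only pytest behaviour relied on is that, by default, it collects classes whose names start with `Test`, so the shared base class is not run on its own.

```python
def comb_shape(n):
    """Hypothetical stand-in: nested-tuple comb tree on leaves 0..n-1."""
    shape = n - 1
    for leaf in reversed(range(n - 1)):
        shape = (leaf, shape)
    return shape


def star_shape(n):
    """Hypothetical stand-in: a single root with n leaf children."""
    return tuple(range(n))


def leaves(shape):
    """Flatten a nested-tuple tree into its list of leaves."""
    if isinstance(shape, tuple):
        return [leaf for child in shape for leaf in leaves(child)]
    return [shape]


class ShapeChecks:
    """Common tests; not collected because the name lacks the Test prefix."""

    def generate(self, n):
        raise NotImplementedError

    def test_leaves_in_order(self):
        for n in range(2, 8):
            assert leaves(self.generate(n)) == list(range(n))


class TestComb(ShapeChecks):
    def generate(self, n):
        return comb_shape(n)

    def test_binary(self):
        # Extra check that only makes sense for the comb.
        shape = self.generate(5)
        while isinstance(shape, tuple):
            assert len(shape) == 2
            shape = shape[1]


class TestStar(ShapeChecks):
    def generate(self, n):
        return star_shape(n)

    def test_root_degree(self):
        # Extra check that only makes sense for the star.
        assert len(self.generate(5)) == 5
```

Each subclass inherits `test_leaves_in_order` and can add its own specific tests, which a class-level `parametrize` does not allow as directly.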

jeromekelleher · 2020-11-24T09:19:24Z

Thanks for the feedback, I've addressed your comments @hyanwong. I've marked this as "merge-ready" so it just needs an approval and it'll go in.

daniel-goldstein · 2020-11-24T12:54:31Z

Sorry spent all yesterday moving out for the holidays. I can still give it a look now if you'd like

jeromekelleher · 2020-11-24T13:54:41Z

> Sorry spent all yesterday moving out for the holidays. I can still give it a look now if you'd like

That would be great, thanks @daniel-goldstein! No particular hurry.

daniel-goldstein

This looks really great! Loving the pytest decorators. Just one little comment

daniel-goldstein · 2020-11-24T14:01:20Z

python/tskit/combinatorics.py

+        raise ValueError("Number of chunks must be a positive integer")
+
+    if n > 0:
+        chunk_size = max(1, n // k)


Can you do this instead with yield from itertools.islice(lst, chunk_size)?

jeromekelleher · Nov 24, 2020

Hey, good idea. No, I don't think islice will do it, the semantics of what we want are really quite specific in order to line up with the unranked trees. numpy has a function which almost does what we want, but not quite.
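To make the difference concrete, here is a rough comparison of fixed-size chunking via `itertools.islice` with the split that the 11-leaf diagram earlier in this thread suggests. Both function names are hypothetical, and the "remainder goes to the last chunk" rule is an inference from that diagram and the `chunk_size = max(1, n // k)` line in the diff, not the PR's actual code.

```python
import itertools


def islice_chunks(items, chunk_size):
    """Fixed-size chunks via islice; only the final chunk may be shorter."""
    it = iter(items)
    while chunk := list(itertools.islice(it, chunk_size)):
        yield chunk


def remainder_last_sizes(n, k):
    """Chunk sizes when the first k - 1 chunks hold n // k items and the
    last chunk takes whatever is left (assumes n >= k)."""
    size = n // k
    return [size] * (k - 1) + [n - size * (k - 1)]


n, k = 11, 3
chunk_size = max(1, n // k)
print([len(c) for c in islice_chunks(range(n), chunk_size)])  # [3, 3, 3, 2]
print(remainder_last_sizes(n, k))                             # [3, 3, 5]
```

Plain islice chunking produces four chunks for arity 3, whereas the diagram needs exactly three children at the root. If the NumPy function alluded to is `numpy.array_split` (an assumption), it does return exactly `k` chunks but spreads the remainder over the leading chunks, giving sizes 4, 4, 3 here rather than 3, 3, 5.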

jeromekelleher force-pushed the more-tree-shapes branch from eca6fc6 to 2e6920b Compare November 22, 2020 23:38

jeromekelleher requested review from daniel-goldstein and hyanwong November 22, 2020 23:40

jeromekelleher marked this pull request as ready for review November 22, 2020 23:40

jeromekelleher force-pushed the more-tree-shapes branch from 2e6920b to 5939fe6 Compare November 23, 2020 00:51

jeromekelleher force-pushed the more-tree-shapes branch from 5939fe6 to 0a083b0 Compare November 23, 2020 14:30

jeromekelleher force-pushed the more-tree-shapes branch from 0a083b0 to b879c24 Compare November 23, 2020 14:46

hyanwong reviewed Nov 23, 2020

python/tskit/combinatorics.py (resolved)

hyanwong reviewed Nov 23, 2020

python/tskit/trees.py (outdated, resolved)

hyanwong reviewed Nov 23, 2020

python/tskit/trees.py (outdated, resolved)

Improvements to tree generation and unranking

645ae81

- Tree.generate_balanced and improvements to unrank.
- Update some of the testing code for combinatorial functions.
- Add Tree.generate_comb.
- Implement balanced tree for arbitrary arity.

jeromekelleher force-pushed the more-tree-shapes branch from b879c24 to 645ae81 Compare November 24, 2020 09:18

jeromekelleher added the AUTOMERGE-REQUESTED label Nov 24, 2020

jeromekelleher removed the AUTOMERGE-REQUESTED label Nov 24, 2020

daniel-goldstein approved these changes Nov 24, 2020

jeromekelleher merged commit 71beec5 into tskit-dev:main Nov 24, 2020

jeromekelleher deleted the more-tree-shapes branch November 24, 2020 18:16

jeromekelleher mentioned this pull request Nov 25, 2020

Generate balanced tree #944

Closed
