Extract measures #117

scanny · 2018-11-06T19:38:47Z

This refactoring extracts a _Measures object and a hierarchy of measure objects that subclass _BaseMeasure. Specifically, these are _MeanMeasure, _UnweightedCountMeasure, and _WeightedCountMeasure.

These objects extract the measure-related logic out of CrunchCube as pure-functions (lazyproperties). The new objects have 100% unit test coverage and simplify CrunchCube.

coveralls · 2018-11-06T19:41:56Z

Coverage remained the same at 100.0% when pulling 6e10548 on extract-measures into efea848 on master.

* Move work of resolving type of cube-response and parsing JSON to a helper method (._cube_dict). * Change all instances in unit tests of `CrunchCube({})` to `CrunchCube(None)`. This is a mild case, but shows the main reason to avoid real work in the constructor, because all unit tests have to satisfy the arguments required by the real-work.

Get rid of warning-spew that occurred on test runs from calling deprecated CrunchCube.index() in tests.

There are no callers using the optional `adjusted` argument. Having such an argument conflates two separate operations. If adjustment is required, it can easily be performed by the caller on the end-result of as_array().

Remove redundant (copies) integration tests from test_headers_and_subtotals.py

* Remove unused `margin` parameter from CrunchCube.as_array(). If in future such a return value is required it should be provided using a distinct method rather than overloading `.as_array()`. * Remove now unnecessary `margin` parameter from CrunchCube._as_array(). Change call for raw_cube_array to use measure object returned by new `._measure()` method. * Remove now-dead `.raw_cube_array()` method.

* Replace calls to CrunchCube._data() with calls to ._counts() or ._measure() depending on context, using the .raw_cube_array property of the returned measure object to access the appropriate array formerly provided by ._data(). * Remove CrunchCube._data() and its helpers `._flat_values()` and `._shape` as they are now dead code.

* Add `make unit-coverage` task to Makefile that shows our unit test coverage is pretty spotty, indicating an over-reliance on integration tests (which do not test fine-grained behaviors).

slobodan-ilic

Overall 💯 🎉
Just a couple of comments regarding the style and unit test coverage. See if you can address them, and then we can merge... I can even merge like this, but want your input on comments I made.

slobodan-ilic · 2018-11-07T07:51:10Z

src/cr/cube/crunch_cube.py

+        unweighted counts are returned.
+        """
+        return (
+            self._measures.means if self._measures.means is not None else


Do we require is not None part here? It seems that the return value is either None or an instance, in which case the if self._measures.means would be enough.

Maybe the explicit form would be more readable here:

measures = self._measures if measures.means is not None: return measures.means if weighted: return measures.weighted_counts return measures.unweighted_counts

I believe it would also not hide potential misses by coverage (not sure about this though). I tried it locally, and it increases the unit test coverage from 65.94% to 66.10%

slobodan-ilic · 2018-11-07T08:22:28Z

src/cr/cube/crunch_cube.py

+                else json.loads(cube_response)
+            )
+            # ---cube is 'value' item in a shoji response---
+            return cube_dict.get('value', cube_dict)


The case when there is a "value" key doesn't seem to be tested by unit tests.

slobodan-ilic · 2018-11-07T09:08:52Z

src/cr/cube/crunch_cube.py

@@ -1466,6 +1466,13 @@ def means(self):
            return None
        return _MeanMeasure(self._cube_dict, self._all_dimensions)

+    @lazyproperty
+    def missing_count(self):
+        """numeric representing count of missing rows in cube response."""


I'm not sure that "row" is the thing that's being counted as missing here...

slobodan-ilic · 2018-11-07T10:38:43Z

src/cr/cube/crunch_cube.py

+        return (
+            self._measures.weighted_counts if weighted else
+            self._measures.unweighted_counts
+        )


Writing it like this:

if weighted: return self._measures.weighted_counts return self._measures.unweighted_counts

Increases the unit test coverage from 65.94 to 66.01 percent.

slobodan-ilic · 2018-11-07T10:39:46Z

src/cr/cube/crunch_cube.py

        Returns
            res (ndarray): Tabular representation of crunch cube
        """
        return self._apply_missings_and_insertions(
-            self._raw_cube_array(weighted, margin),
+            self._measure(weighted).raw_cube_array,


slobodan-ilic · 2018-11-07T10:42:35Z

tests/fixtures/scale_means/__init__.py

@@ -1,22 +0,0 @@
-import os


I'm not sure that this particular commit corresponds to the message.

scanny requested a review from slobodan-ilic November 6, 2018 19:38

scanny added 28 commits November 6, 2018 17:09

rfctr: rename CrunchCube._cube to _cube_dict

6722cda

test: remove deprecated cube.index() calls

2c3aa9f

Get rid of warning-spew that occurred on test runs from calling deprecated CrunchCube.index() in tests.

rfctr: remove 'adjusted' arg from ._as_array()

b2dce84

There are no callers using the optional `adjusted` argument. Having such an argument conflates two separate operations. If adjustment is required, it can easily be performed by the caller on the end-result of as_array().

cube: add xfails to drive _Measures TDD

648eced

cube: add _Measures.is_weighted

63cd839

cube: add _Measures.means

4067b5f

cube: add _Measures.missing_count

6e1b177

cube: add _MeanMeasure.missing_count

16c112d

cube: add _Measures.population_fraction

7e87398

cube: add _Measures.unweighted_counts

263fe2d

cube: add _Measures.unweighted_n

c32dea4

cube: add _Measures.weighted_counts

f1075b9

cube: add _Measures.weighted_n

c2e6f97

tdd: add xfails for _BaseMeasure subclasses

e8ad515

cube: add _BaseMeasure.raw_cube_array

3a933bd

cube: add _MeanMeasure._flat_values

ee91586

cube: add _UnweightedCountMeasure._flat_values

9387d75

cube: add _WeightedCountMeasure._flat_values

4de0e30

rfctr: reimplement CrunchCube.count()

f642554

Remove redundant (copies) integration tests from test_headers_and_subtotals.py

cube: add CrunchCube._measures

3af112e

rfctr: reimplement CrunchCube.has_means

f083e75

rfctr: reimplement CrunchCube.is_weighted

4400638

rfctr: reimplement CrunchCube.missing

76a6bca

rfctr: reimplement CrunchCube.population_fraction

1da96ce

cube: add CrunchCube._measure()

bcc1144

cube: add CrunchCube._counts()

195470e

scanny added 2 commits November 6, 2018 17:09

test: scrub overall test coverage

6e10548

* Add `make unit-coverage` task to Makefile that shows our unit test coverage is pretty spotty, indicating an over-reliance on integration tests (which do not test fine-grained behaviors).

scanny force-pushed the extract-measures branch from 392005a to 6e10548 Compare November 7, 2018 01:11

slobodan-ilic requested changes Nov 7, 2018

View reviewed changes

slobodan-ilic mentioned this pull request Nov 7, 2018

Extract denominator #118

Closed

slobodan-ilic approved these changes Nov 7, 2018

View reviewed changes

scanny merged commit 6e10548 into master Nov 7, 2018

scanny deleted the extract-measures branch May 23, 2019 23:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extract measures #117

Extract measures #117

scanny commented Nov 6, 2018

coveralls commented Nov 6, 2018 •

edited

Loading

slobodan-ilic left a comment

slobodan-ilic Nov 7, 2018

slobodan-ilic Nov 7, 2018

slobodan-ilic Nov 7, 2018

slobodan-ilic Nov 7, 2018

slobodan-ilic Nov 7, 2018

slobodan-ilic Nov 7, 2018

slobodan-ilic Nov 7, 2018

Extract measures #117

Extract measures #117

Conversation

scanny commented Nov 6, 2018

coveralls commented Nov 6, 2018 • edited Loading

slobodan-ilic left a comment

Choose a reason for hiding this comment

slobodan-ilic Nov 7, 2018

Choose a reason for hiding this comment

slobodan-ilic Nov 7, 2018

Choose a reason for hiding this comment

slobodan-ilic Nov 7, 2018

Choose a reason for hiding this comment

slobodan-ilic Nov 7, 2018

Choose a reason for hiding this comment

slobodan-ilic Nov 7, 2018

Choose a reason for hiding this comment

slobodan-ilic Nov 7, 2018

Choose a reason for hiding this comment

slobodan-ilic Nov 7, 2018

Choose a reason for hiding this comment

coveralls commented Nov 6, 2018 •

edited

Loading