Fixed a bug in which totals doesn't work for column-indexed tables #79

x8lucas8x · 2016-12-12T09:43:22Z

In a nutshell, this fixes fixes #49. While doing that I tried to make the design a bit simpler in terms of handling totals. For such, I got rid of most hardcoded Totals strings I found and passed a display label for Totals as part of display_options, so that the Total implementation have less corner cases and look more like an ordinary metric. That will also make future refactoring easier. Besides, the NaN in columns with rollups are being replaced with Totals.key right at the source (i.e. query_data), even before setting the indexes. Some tests were also fixed, especially test_rollup_cont_cat_cat_dim_multi_metric() in test_datatables, which had all values for Totals set to None and therefore was essentially wrong.

coveralls · 2016-12-12T09:50:01Z

Coverage decreased (-0.05%) to 97.505% when pulling 44d5622 on x8lucas8x:fix-totals-for-column-indexed-tables into 96f1493 on kayak:master.

twheys · 2016-12-12T13:20:52Z

fireant/slicer/queries.py


        querystring = str(query)
        logger.info("Executing query:\n----START----\n{query}\n-----END-----".format(query=querystring))

        dataframe = database.fetch_dataframe(querystring)
+
+        for dimension_key in rollup:
+            dataframe[dimension_key].replace([np.nan], [Totals.key], inplace=True)


This is tricky because there could be real null values in the query. I'm not sure off the top of my head how it works with rollup but I think they might get merged together with the totals. This fixes one problem with null values be labeled as totals when they're not, though.

Could you maybe look into using totals on a dimension with null values in the database? The currently solution is to expect the user to use Coaesce on the dimension, but would be nice if we could make this automatic, but that isn't ideal.

@twheys Rollup returns totals as none/NaN, so any null value will get merged as you mentioned. Apparently there is no way to address that in the server. One workaround in python would be to indirectly calculate the amount for null entries by creating columns which are equal to the max value, which will definitely be the total, minus all the other ids/categories. Either right before or after that replace above. By the way, It's worth mentioning that the previous implementation also had the same problem. It just tried to set the totals label in a different place while maintaining the keys.

Right, it's not a new issue, just bringing it up

twheys · 2016-12-12T13:27:02Z

fireant/slicer/transformers/datatables.py

+        for level in dataframe.index.levels[1]:
+            metric_data = dict(self._recurse_dimensions(dataframe[:, level], dimensions[1:], metrics))
+
+            if not metric_data:


If this check is necessary here, wouldn't it be necessary in the above loop on line 283?

@twheys Good catch. I fixed that.

twheys · 2016-12-12T13:27:41Z

fireant/slicer/transformers/highcharts.py

-                if value and not (isinstance(value, (float, int)) and np.isnan(value))
-                else 'Totals'
-                for value in dataframe.index]
+        return [display_options.get(value, value) for value in dataframe.index]


This is much cleaner, thanks

twheys · 2016-12-12T13:28:32Z

fireant/tests/mock_dataframes.py

@@ -31,7 +33,7 @@ def rollup(dataframe, levels):
    'd': 'D',
    'y': 'Y',
    'z': 'Z',
-    np.nan: 'Total',
+    '_total': 'Total',


could you please use the constant for this here and everywhere else in the tests?

@twheys Done.

coveralls · 2016-12-12T15:26:40Z

Coverage decreased (-0.04%) to 97.511% when pulling b76e143 on x8lucas8x:fix-totals-for-column-indexed-tables into 96f1493 on kayak:master.

coveralls · 2016-12-12T15:33:34Z

Coverage decreased (-0.04%) to 97.511% when pulling 909e8c4 on x8lucas8x:fix-totals-for-column-indexed-tables into 96f1493 on kayak:master.

twheys · 2016-12-14T10:27:06Z

fireant/slicer/transformers/datatables.py

-                    for key, display in zip(*dataframe.index.levels[1:3])]
+            levels = zip(*dataframe.index.levels[1:3])
+            format_key = lambda level: level[0]
+            generate_dataframe = lambda level: dataframe[:, level[0], level[1]]


Could this be called something like slice_metric_data ?

@twheys Done.

twheys · 2016-12-14T10:28:25Z

fireant/slicer/transformers/datatables.py

+            generate_dataframe = lambda level: dataframe[:, level[0], level[1]]
+        else:
+            levels = dataframe.index.levels[1]
+            format_key = lambda level: str(level)


This raises a codacy issue. You could just set format_key = str to avoid the lambda.

@twheys Done.

coveralls · 2016-12-14T13:16:05Z

Coverage decreased (-0.04%) to 97.52% when pulling 8465272 on x8lucas8x:fix-totals-for-column-indexed-tables into 2c3f528 on kayak:master.

twheys suggested changes Dec 12, 2016

View reviewed changes

x8lucas8x force-pushed the fix-totals-for-column-indexed-tables branch from 44d5622 to b76e143 Compare December 12, 2016 15:21

x8lucas8x force-pushed the fix-totals-for-column-indexed-tables branch from b76e143 to 909e8c4 Compare December 12, 2016 15:28

twheys reviewed Dec 14, 2016

View reviewed changes

Fixed a bug in which totals doesn't work for column-indexed tables.

8465272

x8lucas8x force-pushed the fix-totals-for-column-indexed-tables branch from 909e8c4 to 8465272 Compare December 14, 2016 13:10

mikeengland merged commit 7a4d7ed into kayak:master Dec 14, 2016

mikeengland mentioned this pull request Dec 16, 2016

(v0.6.0) Column index tables are raising exceptions due to a number of column mismatch in the JSON output #87

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed a bug in which totals doesn't work for column-indexed tables #79

Fixed a bug in which totals doesn't work for column-indexed tables #79

x8lucas8x commented Dec 12, 2016 •

edited

Loading

coveralls commented Dec 12, 2016 •

edited

Loading

twheys Dec 12, 2016

x8lucas8x Dec 12, 2016 •

edited

Loading

twheys Dec 14, 2016

twheys Dec 12, 2016

x8lucas8x Dec 12, 2016

twheys Dec 12, 2016

twheys Dec 12, 2016

x8lucas8x Dec 12, 2016

coveralls commented Dec 12, 2016 •

edited

Loading

coveralls commented Dec 12, 2016 •

edited

Loading

twheys Dec 14, 2016

x8lucas8x Dec 14, 2016

twheys Dec 14, 2016

x8lucas8x Dec 14, 2016

coveralls commented Dec 14, 2016 •

edited

Loading

Fixed a bug in which totals doesn't work for column-indexed tables #79

Fixed a bug in which totals doesn't work for column-indexed tables #79

Conversation

x8lucas8x commented Dec 12, 2016 • edited Loading

coveralls commented Dec 12, 2016 • edited Loading

Choose a reason for hiding this comment

x8lucas8x Dec 12, 2016 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

coveralls commented Dec 12, 2016 • edited Loading

coveralls commented Dec 12, 2016 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

coveralls commented Dec 14, 2016 • edited Loading

x8lucas8x commented Dec 12, 2016 •

edited

Loading

coveralls commented Dec 12, 2016 •

edited

Loading

x8lucas8x Dec 12, 2016 •

edited

Loading

coveralls commented Dec 12, 2016 •

edited

Loading

coveralls commented Dec 12, 2016 •

edited

Loading

coveralls commented Dec 14, 2016 •

edited

Loading