BUG: df.agg(sum, axis=1) uses different method than when axis=0 #21222

topper-123 · 2018-05-27T09:57:24Z

closes BUG: df.agg(sum, axis=1) gives wrong result when Nan value is in frame #21134
xref ENH: add np.nan funcs to _cython_table #21123
tests added / passed
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

This is a split-off from #21123 that only fixes #21134. #19629 will be fixed in a separate PR afterwards.

Passing builtins such as sum to df.agg works correctly when axis=0, but can give the wrong result when axis=1 and the frame contains NaN values.
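To make the discrepancy concrete, here is a minimal sketch of the two reduction semantics involved. It deliberately does not call df.agg, since the result of that call is exactly what this PR changes; it only contrasts Python's builtin sum, which propagates NaN, with the pandas reduction, which skips NaN by default. The frame contents are made up for illustration.

```python
import numpy as np
import pandas as pd

# Illustrative frame (not taken from the PR): one NaN in column "a".
df = pd.DataFrame({"a": [1.0, np.nan], "b": [2.0, 3.0]})

# Python's builtin sum applied to each row propagates NaN.
row_builtin = [sum(row) for row in df.to_numpy().tolist()]

# The pandas reduction skips NaN by default (skipna=True), on either axis.
row_pandas = df.sum(axis=1).tolist()
col_pandas = df.sum(axis=0).tolist()

print(row_builtin)
print(row_pandas)
print(col_pandas)
```

If agg with axis=1 ends up calling the builtin on each row instead of dispatching to the pandas reduction, the second row comes out as NaN rather than 3.0.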

Explanation

Passing the functions in SelectionMixin._cython_table to df.agg should dispatch to the relevant cython functions. This currently works as expected when axis=0, but not always when axis=1.

The reason for this difference is that df.aggregate currently defers to df._aggregate when axis=0, but defers to df.apply when axis=1, and these give different results when passed functions and the series/frame contains NaN values. I've solved this by transposing df in _aggregate when axis=1.

The tests have been heavily parametrized, helping ensure that the various ways to call df.agg now give the correct result.
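The transposition idea can be sketched without pandas. The code below is a toy model, not the pandas implementation: the names nansum, aggregate_columns and aggregate are hypothetical, and a table is just a list of rows. It only shows why routing axis=1 through the same column-wise path, after a transpose, guarantees both axes use the same function.

```python
import math


def nansum(values):
    """NaN-skipping sum, standing in for the cython-backed reduction."""
    return sum(v for v in values if not math.isnan(v))


def aggregate_columns(table, func):
    """Single code path: apply func to each column of a list-of-rows table."""
    return [func(list(col)) for col in zip(*table)]


def aggregate(table, func, axis=0):
    """Hypothetical stand-in for _aggregate: axis=1 reuses the axis=0 path."""
    if axis == 1:
        # Transpose so that rows become columns, then reduce column-wise.
        table = [list(col) for col in zip(*table)]
    return aggregate_columns(table, func)


table = [[1.0, 2.0], [math.nan, 3.0]]
by_column = aggregate(table, nansum, axis=0)
by_row = aggregate(table, nansum, axis=1)
print(by_column, by_row)
```

Because both axes go through aggregate_columns, there is no second implementation that could treat NaN differently.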

pep8speaks · 2018-05-27T09:57:31Z

Hello @topper-123! Thanks for updating the PR.

Cheers ! There are no PEP8 issues in this Pull Request. 🍻

Comment last updated on May 27, 2018 at 10:02 Hours UTC

topper-123 · 2018-05-27T10:31:36Z

I've thought of a couple of issues that should be tested. I'm marking this as WIP until that is done.

Temporarily closed.

topper-123 mentioned this pull request May 27, 2018

ENH: add np.nan funcs to _cython_table #21123

Closed

3 tasks

Fix bug where df.agg(..., axis=1) gives wrong result

0caac6b

topper-123 force-pushed the agg_funcs_axis_1 branch from f1365a6 to 0caac6b Compare May 27, 2018 10:02

topper-123 changed the title ~~BUG: bug where df.agg(..., axis=1) gives wrong result~~ WIP/BUG: bug where df.agg(..., axis=1) gives wrong result May 27, 2018

topper-123 closed this May 27, 2018

topper-123 changed the title ~~WIP/BUG: bug where df.agg(..., axis=1) gives wrong result~~ BUG: bug where df.agg(..., axis=1) uses different method than when axis=0 May 27, 2018

topper-123 changed the title ~~BUG: bug where df.agg(..., axis=1) uses different method than when axis=0~~ BUG: df.agg(sum, axis=1) uses different method than when axis=0 May 27, 2018
