Skip to content

Implement user defined aggregations #1483

Merged
jcrist merged 2 commits intodask:masterfrom
jcrist:df-reductions
Aug 19, 2016
Merged

Implement user defined aggregations #1483
jcrist merged 2 commits intodask:masterfrom
jcrist:df-reductions

Conversation

@jcrist
Copy link
Copy Markdown
Member

@jcrist jcrist commented Aug 18, 2016

Adds a reduction method on dask.dataframe objects for applying user defined aggregations. This is a small wrapper around apply_concat_apply, with some slight changes to make applying
reductions simpler and more intuitive. All use of _aca_agg internally has been replaced with this functionality.

Fixes #1313, supersedes #1362.

jcrist added 2 commits August 18, 2016 17:25
Adds a `reduction` method on `dask.dataframe` objects for applying user
defined aggregations. This is a small wrapper around
`apply_concat_apply`, with some slight changes to make applying
reductions simpler and more intuitive. All use of `_aca_agg` internally
has been replaced with this functionality.

Fixes dask#1313, supersedes dask#1362.
No longer needed, superseded by `reduction` method.
If the output of ``chunk`` is a:
- scalar: Input is a Series, with one row per partition.
- Series: Input is a DataFrame, with one row per partition. Columns
are the rows in the output series.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not a concatenated Series?

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, never mind, I now see that this is is aligned with how dataframes handle individual rows.

@mrocklin
Copy link
Copy Markdown
Member

+1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants