ENH: add to cython groupby functions #4095
Comments
|
@jreback can I take this? If it's for 0.13 you have some time and I'd be really interested in a chance to dig into more of the Cython internals. |
|
go for it! you will need to add a function template in src/generate_code.py and add it in the appropriate places in groupby.py |
|
@jreback cool - thanks :) I'm looking forward to figuring all that out |
|
@jreback were you thinking this would cover time series shifting too? |
|
yes. the index type actually doesn't matter though; it's based on position shifting (well, the time-series stuff happens at a high level and is just translated to positions to move anyhow) |
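A minimal sketch of the point about positions, assuming a made-up frame with hypothetical columns `key` and `val` (not code from this thread): shifting by a number of periods within each group moves values by position, so a datetime index and a plain integer index give the same shifted values.

```python
import pandas as pd

# Hypothetical data: two groups on a daily datetime index.
df = pd.DataFrame(
    {"key": ["a", "a", "b", "b", "a"], "val": [1.0, 2.0, 3.0, 4.0, 5.0]},
    index=pd.date_range("2013-07-01", periods=5, freq="D"),
)

# Shift by one position within each group; the index values play no role.
by_date = df.groupby("key")["val"].shift(1)

# The same data on a default integer index yields the same shifted values.
by_pos = df.reset_index(drop=True).groupby("key")["val"].shift(1)

# Compare the values only, ignoring the two different indexes.
same_values = by_date.reset_index(drop=True).equals(by_pos)
```

Here `same_values` compares only the shifted values, after dropping the datetime index from `by_date`.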
|
@jreback - if it's already translated to positions to move, then that makes |
|
yep...look at pandas/core/frame/shift.... |
jtratner was assigned Aug 23, 2013
|
I've let this slide... I will try to circle back to this when I have a chance, but if someone else wants to take it go go go |
jreback modified the milestone: 0.15.0, 0.14.0 Feb 18, 2014
jreback added the Groupby label Feb 18, 2014
jreback referenced this issue Apr 29, 2014
Merged: ENH/BUG: add count to grouper / ensure that grouper keys are not included in the returned #7000
jreback changed the title from ENH: add shift to cython groupby functions to ENH: add to cython groupby functions Feb 11, 2015
|
I've started looking at this one. |
|
@jreback I have a good start here on Currently I've done this by creating a list of ops that won't compress (currently just |
|
these are kind of like (and I think should be implemented like) transform ops. another example is fillna. can you show the branch that you have so far? |
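To make the "transform-like" distinction concrete, here is a small sketch on made-up data (the column names `key` and `val` are hypothetical, and `ffill` stands in for a fillna-style op): a reduction such as `sum` compresses to one row per group, while `shift` and a forward fill return a result aligned with the original rows.

```python
import pandas as pd

# Hypothetical data with one missing value in group "a".
df = pd.DataFrame({"key": ["a", "a", "b", "b"], "val": [1.0, None, 3.0, 4.0]})
grouped = df.groupby("key")["val"]

reduced = grouped.sum()     # compresses: one row per group
shifted = grouped.shift(1)  # does not compress: one row per input row
filled = grouped.ffill()    # forward fill within each group, also row-aligned

print(len(df), len(reduced), len(shifted), len(filled))
```

The non-compressing results keep the index of `df`, which is why they can be assigned straight back as new columns.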
|
Alright, that helps. Let me get this cleaned up a little bit and I'll start a PR so that we can look at real code. |
jreback modified the milestone: 0.16.0, Next Major Release Mar 6, 2015
jreback added the Prio-medium, Difficulty Intermediate, and Effort Medium labels Aug 21, 2015
jreback modified the milestone: 0.17.1, Next Major Release Oct 11, 2015
chris-b1 added a commit to chris-b1/pandas that referenced this issue Nov 15, 2015
chris-b1: 9d11734
jreback added a commit that referenced this issue Nov 16, 2015
chris-b1 + jreback: b07dd0c
closed by #10901
jreback closed this Nov 16, 2015
Oleg-Krivosheev commented Nov 25, 2015
|
Looks like there might be a problem: http://stackoverflow.com/questions/33903624/python-pandas-memoryerror-skiplist-failed-when-using-rolling-median-and-apply |
randomgambit commented Apr 22, 2016
|
@jreback sorry to revive this but in my dataframe, a simple
takes just forever. Is this just because the data is big, or is this related to the old problems that are mentioned here? Happy to help if I can! |
|
anything with a lambda function will by definition be slow; it's basically a python loop. but what you are doing is NOT a transform, which must return a scalar per group. |
|
you probably want
|
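As a general illustration of the point about lambdas (a sketch on made-up data with hypothetical columns `key` and `val`, not the snippet from the comments here): passing a lambda makes pandas call Python code once per group, while the built-in groupby method gives the same values without a per-group callback.

```python
import pandas as pd

# Hypothetical data: interleaved groups.
df = pd.DataFrame({"key": ["a", "b", "a", "b", "a"], "val": [1, 2, 3, 4, 5]})

# The lambda is invoked once per group, i.e. a Python-level loop over groups.
via_lambda = df.groupby("key")["val"].transform(lambda s: s.shift(1))

# The built-in method handles all groups in one call.
via_builtin = df.groupby("key")["val"].shift(1)

# Compare after filling the leading NaN of each group with a sentinel.
same = via_lambda.fillna(-1).tolist() == via_builtin.fillna(-1).tolist()
```

With only two groups the difference is invisible; the cost of the lambda route grows with the number of groups.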
randomgambit commented Apr 22, 2016
|
Thanks Jeff
Wait but my understanding is that
|
|
that's what I said |
randomgambit commented Apr 22, 2016
|
got it thanks. by the way |
randomgambit commented Apr 22, 2016
|
I think I will write a book soon: "The 100 most common errors every Pandas user has to make". bestseller on amazon for sure |
jreback commented Jul 1, 2013