Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement optimized method to select subsets or rows of a DataFrame from groups by rank (-1 -> last) or order statistics #978

Closed
wesm opened this issue Mar 28, 2012 · 2 comments
Labels
Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff Enhancement Performance Memory or execution speed performance

Comments

@wesm
Copy link
Member

wesm commented Mar 28, 2012

Motivated by: http://stackoverflow.com/questions/9850954/pandas-get-most-recent-value-of-a-particular-column-indexed-by-another-column

@jreback jreback modified the milestones: 0.15.0, 0.14.0 Feb 18, 2014
@jreback jreback modified the milestones: 0.16.0, 0.17.0 Jan 26, 2015
@ron819
Copy link

ron819 commented Dec 10, 2018

@jreback
Is this still needed? It's marked for 0.17 which was released on 2015.

@jreback
Copy link
Contributor

jreback commented Dec 10, 2018

@ron819 this is not marked for 0.17 at all. the soln which is reasonable is
df.loc[df.groupby('obj_id').data_date.idxmax(),:]

@jreback jreback closed this as completed Dec 10, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff Enhancement Performance Memory or execution speed performance
Projects
None yet
Development

No branches or pull requests

3 participants