Distinct .keep_all parameter #2107

rrremedio · 2016-09-02T00:12:05Z

Wouldn't it be more natural for the ".keep_all" parameter of the distinct function to default to TRUE? As the 'd' in dplyr suggests, I'd like to subset the data.frame (here comes the 'd') based on the unique values of certain variables, or of all of them, while keeping all columns, that is, keeping the data.frame.

const-ae · 2016-09-16T14:00:03Z

Actually, as far as I understand, keeping all variables was the default for distinct until dplyr version 0.5 (release notes). This issue contains more information on why they changed the behavior.

krlmlr · 2016-11-07T20:51:36Z

It's probably not worth the effort to change that, as it's easy to change the behavior or write your own wrapper. @hadley: Please comment.
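A minimal sketch of what such a wrapper could look like. The name `distinct_all_cols` and the example data are hypothetical and not part of dplyr; the wrapper only forwards its arguments to `distinct()` with a different default for `.keep_all`.

```r
library(dplyr)

# Hypothetical helper (not part of dplyr): same as distinct(),
# but .keep_all defaults to TRUE instead of FALSE.
distinct_all_cols <- function(.data, ..., .keep_all = TRUE) {
  distinct(.data, ..., .keep_all = .keep_all)
}

# Illustrative data only
df <- data.frame(x = c(1, 1, 2), y = c("a", "b", "c"),
                 stringsAsFactors = FALSE)

only_x   <- distinct(df, x)           # keeps only column x
all_cols <- distinct_all_cols(df, x)  # keeps x and y, one row per distinct x
```

Callers who want the current default back can still pass `.keep_all = FALSE` explicitly.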

hadley · 2016-11-07T21:11:17Z

I don't think it is more natural, because the values of the other columns are essentially arbitrary (row order generally shouldn't be considered meaningful).
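A small sketch of that point, using made-up data: with `.keep_all = TRUE`, `distinct()` keeps the first row it encounters for each distinct value, so reordering the rows can change which values of the other columns survive.

```r
library(dplyr)

# Illustrative data only
df <- data.frame(x = c(1, 1, 2), y = c("a", "b", "c"),
                 stringsAsFactors = FALSE)

# Keeps the first row for each distinct x
first <- distinct(df, x, .keep_all = TRUE)

# Same rows in reverse order: a different y is kept for x == 1
rev_df <- df[rev(seq_len(nrow(df))), ]
second <- distinct(rev_df, x, .keep_all = TRUE)

first$y[first$x == 1]    # "a"
second$y[second$x == 1]  # "b"
```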

hadley closed this as completed Nov 7, 2016

rubenarslan mentioned this issue Feb 27, 2017

Distinct without arguments on a grouped df #2476

Closed

lock bot locked as resolved and limited conversation to collaborators Jun 8, 2018
