Distinct creates duplicate columns when groups are included in the distinct #2001

ajongbloets · 2016-07-06T17:23:04Z

When I call distinct() on a grouped data.frame and explicitly include the grouping columns in the call, I get duplicated columns:

library(dplyr)

df <- data.frame(a = c(1, 2, 3), b = c(4, 5, 6), c = c(7, 8, 9))

df %>% group_by(a) %>% distinct(a, b)
# creates a data.frame with two 'a' columns

Putting 'a' in the distinct() call is not necessary, since distinct() on a grouped data.frame already takes the grouping columns into account and keeps them in the result. So this problem is easily worked around on the user side.
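For reference, the user-side workaround could look like the sketch below. It reuses the `df` from the report; the names `res1` and `res2` are only illustrative, and the exact column order may differ between dplyr versions.

```r
library(dplyr)

df <- data.frame(a = c(1, 2, 3), b = c(4, 5, 6), c = c(7, 8, 9))

# Workaround 1: leave the grouping column out of distinct().
# It is still kept in the result because the data is grouped by it.
res1 <- df %>% group_by(a) %>% distinct(b)

# Workaround 2: drop the grouping first, then name every column explicitly.
res2 <- df %>% group_by(a) %>% ungroup() %>% distinct(a, b)

names(res1)
names(res2)
```

Both variants should give a result with a single 'a' column.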

krlmlr · 2016-11-07T22:50:38Z

Thanks, confirmed. Would you like to contribute a testthat test?
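A testthat test for this could be sketched roughly as follows. This is only an illustration built from the example in the report, not the test that was eventually added to dplyr; the test description and the expected column names are assumptions about the intended behavior.

```r
library(dplyr)
library(testthat)

test_that("distinct() on grouped data does not duplicate grouping columns", {
  df <- data.frame(a = c(1, 2, 3), b = c(4, 5, 6), c = c(7, 8, 9))

  out <- df %>% group_by(a) %>% distinct(a, b)

  # No column name may appear twice in the result.
  expect_equal(anyDuplicated(names(out)), 0L)
  # Only the requested columns are present (order not asserted).
  expect_equal(sort(names(out)), c("a", "b"))
  # All three rows are already distinct.
  expect_equal(nrow(out), 3L)
})
```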

krlmlr added the bug (an unexpected problem or unintended behavior) label Nov 7, 2016

hadley added the data frame label Feb 20, 2017

krlmlr mentioned this issue Feb 21, 2017

distinct create duplicated column names #2109

Closed

hadley closed this as completed in 267ebb8 Feb 26, 2017

lock bot locked as resolved and limited conversation to collaborators Jun 8, 2018
