Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upget_dupes has incorrect duplicates and dupe_count when there's a column called "n" in the data frame #162
Comments
|
Great bug report! Thank you. Providing the source of the problem ( |
|
(also I had not seen |
If the data frame that I'm trying to check for duplicates has a column called n (ex. from
dplyr::count()ordplyr::add_count(), thenget_dupes()gets a bit confused in terms of the numbering. I believe this is becauseget_dupes()usesdplyr::count(), which adds a count column called "nn" instead of "n", butget_dupes()is still using n as the count column.For example, in this toy example student 102 does not have a duplicate for
subject = 1, yetget_dupes()is giving three duplicates instead of 2.