Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dplyr:distinct is not more performant than base:unique #1519

Closed
ljos opened this issue Nov 8, 2015 · 1 comment
Closed

dplyr:distinct is not more performant than base:unique #1519

ljos opened this issue Nov 8, 2015 · 1 comment

Comments

@ljos
Copy link

ljos commented Nov 8, 2015

library(microbenchmark)
library(nycflights13)
library(dplyr)
tails <- select(flights, tailnum)
microbenchmark(unique(tails), distinct(tails), times=1000)
Unit: milliseconds
            expr       min       lq     mean   median       uq      max neval
   unique(tails)  8.728119 10.25836 12.32073 11.92595 13.18344 73.76151  1000
 distinct(tails) 49.051397 51.84157 53.33416 52.98128 54.06816 67.46064  1000

R version:
x86_64, darwin15.0.0
R version 3.2.2 (2015-08-14)
Fire Safety
(Built with openblas)

dplyr version:
0.4.3.9000
(Installed with devtools::install_github() on 2015.11.08)

@romainfrancois
Copy link
Member

related to #977

@hadley hadley closed this as completed Mar 1, 2016
@lock lock bot locked as resolved and limited conversation to collaborators Jun 9, 2018
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants