Plotting fails on ROCs for large data sets #1
I have a data set with ~2.7 million entries heavily skewed toward negative samples. I made a boot.roc object via this call:
boot.roc(data$prediction, data$actual, n.boot=1000, use.cache = TRUE)
Calling plot on the object then fails with this error immediately:
I then tried a smaller number of bootstrap samples (100). That does not produce an error message, but the plot call appears to hang.
Thank you for the report. Unfortunately (or fortunately for me) I am in the first week of a three-week vacation. I suspect I won't get to work on it before the second half of August.
Is the data set confidential or can you share it? If it is confidential I still would like to know:
Now fixed on GitHub. The problem was caused by a stupid default for the number of points at which the ROC curve is calculated. The fix should also improve performance when plotting.
Until a new version is on CRAN (which will take a while yet), either use the newest version from GitHub or switch the positive and negative class as a workaround.
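For anyone wanting to try the class-switching workaround, here is a minimal sketch. It uses made-up data in place of the reporter's data set, and it assumes that boot.roc takes numeric predictions and a logical class vector, as in the call in the original report. Switching classes means negating the labels and also negating the scores, so that higher values still point to the (new) positive class. The small auc helper is a hypothetical base-R function, not part of fbroc, included only to check that the AUC is unchanged by the switch.

```r
# Made-up stand-in data: numeric scores and logical labels,
# heavily skewed toward the negative class.
set.seed(1)
n <- 2000
actual <- runif(n) < 0.05
prediction <- rnorm(n, mean = ifelse(actual, 1, 0))

# Switch the classes: negate the labels, and negate the scores so that
# higher values still indicate the new positive class.
flipped_actual <- !actual
flipped_prediction <- -prediction

# Hypothetical helper (not part of fbroc): rank-based AUC,
# i.e. the Mann-Whitney U statistic divided by n_pos * n_neg.
auc <- function(score, label) {
  r <- rank(score)
  n_pos <- sum(label)
  n_neg <- sum(!label)
  (sum(r[label]) - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)
}

# The AUC should be the same before and after the switch.
print(auc(prediction, actual))
print(auc(flipped_prediction, flipped_actual))

# Only run the fbroc part if the package is installed.
if (requireNamespace("fbroc", quietly = TRUE)) {
  roc_flipped <- fbroc::boot.roc(flipped_prediction, flipped_actual,
                                 n.boot = 100)
  plot(roc_flipped)
}
```

Note that the plotted curve is mirrored relative to the original one, because the true positive rate of the switched problem is the true negative rate of the original. The AUC itself is unaffected.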