-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ARROW-17788: [R][Doc] Add example of using Scanner #14184
ARROW-17788: [R][Doc] Add example of using Scanner #14184
Conversation
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for adding these examples! Just one small change to suggest here.
r/R/dataset-scan.R
Outdated
#' dir.create(tf) | ||
#' on.exit(unlink(tf)) | ||
#' | ||
#' data <- dplyr::group_by(mtcars, cyl) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Given we don't have dplyr as a dependency, could it be worth removing the group_by()
call here, and manually specifying the partitioning
argument to write_dataset()
instead?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point. I've replace the call I added as well as the one in the other example I copied from.
ad62dc5
to
d6cb1f7
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Excellent, thanks!
Lots of CI failures here, should this be rebased? |
@@ -131,8 +131,7 @@ | |||
#' dir.create(tf) | |||
#' on.exit(unlink(tf)) | |||
#' | |||
#' data <- dplyr::group_by(mtcars, cyl) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The complete removal of data
here breaks the example further down write_dataset(data, tf2, format = "ipc")
- please can you update that too?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for pointing that out. I've updated that example and tested locally on my computer to make sure it works now.
My bad, missed an error caused by a change as reviewed before the CI had finished running - doesn't need rebasing. Added feedback now. |
d6cb1f7
to
78a8816
Compare
Benchmark runs are scheduled for baseline = 66e8ba5 and contender = 959a9d5. 959a9d5 is a master commit associated with this PR. Results will be available as each benchmark for each run completes. |
No description provided.