Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request for *_ply to take parallel? #60

Closed
jameshowison opened this issue Sep 22, 2011 · 5 comments
Closed

Request for *_ply to take parallel? #60

jameshowison opened this issue Sep 22, 2011 · 5 comments

Comments

@jameshowison
Copy link

Sorry if this is the wrong place, but I wondered why d_ply doesn't have a parallel argument, I'm splitting up a data frame and drawing graphs with it, seems a good use for parallelization ... did a quick search of the list and Hadley says it isn't supported back in Feb. Is this something that might be supported?

Thanks,
James

@hadley
Copy link
Owner

hadley commented Sep 26, 2011

Yes, definitely. Just need to find the time to work on it.

@myersdb
Copy link

myersdb commented Jun 27, 2012

Glad to see this will be happening. I commonly operate on a large set of large files, performing batch operations of GIS analysis via sp and raster on a large cluster. Similar to the above case where intermediate and final products are stored as files rather than as R objects.

@kenahoo
Copy link

kenahoo commented Aug 15, 2012

While we're on this topic - would it be beneficial for the **ply() family of functions to also take an argument indicating whether side-effects should be propagated or not? If they knew that they only needed to transmit the return value back to the caller, and not transmit the entire set of changes that happened to the R environment, they could optimize the parallel execution environment.

Or is that best handled by encapsulating the behavior in the foreach backend?

@hadley
Copy link
Owner

hadley commented Aug 16, 2012

I think that should be the default - and if you want side effects propagated outside, you need to use the parallel tools directly.

@kenahoo
Copy link

kenahoo commented Aug 16, 2012

Yeah, that seems reasonable, because it's not clear what automatic/default strategy should be used to merge all the effects back in to the main process.

@hadley hadley closed this as completed in fb3d2bb Oct 8, 2012
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants