Request for *_ply to take parallel? #60

jameshowison · 2011-09-22T23:16:53Z

Sorry if this is the wrong place, but I wondered why d_ply doesn't have a parallel argument. I'm splitting up a data frame and drawing graphs with it, which seems a good use for parallelization. I did a quick search of the list, and Hadley said back in February that it isn't supported. Is this something that might be supported?

Thanks,
James

hadley · 2011-09-26T13:29:31Z

Yes, definitely. Just need to find the time to work on it.

myersdb · 2012-06-27T15:19:28Z

Glad to see this will be happening. I commonly operate on a large set of large files, performing batch GIS analysis via sp and raster on a large cluster. This is similar to the case above, in that intermediate and final products are stored as files rather than as R objects.

kenahoo · 2012-08-15T01:38:57Z

While we're on this topic - would it be beneficial for the **ply() family of functions to also take an argument indicating whether side effects should be propagated or not? If they knew that they only needed to transmit the return value back to the caller, and not the entire set of changes made to the R environment, they could optimize the parallel execution environment.

Or is that best handled by encapsulating the behavior in the foreach backend?

hadley · 2012-08-16T13:36:07Z

I think that should be the default - and if you want side effects propagated outside, you need to use the parallel tools directly.

kenahoo · 2012-08-16T13:48:44Z

Yeah, that seems reasonable, because it's not clear what automatic/default strategy should be used to merge all the effects back into the main process.

hadley closed this as completed in fb3d2bb Oct 8, 2012
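
For readers following the side-effects discussion above, here is a minimal sketch of the "return values only" behaviour, written with base R's parallel package rather than plyr itself. It is an illustration under stated assumptions, not plyr's implementation: the `work` function and the `counter` variable are hypothetical names made up for this example.

```r
library(parallel)

counter <- 0  # state that lives in the parent R process

# Hypothetical worker function: it changes a global variable (a side effect)
# and also returns a value.
work <- function(i) {
  counter <<- counter + 1  # only touches the worker process's own copy
  i^2                      # the return value is what gets sent back
}

cl <- makeCluster(2)          # two separate worker R processes
clusterExport(cl, "counter")  # copy the parent's counter into each worker
results <- parLapply(cl, 1:4, work)
stopCluster(cl)

# Return values arrive in the parent process...
stopifnot(identical(unlist(results), c(1, 4, 9, 16)))
# ...but the workers' changes to counter were never merged back.
stopifnot(identical(counter, 0))
```

Side effects that happen outside the R environment, such as writing a plot to a file from each worker, do not need to be merged back, which is why the graph-drawing and file-based use cases above fit this model.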
