You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
While running the distributed PCA defined here, I've found that I need to tune my cluster so that I can accommodate the finalize task without nodes crashing out due to memory usage.
It would be nice to be able to estimate what amount of memory is required for a particular data size ahead of time.
The text was updated successfully, but these errors were encountered:
Yeah, this would be nice to document. You might want to check what’s in the dask-array documentation, or the paper the implementation is based on: https://arxiv.org/abs/1301.1071
While running the distributed PCA defined here, I've found that I need to tune my cluster so that I can accommodate the
finalize
task without nodes crashing out due to memory usage.It would be nice to be able to estimate what amount of memory is required for a particular data size ahead of time.
The text was updated successfully, but these errors were encountered: