issues when loading a rsample tibble with rdata #36

Closed
csetraynor opened this issue Apr 26, 2018 · 3 comments

@csetraynor

Hi,

I know this isn't a great contribution, but I wanted to report a problem (possibly a bug) I found when loading a big dataset. My dataset has 86 rows and 7000 columns (a gene expression dataset). When I saved my environment and tried to load it again, I ran out of memory (32 GB of RAM, Windows 10). I understand that rsample stores large data in a memory-efficient way while working, but perhaps this does not carry over when loading, because R needs to read each dataset separately?
I was using the usual commands:
save from the RStudio Environment pane
load("my_env.Rdata")

Error message (classic):
Memory Allocation “Error: cannot allocate vector of size 75.1 Mb”

I have already increased memory.limit and checked the CPU performance.

carlos,

@csetraynor
Author

I just want to add that I closed the issue because I understand this is probably not the place to ask. I solved the problem by working one model at a time: fit a model, save it and its performance measures, clean the workspace, then fit the next model, and so on.
That is just being a bit more clever than I was when I asked.
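
For anyone landing here later, the fit, save, clean loop described above could look roughly like the sketch below. It uses only base R; the `mtcars` data, the `lm()` models, the `out_dir` location, and the `fit_and_store()` helper are all stand-ins chosen for illustration, not what was actually run in this issue.

```r
# Hypothetical sketch: fit one model at a time, write the full model to disk,
# and keep only a small table of measures in memory.
out_dir <- file.path(tempdir(), "models")   # assumed output location
dir.create(out_dir, showWarnings = FALSE)

# Stand-in model specifications (the real models were not shown in the issue).
formulas <- list(
  m1 = mpg ~ wt,
  m2 = mpg ~ wt + hp,
  m3 = mpg ~ wt + hp + qsec
)

fit_and_store <- function(name, form, data, dir) {
  fit <- lm(form, data = data)
  # One .rds file per model instead of one large workspace image.
  saveRDS(fit, file.path(dir, paste0(name, ".rds")))
  # Only this one-row summary is returned; `fit` goes out of scope on return.
  data.frame(model = name, r_squared = summary(fit)$r.squared)
}

measures <- do.call(rbind, lapply(names(formulas), function(nm) {
  res <- fit_and_store(nm, formulas[[nm]], mtcars, out_dir)
  gc()  # ask R to release the memory held by the discarded model
  res
}))

print(measures)
```

A single model can then be read back with `readRDS(file.path(out_dir, "m1.rds"))` when it is needed, rather than loading everything at once with `load()`.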

@topepo
Member

topepo commented Apr 29, 2018

Thanks for the comment. This is very helpful.

This is a big issue, and one that I think is impossible to get around. caret doesn't save the intermediate models for this exact reason. I think people will expect that the models should be saved, and it can be surprising how much space some models occupy.
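
To get a feel for how much space fitted models take, a rough check with base R's `object.size()` may help. The `object_sizes()` helper below is hypothetical (it is not part of rsample or caret), and the `lm()` fit on `mtcars` is only a small stand-in. Note that `object.size()` gives an estimate: it does not account for memory shared between objects, and it does not follow environments that a model may have captured.

```r
# Hypothetical helper: estimated size in bytes of every object in an
# environment, largest first.
object_sizes <- function(env = parent.frame()) {
  nms <- ls(env)
  bytes <- vapply(
    nms,
    function(nm) as.numeric(object.size(get(nm, envir = env))),
    numeric(1)
  )
  sort(bytes, decreasing = TRUE)
}

# Stand-in example: a fitted model next to the small summary one might keep.
demo_env <- new.env()
demo_env$fit <- lm(mpg ~ ., data = mtcars)
demo_env$r_squared <- summary(demo_env$fit)$r.squared

print(object_sizes(demo_env))
print(format(object.size(demo_env$fit), units = "Kb"))
```

Running `object_sizes()` with no arguments at the console lists the objects in the current workspace, which shows what a saved workspace image would have to hold.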

@github-actions

This issue has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex https://reprex.tidyverse.org) and link to this issue.

@github-actions github-actions bot locked and limited conversation to collaborators Feb 22, 2021