Memory errors for large datasets #141

Open
mortonjt opened this issue Oct 21, 2020 · 4 comments

@mortonjt
Collaborator

mortonjt commented Oct 21, 2020

For datasets with >10k samples, the memory requirements can be quite high.
If there isn't enough memory available, this can throw an out-of-memory error.

@mortonjt mortonjt reopened this Oct 24, 2020
@fedarko
Collaborator

fedarko commented Jun 3, 2021

One relatively straightforward thing that might help with this (for the QIIME 2 version, at least) is adding an extra command or parameter* that disables the construction of the biplot. I was running Songbird on a large-ish dataset (~60k features, ~100 samples: this matrix), where the call to np.linalg.svd(differentials) here caused an error about there not being enough memory to allocate for the array (it was something like 16 GB of memory that was needed? this was on my laptop). I commented out the biplot code so that this line was always used to create an empty biplot, and then Songbird seemed to work without a problem.
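A possible alternative to skipping the biplot entirely, sketched here under assumptions: as written above, np.linalg.svd(differentials) uses NumPy's default full_matrices=True, which builds a square U with one row and one column per feature. Passing full_matrices=False asks for the reduced SVD instead, so U only has as many columns as the smaller dimension of the input. The matrix shape below (2000 x 3) and the column count of 3 are stand-ins, not Songbird's actual values, and this has not been tested inside Songbird.

```python
import numpy as np


def reduced_svd(differentials):
    """Reduced (thin) SVD: U has shape (n_rows, k), k = min(n_rows, n_cols)."""
    u, s, vt = np.linalg.svd(differentials, full_matrices=False)
    return u, s, vt


# Small stand-in for a (features x columns) differentials matrix.
# The shape is an assumption chosen only to keep the example fast.
rng = np.random.default_rng(0)
differentials = rng.normal(size=(2000, 3))

u, s, vt = reduced_svd(differentials)
print(u.shape, s.shape, vt.shape)

# Back-of-the-envelope size of U for ~60k features, float64 (8 bytes each).
n_features = 60_000
n_cols = 3  # assumed column count, for illustration only
full_u_gb = n_features * n_features * 8 / 1e9  # square U from the default call
thin_u_gb = n_features * n_cols * 8 / 1e9      # U from the reduced call
print(f"full U: {full_u_gb:.1f} GB, reduced U: {thin_u_gb:.5f} GB")
```

For 60,000 features the square U alone is about 28.8 GB in float64 (about 14.4 GB in float32), while the reduced U is on the order of a megabyte. A biplot only uses the leading components, so the reduced factors should carry the same information.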

With the advent of BIRDMAn this may not be an urgent issue, tho.

* This might need to be a command, I guess, since I don't think QIIME 2 currently has ways of varying the number of outputs. Or it could just be a parameter where "hey if you specify this an empty biplot will be generated"
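A minimal sketch of the parameter idea, assuming a hypothetical skip_biplot flag and a hypothetical helper name (biplot_factors); neither exists in Songbird or QIIME 2 as far as this thread shows. The zero-width arrays are only a placeholder for whatever "empty biplot" object the real code constructs.

```python
import numpy as np


def biplot_factors(differentials, skip_biplot=False):
    """Return (u, s, vt) for a biplot, or empty placeholders if skipped.

    `skip_biplot` is a hypothetical flag used for illustration only.
    """
    n_rows, n_cols = differentials.shape
    if skip_biplot:
        # No SVD at all: zero components, so nothing large is allocated.
        return np.zeros((n_rows, 0)), np.zeros(0), np.zeros((0, n_cols))
    # Reduced SVD keeps U at (n_rows, min(n_rows, n_cols)).
    return np.linalg.svd(differentials, full_matrices=False)


differentials = np.arange(12, dtype=float).reshape(4, 3)

u_empty, s_empty, vt_empty = biplot_factors(differentials, skip_biplot=True)
print(u_empty.shape, s_empty.shape, vt_empty.shape)

u, s, vt = biplot_factors(differentials)
print(u.shape, s.shape, vt.shape)
```

Keeping the number of outputs fixed and returning an empty placeholder matches the second option described above, where the output set stays the same and only its contents change.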

@mortonjt
Collaborator Author

mortonjt commented Jun 3, 2021 via email

@fedarko
Collaborator

fedarko commented Jun 3, 2021 via email

@mortonjt
Collaborator Author

mortonjt commented Jun 3, 2021 via email
