Large cohort #15

Closed
sararselitsky opened this issue Jun 4, 2018 · 7 comments

Comments

@sararselitsky

I tried running X-shift from the command line on 112 samples (>100K cells each) using 50 threads and 100G of RAM. It ran for 16 hours and then hit an out-of-memory error. I can keep tweaking the submission parameters for the cluster, but I was wondering what the maximum number of cells is that this has been successfully run on. Besides sub-sampling, is there a parameter I should use to reduce the computation? My cohort will soon increase to 350 samples, and I need a method capable of handling 40 million cells.

Thanks!

Sara

@nsamusik
Contributor

nsamusik commented Jun 4, 2018 via email

@sararselitsky
Author

Sure! See below:
CentOS Linux release 7.3.1611 (Core)
openjdk version "1.8.0_102"
OpenJDK Runtime Environment (build 1.8.0_102-b14)
OpenJDK 64-Bit Server VM (build 25.102-b14, mixed mode)

@nsamusik
Contributor

nsamusik commented Jun 4, 2018 via email

@sararselitsky
Author

There are 112 FCS files, each with around 100K cells. The original error came from the job submission program, not from X-shift or Java, so I am rerunning it. The rerun has now been going for 19 hours (80 threads, maximum 200G of RAM). Since my cohort will soon more than double in size, I was wondering whether you have experience with cohorts of a comparable size and what I can do to improve the speed, besides sub-sampling.
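
Since the error came from the job scheduler rather than from Java, one way to separate the two limits is to check how much heap the JVM is actually allowed and compare it with a rough lower bound for the raw data. The following is only a minimal sketch and is not part of X-shift: the class name `HeapCheck` is hypothetical, and the marker count of 40 is a made-up assumption, since the panel size is not stated in this thread. It also assumes each measurement is held as an 8-byte double.

```java
class HeapCheck {
    private static final double GIB = 1024.0 * 1024.0 * 1024.0;

    /** Maximum heap the JVM will attempt to use, in GiB (set with -Xmx). */
    static double maxHeapGiB() {
        return Runtime.getRuntime().maxMemory() / GIB;
    }

    /** Rough size of the raw cells-by-markers matrix, assuming 8-byte doubles. */
    static double rawDataGiB(long samples, long cellsPerSample, int markers) {
        long bytes = samples * cellsPerSample * markers * 8L;
        return bytes / GIB;
    }

    public static void main(String[] args) {
        int markers = 40; // hypothetical panel size, not taken from the thread

        System.out.printf("JVM max heap: %.1f GiB%n", maxHeapGiB());
        System.out.printf("Raw data for 112 x 100K cells: %.1f GiB%n",
                rawDataGiB(112, 100_000, markers));
    }
}
```

Running it with the same heap setting as the real job, for example `java -Xmx200g HeapCheck`, shows whether the JVM limit matches what the scheduler was asked for. The raw-data figure is only a floor: whatever working structures the clustering keeps in memory come on top of it, so it cannot predict the peak usage of an actual run.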

@sararselitsky
Author

I wanted to let you know that X-shift has now been running on the 112 samples (around 100K cells per FCS file) for 3 days and 19 hours, using 80 threads and 200G of RAM. Have you tested a cohort of a comparable size? If so, did you see run times like this? Thanks!

@nsamusik
Contributor

nsamusik commented Jun 8, 2018 via email

@sararselitsky
Author

Ok, thanks!
