Error in GENIE run #3
Comments
Hi Assaf, I am having the same problem when running my data. Did you figure out what your problem was? Thanks, Florian |
Hello, That code is at the end of the parallel computation, so as a temporary solution you might want to use nCores=1. To reproduce the error (and provide more useful help) I would need more information. What type of system are you using (Windows/Linux/Mac)? Some of the parallel functions are not available on Windows. Can you provide a minimal example (or the part of the data that is producing the error) and the output of sessionInfo(), so that I can try to reproduce the error? |
Hi there, I am running GENIE3 on a Linux cluster that uses a Torque scheduler. I am running the code on 1 node with 12 cores, so the parallel computation uses 12 cores. My data is a matrix of normalized expression values from Drop-seq. I tried running a subset of the data for computation speed, but also tried the full matrix, and both gave the same error. The current matrix I am trying to run is 225 cells x 7566 genes. Here is my code snippet, mainly adapted from the "Running SCENIC" tutorial. I load my expression matrix from a precomputed Seurat object and then filter out genes: Thanks for your help in advance!
|
Hi again, Thanks for the info! I have run GENIE3 using your code and some Drop-seq data with similar characteristics, but the only way I have managed to reproduce the error is by artificially changing the dimensions of So, just to make sure... can you confirm that the size of the expression matrix and the row names just before entering GENIE3 are what you expect? (the matrix should contain the gene names as rownames() ...)
I have added some extra checks for the next version (in case a similar error appears in the future), but if you would like to help find out exactly what is causing your error, you can re-run GENIE3 with the same settings after running
|
Thank you for the quick feedback. I was also troubleshooting, and I found that running with only 1 core (nCores=1) seems to work just fine. That suggests to me that there might be an issue with the parallelization when running on a remote node rather than on a local machine. Could the remote node be missing some dependencies? On our cluster we submit jobs via qsub to the Torque scheduler, which then launches a remote node that runs the job. All the required R packages are loaded, but maybe I am missing a Linux package that is present on the login node but not generally loaded on our worker nodes? I will let you know whether I can get it to work with multiple cores, but so far I have not had any luck. |
Have you checked whether the basic example in GENIE3 works (with multiple cores, of course)? We often run GENIE3 on a cluster with qsub as well, and we have not come across this error so far.
|
Thanks for the running example, should've tried with a simple small snippet like this, my bad. I will go over my data again and see what might cause the problem... |
All the smaller examples I have run so far now work, even with multiple cores on the cluster. A quick optimization question, which may be useful for other people as well: I had not considered this before, but does using normalized data slow down the GENIE3 run, since we are passing double values instead of integers and therefore have to load a lot more data into the function? |
Working fine now! I have to say I can't really pinpoint what the problem was. I guess when people run into this error the best way is to make sure that the input is a matrix and that inputTFs are in rownames of that matrix. I think your examples will help many people running into this issue. |
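For anyone landing here with the same error, the two checks mentioned above (the input is a matrix, and the inputTFs appear in its rownames) can be sketched in base R roughly as follows. This is only a minimal sketch: `check_genie3_input` is a hypothetical helper written for illustration, not a function from GENIE3, and the toy gene and cell names are made up.

```r
# Hypothetical helper (NOT part of GENIE3): basic sanity checks on the inputs.
# Returns a character vector describing the problems found (empty if none).
check_genie3_input <- function(exprMatrix, inputTFs) {
  problems <- character(0)

  # GENIE3 expects a numeric matrix, not a data.frame or a sparse object
  if (!is.matrix(exprMatrix) || !is.numeric(exprMatrix)) {
    problems <- c(problems, "exprMatrix is not a numeric matrix")
  }

  # Gene names are expected as rownames
  genes <- rownames(exprMatrix)
  if (is.null(genes)) {
    problems <- c(problems, "exprMatrix has no rownames (gene names expected)")
  } else {
    if (anyDuplicated(genes) > 0) {
      problems <- c(problems, "rownames(exprMatrix) contains duplicated gene names")
    }
    # Every regulator passed to GENIE3 should be a row of the matrix
    missingTFs <- setdiff(inputTFs, genes)
    if (length(missingTFs) > 0) {
      problems <- c(problems,
                    sprintf("%d inputTFs are not in rownames(exprMatrix)",
                            length(missingTFs)))
    }
  }
  problems
}

# Toy data (made-up names): 4 genes x 5 cells
set.seed(1)
exprMatrix <- matrix(rnorm(20), nrow = 4,
                     dimnames = list(paste0("Gene", 1:4), paste0("Cell", 1:5)))

ok  <- check_genie3_input(exprMatrix, c("Gene1", "Gene2"))
bad <- check_genie3_input(as.data.frame(exprMatrix), c("Gene1", "GeneX"))
print(ok)
print(bad)
```

Running the helper right before the GENIE3 call, on exactly the objects that are passed in, is the point of the exercise: filtering steps in between can silently change the matrix class or drop rownames.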
Hi, the problem may be that some parallel jobs die due to lack of memory. Moving to a cluster with more memory might help. |
Hello,
I get the following when I run GENIE:
Error in weightMatrix[regulatorNames, ] <- weightMatrix.reg :
number of items to replace is not a multiple of replacement length
Any idea what it might be?
Thanks,
Assaf
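For context, this message comes from base R rather than from GENIE3 itself: R raises it whenever the block assigned into a matrix does not have a compatible number of elements. The sketch below reproduces the message on a toy matrix; the names `weightMatrix` and `regulatorNames` are borrowed from the error text, the dimensions and gene names are made up, and it does not claim to show what actually went wrong inside GENIE3 in this case.

```r
# Toy weight matrix: 3 genes x 4 targets (all names are made up)
weightMatrix <- matrix(0, nrow = 3, ncol = 4,
                       dimnames = list(c("g1", "g2", "g3"), paste0("t", 1:4)))
regulatorNames <- c("g1", "g2")

# Matching shape (2 regulators x 4 targets): the assignment works
weightMatrix[regulatorNames, ] <- matrix(1, nrow = 2, ncol = 4)

# Mismatched shape (6 values for 8 cells): base R raises the error
msg <- tryCatch({
  weightMatrix[regulatorNames, ] <- matrix(2, nrow = 2, ncol = 3)
  "no error"
}, error = function(e) conditionMessage(e))

print(msg)
```

So the message indicates that the block being written back has a different shape than the rows selected by the regulator names, for example because the regulator names and the rows of the expression matrix do not line up.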