-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KeyError: 'P1' #53
Comments
Dear Rafal, Just to be sure, for other phenotypes with the same kmers-table the pipeline run normally ? So it is something specific for these two phenotypes? Can you upload the log file from the run directory, and also the list of all files under the run directory ? Best, |
Hello Yoav, I too had same issue. In my case, it was because all the values were not rounded to same number of digits. With best regards Sandip |
Hi Sandip, Thank you very much for sharing this info and nice catch/debugging. I have no understanding of why this happened. I will check if I can fix or at least present a normal error message for these cases. Rafal, can Sandip explanation explain your problem? Best, |
Hi Yoav and Sandip, I am still trying to figure it out; I don't think it is just the number of digits. I have one phenotype file that has similar variation in number of digits and it works well. I am experimenting with modifying the file that didn't work, but have no conclusions yet. If that helps, I can send to you two phenotype files (one that works, and one that doesn't). Best, |
Dear Rafal, I think it can be faster from your side if you will let me help you debug this. Can you respond to my previous message: Can you upload the log file from the run directory, and also the list of all files under the run directory ?" Best, |
I was trying to reproduce this error today to get the list and the log file that you requested, but no error showed up. I got assigned to CPU's rather than GPU's on cluster today, but I don't know if that should make any difference... Please keep this issue opened for a while, and I will report when I see the error again. There is a lot of phenotypes I will run next week, so hoping (not really... but would be happy to help debug) to see the error again. Best, |
Yes,
I also observed that, it is relared to CPUs. I run same job on two
different servers. It worked well on one but reported error on another. I
can share the log files.
With best regards
Sandip
…On Friday, July 31, 2020, Rafal M Gutaker ***@***.***> wrote:
I was trying to reproduce this error today to get the list and the log
file that you requested, but no error showed up. I got assigned to CPU's
rather than GPU's on cluster today, but I don't know if that should make
any difference...
Please keep this issue opened for a while, and I will report when I see
the error again. There is a lot of phenotypes I will run next week, so
hoping (not really... but would be happy to help debug) to see the error
again.
Best,
Rafal
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#53 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AGW37GARRTWVNELEQMJ72DTR6H6JRANCNFSM4PIGLRPQ>
.
|
Dear Sandip and Rafal, I would be curious to look at the log files, and also if you have the list of files it succeed in creating. I have two guesses for the reason for this, one is that the R packages which it depends on are missing from one cluster. The other, is that the compilation of the C++ code does not work in this system or it lacks the SSE4.1 commands the code depends on (less likely). Best, |
test.tar.gz
<https://drive.google.com/file/d/1bAxEiGKgPIfzNmS96yjn1w-RkNtx766o/view?usp=drive_web>
Hello Yoav,
Thanks for reply. I checked the log but didn't find any obvious issues.
Here are the files.
Along with that, it also created pheno.0.phenotype_value.bim
and pheno.0.phenotype_value.bed files. But they are big and so not
included.
Let me know if you need further information
Thanks and Regards
Sandip
…On Fri, Jul 31, 2020 at 9:17 AM Yoav Voichek ***@***.***> wrote:
Dear Sandip and Rafal,
I would be curious to look at the log files, and also if you have the list
of files it succeed in creating.
I have two guesses for the reason for this, one is that the R packages
which it depends on are missing from one cluster. The other, is that the
compilation of the C++ code does not work in this system or it lacks the
SSE4.1 commands the code depends on (less likely).
Best,
Yoav
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#53 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AGW37GCRISSZKQB4YGWVHE3R6JVY5ANCNFSM4PIGLRPQ>
.
|
Dear Sandip, Thanks for sharing this! I downloaded the file so if you want you can remove the sharing. It seems that GEMMA was not able to run as there is nothing in the output directory (kmers/output). Can you check if when you just run the GEMMA (e.g. with --help), it returns an error on this server? In any case, my two previous guesses seems to be wrong from this log files. |
I have run into the "P1" issue again. I traced it back and could see that it happens in the same partition of our slurm cluster. I logged in onto that partition and ran gemma -h without issue. If you still need it, you can download the whole run directory here: https://drive.google.com/file/d/1qXEzpK8w7q3T7b0PbYIpFGs3ADAfq3vG/view?usp=sharing |
Hello Yoav,
I will try.
Thanks and regards
Sandip
…On Fri, Jul 31, 2020 at 10:59 AM Yoav Voichek ***@***.***> wrote:
Dear Sandip,
Thanks for sharing this! I downloaded the file so if you want you can
remove the sharing.
It seems that GEMMA was not able to run as there is nothing in the output
directory (kmers/output).
Can you check if when you just run the GEMMA (e.g. with --help), it
returns an error on this server?
In any case, my two previous guesses seems to be wrong from this log files.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#53 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AGW37GCD4RZ7J25G6RLQ7ETR6KBXZANCNFSM4PIGLRPQ>
.
|
Dear Rafal,
I was very shocked that the number of total SNPs = 10001 . Actually my total kmer number is 25248947,and there were 25248947 in the file “/kmers/pheno.tested_kmers”.
But why? n_kmers=10001 and /associate_kmers ... -n 10001 ... ???? Much obliged for your help. |
Hi Lierk, When you have an unrelated issue, it is good practice to start a new thread. Additionally, if you need help you should address the pipeline developer (not me, a random guy) in your message. Finally, to save you some time, please read the paper by Voichek and Weigel carefully - they refer to the methods that result in your k-mer number. Best, |
Dear Rafal, First, thanks for answering Lierk. Indeed, this issue is explained in our methods. I've downloaded the directory you shared, thank you. However, it seems to have finished running the pipeline without errors, right? Regarding running GEMMA on the part of the clusters that produce the error. Just to make sure, did you try to run the same gemma we use in the pipeline. In your case, it is located in: If you just typed gemma, it might tried to run a local version, and then it is not the right control. Best, |
Hi Rafal, Best, |
Dear Yoav, Best, |
Can I close this until it will bother someone again? |
Ah, yes, sorry, I got into a whirlwind of things. Once I have a little more time to check and properly identify my failed run, I will reopen. Thanks, |
Hey all, thanks for the discussion on this topic. I am currently trying to run the pipeline and get the same error. I have ~850 genomes and associated phenotype data, i.e. normalised growth under different conditions. I obtain the same error as @grafau . I tried to look into the logs and it seems the error might be connected with the permutations running with gemma. If I call gemma without any parameters there doesn't seem to be a problem. If I run one of the permutations from the log file I obtain an 'Illegal instruction (core dump)' error and no output is generated. I assume that this leads to the python error when trying to access the output. Thus, do you have an idea what might cause the error in the gemma execution? Thanks Michael |
I think this error can occur from multiple reasons. First, it is good you checked that gemma can run. Also can you copy here the full command line you used? |
Thanks for your quick reply. The files are attached. The gemma command is:
and the overall command is:
The tar.gz of the run directory is here: https://www.dropbox.com/s/cpd5pqa4zrg88q4/testdir.tar.gz?dl=0 Thanks |
Dear Michael, From looking at your directory it seems that indeed the gemma step was not able to run. I now run the following two commands (from the directory you sent me with my local gemma executable): Both of them run and outputted the expected files. BTW, if you can run the first command you can already get the strongest k-mers, inside the kmers/output directory. You will not know the permutation defined threshold, but you can use in the meantime the a Bonferroni threshold using the number of unique k-mers presence/absence patterns. The number of tests/unique patterns in your case is 17,378,387 (found in Best wishes, |
Hi Yoav, thanks a lot for your efforts. No, unfortunately I am getting the error with both commands (Illegal instruction (core dumped)). Thus I think it has to do something with the Gemma on my system. When I just run Gemma without any options it does show the version (or Gemma -h gives the help). Cheers |
Dear Michael, Please try to run the same GEMMA command with the new release of GEMMA (0.98.1) If it will work then I can will modify the pipeline a bit so it can also take results from the newer version of gemma. Please let me know if it works, otherwise we should open an issue on their github page. Best, |
Dear Yoav, the commands run with the static build of GEMMA (0.98.1). So there is something not okay on my system with the build that is delivered with your package. I also downloaded the 0.96 build from GitHub and obtained the same error. Cheers |
Dear Michael, I tried to do a quick fix so the pipeline will run with gemma 0.98.1, please try to run the pipeline after replacing these two files with the files with the same names in the main library directory and under src/py/ Then you can indicate to the pipeline to use the gemma 0.98.1 you downloaded using the Please let me know if this solves the problem. |
Hi Yoav, replacing the file and running the pipeline works. I now get the pass_threshold_x files with the significant k-mers. Now I can try to check out what these k-mers are ;) Thanks a lot for your help Cheers |
Dear Michael, This is good to hear, I will try to find time to incorporate this changes for a next release so people can use different GEMMA versions. Best wishes, |
Thanks for a quick reply to my previous issue. Here is another one, and I would be grateful for your help.
For two of many phenotypes I tried, I receive the following error from cluster:
It seems that the pipeline finished all 100 permutations, but collapsed around summarizing the results and deleting plink files (below are last 3 lines of log_file)
Here are EMMA_perm.log
Much obliged for your help.
The text was updated successfully, but these errors were encountered: