Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Roary seemed to have stopped prematurely; any way to continue the run? #380

Closed
sentausa opened this issue Jan 12, 2018 · 8 comments
Closed

Comments

@sentausa
Copy link

Hi,

I don't know why, my roary run seemed to have stopped too soon after producing gene_presence_absence.Rtab file. Is there a way to continue the run from this point?

This is the command that I used to run it:
roary -p 14 -e --mafft -r -o pan-bacteria -v ./gff/*.gff

And these are the output that I have:
-rw-r--r-- 1 es249628 big 6,5G janv. 11 21:16 gene_presence_absence.Rtab
-rw-r--r-- 1 es249628 big 11G janv. 11 20:10 gene_presence_absence.csv
-rw-r--r-- 1 es249628 big 96M janv. 11 18:34 core_accessory_graph.dot
-rw-r--r-- 1 es249628 big 96M janv. 11 18:21 accessory_graph.dot
-rw-r--r-- 1 es249628 big 123K janv. 11 16:01 _accessory_clusters.clstr
-rw-r--r-- 1 es249628 big 594K janv. 11 16:01 _accessory_clusters
-rw-r--r-- 1 es249628 big 115K janv. 11 15:54 accessory_binary_genes.fa.newick
-rw-r--r-- 1 es249628 big 13M janv. 11 15:51 accessory_binary_genes.fa
-rw-r--r-- 1 es249628 big 368M janv. 11 10:01 pan-bacteria
-rw-r--r-- 1 es249628 big 368M janv. 11 08:45 _labeled_mcl_groups
-rw-r--r-- 1 es249628 big 354M janv. 11 08:45 _inflated_mcl_groups
drwx------ 2 es249628 big 4,0K janv. 11 08:43 2agvKs2tFr
drwx------ 2 es249628 big 188K janv. 11 08:23 07NXVoFAfK
-rw-r--r-- 1 es249628 big 354M janv. 11 07:12 _inflated_unsplit_mcl_groups
-rw-r--r-- 1 es249628 big 43M janv. 11 07:11 _uninflated_mcl_groups
-rw-r--r-- 1 es249628 big 65 janv. 11 06:15 blast_identity_frequency.Rtab
-rw-r--r-- 1 es249628 big 819M déc. 26 10:33 _clustered.clstr
-rw-r--r-- 1 es249628 big 635M déc. 26 10:32 _clustered
-rw-r--r-- 1 es249628 big 6,2G déc. 26 00:47 _combined_files
-rw-r--r-- 1 es249628 big 0 déc. 24 12:45 _combined_files.groups

The 07NXVoFAfK directory contains .gff.proteome.faa files, while 2agvKs2tFr contains "group" files.

Thanks in advance for your kind help.

@andrewjpage
Copy link
Member

andrewjpage commented Jan 12, 2018 via email

@sentausa
Copy link
Author

sentausa commented Jan 12, 2018

Actually they are from the same genus, different species. The whole genus. Roary doesn't like that?

If I want to continue anyway, how to run the other roary scripts? Where can I find them? I just want to get the core_gene_alignment.aln file to make tree later.

Thanks very much.

@andrewjpage
Copy link
Member

andrewjpage commented Jan 12, 2018 via email

@sentausa
Copy link
Author

So you mean that roary failed to produce the alignment? Because I couldn't see anything that indicates that it was done during the run.

@andrewjpage
Copy link
Member

It failed before producing an alignment. The cause is likely to be your input data.

@sentausa
Copy link
Author

Does it have anything to do with the fact that there are many draft genomes in my data? Do you think it'll be better if I use only complete genomes?

Thank you again.

@andrewjpage
Copy link
Member

andrewjpage commented Jan 12, 2018 via email

@sentausa
Copy link
Author

I'm reusing publicly available genomes from a whole genus, and it's true that there are highly fragmented assemblies. So well, I think I'll try to remove them. Or it might be easier to just ignore the draft genomes.... I just need a global picture of the genus anyway.

Thank you for all your help!

Regards,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants