-
Notifications
You must be signed in to change notification settings - Fork 190
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Could not obtain pan_genome_sequences #426
Comments
@pedrorvc type |
So I have been trying to understand what causes this error and, after various attempts, the problem seems to lie in the amount of .gff files given to process. |
Apologies for closing the issue due to a misclick. |
Maybe you are running out of RAM ? |
It don't think so. I accompanied the RAM usage of my most recent failed run and I still had about 6GB of RAM available. |
@pedrorvc I have used Roary with more than 1000 files and have not had any issues. |
What does |
For my account
|
Your allowable number of open files is too low. For example on my system its: I would recommend you increase it (your system administrator can assist if you don't know how to do it). |
Additionally you appear to be running quite old versions of software which would indicate your running a very old version of linux? |
@andrewjpage I increased the number of open files and it solved the problem, thank you very much! |
Hi andrewjpage, I changed the value of open files to 1048576, but the problem is still not solved. I used 1800 gff files to produce pan genome. My machine information: 16 VCPUS RAM 64G Instances 2 5T I don't konw how to solve this problem. I have the same mistake information as him. My mistake information as follows: please cite: O. Tange (2011): GNU Parallel - The Command-Line Power Tool, This helps funding further development; AND IT WON'T COST YOU A CENT. To silence this citation notice: run 'parallel --citation'. Academic tradition requires you to cite works you base your article on. O. Tange (2011): GNU Parallel - The Command-Line Power Tool, This helps funding further development; AND IT WON'T COST YOU A CENT. To silence this citation notice: run 'parallel --citation'. Academic tradition requires you to cite works you base your article on. O. Tange (2011): GNU Parallel - The Command-Line Power Tool, This helps funding further development; AND IT WON'T COST YOU A CENT. To silence this citation notice: run 'parallel --citation'. Attribute (fasta_file) does not pass the type constraint because: Validation failed for 'Str' with value undef at reader Bio::Roary::Output::GroupsMultifastaNucleotide::fasta_file (defined at /usr/local/share/perl/5.22.1/Bio/Roary/Output/GroupsMultifastaNucleotide.pm line 29) line 15 Looking forward to your reply. Thank you! Best regards, Jinxiang |
Hi, I used 2472 gff files to produce pan genome and encountered this problem too, while the software could successfully run with 232 gff files. This is my mistake information: "Please cite Roary if you use any of the results it produces: Use of uninitialized value in require at /usr/local/lib64/perl5/Moose/Meta/TypeConstraint.pm line 60. Looking forward to any reply. Thank you! Best regards, |
Hello, I am using Roary to generate a pan-genome with 274 .gff files obtained from Prokka.
The Roary command I'm using is
roary -e -f roary_intermediates -n -p 4 -z *.gff
, in order to obtain the nucleotide sequences of the results.Roary completes the job but the following warning is given:
When I checked the results the pan_genome_sequences was empty.
Does this warning mean that some of my .gff files do not contain the fasta sequence?
P.S.: here goes the output of
roary -a
and one of my .gff files (in .txt format only to upload it here)9_Escherichia_coli_FAP1_CP009578.1_Netherlands.txt
The text was updated successfully, but these errors were encountered: