Annotation Paused #61
Hi Zack, the issue arises because Prokka does not accept the long names of some of the contigs. You can rename the contigs to something Prokka accepts and then rerun just the FASTA files through TORMES. Copy this script into a file on your Linux computer. Type in:
then paste this:
Then press Ctrl-X. Then make the script executable:
Then run your fasta files through the script sending the output to a new filename like so:
Do that for your three samples so you have three new files. Here is the difference between two examples in the contig naming: before the script: $ head Sample-1.fasta
after the script: $ head Sample-1a.fasta
Then run those three new files through TORMES and they should annotate properly.
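For readers who cannot see the script from the comment above, here is a minimal Python sketch of the same idea: replace every FASTA header with a short sequential ID. It is not the script posted in this thread; the function name `rename_contigs` and the `contig_N` naming scheme are assumptions chosen for illustration.

```python
import sys


def rename_contigs(lines, prefix="contig"):
    """Yield FASTA lines with each header replaced by a short sequential ID.

    The prefix and numbering scheme are illustrative assumptions, not
    something TORMES or Prokka prescribes.
    """
    count = 0
    for line in lines:
        if line.startswith(">"):
            # Header line: drop the long original name, keep a short one.
            count += 1
            yield f">{prefix}_{count}\n"
        else:
            # Sequence line: pass through unchanged.
            yield line


if __name__ == "__main__" and len(sys.argv) == 2:
    # Reads the FASTA file given as the only argument and writes the
    # renamed records to standard output.
    with open(sys.argv[1]) as handle:
        sys.stdout.writelines(rename_contigs(handle))
```

Saved under a hypothetical name such as `rename_contigs.py`, it would be run once per sample, for example `python rename_contigs.py Sample-1.fasta > Sample-1a.fasta`. The original contig names are discarded, so keep the input files if you need to map back to them.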
Thank you so much for the reply. I will try this and report back. One question about the new TORMES run: since the contig FASTA files are not paired forward/reverse reads, are they submitted as genomes with a corresponding metadata file? Thanks again for the help.
Hi Zack, in the metadata file, put the word GENOME in place of read 1 and put the FASTA file name in place of read 2.
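As a rough illustration of that layout, the sketch below builds a tab-separated metadata table with GENOME in the read 1 column and the FASTA file name in the read 2 column. The header names (`Samples`, `Read1`, `Read2`, `Description1`), the description text, and the helper name `genome_metadata` are assumptions; check them against the TORMES documentation before use.

```python
import csv
import io


def genome_metadata(fasta_paths, description="assembled genome"):
    """Return tab-separated metadata text for already-assembled genomes.

    fasta_paths maps a sample name to its FASTA file name. The column
    headers below are assumed, not taken from this thread.
    """
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(["Samples", "Read1", "Read2", "Description1"])
    for sample, path in fasta_paths.items():
        # GENOME goes where read 1 would be; the FASTA goes where read 2 would be.
        writer.writerow([sample, "GENOME", path, description])
    return buffer.getvalue()
```

For example, `genome_metadata({"Sample-1": "Sample-1a.fasta"})` returns the header row followed by one row for Sample-1; write the returned text to a file and pass that file to TORMES as the metadata.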
Hi Zack, Brad is right! The issue comes from Prokka and long contig names (such as the ones SPAdes generates automatically). You can find a shortcut to generate metadata files here: https://github.com/nmquijada/tormes/wiki/Shortcut-to-generate-the-metadata-file-for-TORMES Let us know!
Hello,
I recently ran TORMES. The pipeline completed, and I was able to generate an HTML report file and assembly.tgz files for each sample (3 samples total). However, I am missing what seems to be the bulk of the Prokka outputs. A directory was created for each sample, but the only files in it were the FNA file and the log for that sample. All three runs ended at the same step, where there seemed to be an issue with the contig ID name (see the representative log file below).
Sample-1.log
I was wondering if there is a way to solve this issue within TORMES and rerun a shorter version of the pipeline, perhaps starting from the assembled genomes, that will provide annotation as well.
Any help would be greatly appreciated and thank you for the time.
Best,
Zack