Which is the advantage to pre-use prokka to perform analysis using genbank (.gbk and gbff) files? #412

felipelira · 2018-07-27T09:40:37Z

May it is a silly question but it would not so efficient to include this steps if you are using a set of 600 genomes, for example. Ok, it is not a lot, but... Any statistics (just for curiosity).

andrewjpage · 2018-07-27T12:36:36Z

You can use annotation from genbank (or RAST) if you wish,and there are instructions on the roary webpage. The important thing is that all annotation & ORF prediction is performed using the same method, otherwise you will just get lots of noise and false signals. GenBank is not ideal since the submitters of genomes can submit the annotation, hence you can get a big mixture of different annotation methods. RefSeq is much better because they use PGAP to ensure consistent annotation (some exceptions to watch out for).

felipelira · 2018-07-27T12:39:42Z

For an accurate study, I prefer to use RefSeq .gbff genomes because they share the same annotation process. Thank you Andrew. I will try both files and methods.

tseemann · 2018-08-06T01:42:19Z

@felipelira i think the issue with refseq .gff files is that they do not have the FASTA file appended to them, and sometimes the GFF "ID" does not match the FASTA "ID".

andrewjpage closed this as completed Jul 27, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Which is the advantage to pre-use prokka to perform analysis using genbank (.gbk and gbff) files? #412

Which is the advantage to pre-use prokka to perform analysis using genbank (.gbk and gbff) files? #412

felipelira commented Jul 27, 2018

andrewjpage commented Jul 27, 2018

felipelira commented Jul 27, 2018

tseemann commented Aug 6, 2018

Which is the advantage to pre-use prokka to perform analysis using genbank (.gbk and gbff) files? #412

Which is the advantage to pre-use prokka to perform analysis using genbank (.gbk and gbff) files? #412

Comments

felipelira commented Jul 27, 2018

andrewjpage commented Jul 27, 2018

felipelira commented Jul 27, 2018

tseemann commented Aug 6, 2018