-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue with translating genes on complement strand #1
Comments
Interesting we will look at it. Could you provide the fasta for the sequence BANY00001 ? Thank you for having pointed that. |
No problem, here's the file. Cheers! |
It looks like it's due to your odd gff3. Indeed, when you have a gene on the minus strand, the sub-features (exons and cds) are inverse sorted. I will add a fix to systematically sort those sub-features when we go through them to avoid such cases. |
Ah, gff3, the file format with no fixed format... This gff was downloaded directly from an ensembl database (ensembl.lepbase.org) too. Could you say what exactly is odd about it? Thanks for the fix! |
Usually all the cds or exon features are sorted in increasing order of their locations, no matter their strand. |
Issue fixed. |
Hello,
I've come across an issue with how CDS features are printed for genes encoded on the complementary strand. The problem manifests itself clearly when using the
--translate
flag, as it produces lots of erroneous translations riddled with stop codons*
.I give an example below.
The EMBL output for an affected gene looks like:
The mRNA feature looks fine, but there are some puzzling
<
and>
characters in the CDS feature that I think may be the problem. The translation is then subsequently messed up, and in fact appears to be the translation for the exons in reverse order, asQKFI*
corresponds to the first 4 "codons" of the last exon (E4, 123273..123484).Hopefully an easy issue, and thanks for a great tool, this is going to extremely useful :-)
Or maybe something funny in the GFF? the entry for this gene is:
Running biopython version: 1.67 and bcbio-gff version: 0.6.4
The text was updated successfully, but these errors were encountered: