You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Traceback (most recent call last):
File "testcase", line 6, in <module>
print genome.transcripts_at_locus(11, 119477254, 119477256)
File "/<home>/.local/lib/python2.7/site-packages/pyensembl/genome.py", line 466, in transcripts_at_locus
for transcript_id in transcript_ids
File "/<home>/.local/lib/python2.7/site-packages/pyensembl/genome.py", line 832, in transcript_by_id
require_valid_biotype=("transcript_biotype" in field_names))
File "/<home>/.local/lib/python2.7/site-packages/pyensembl/transcript.py", line 48, in __init__
biotype, transcript_id, transcript_name))
ValueError: Invalid biotype 'bidirectional_promoter_lncRNA' for transcript with ID=ENSMUST00000207133, name=RP23-25M3.6-002
Looking at pyensembl/biotypes.py I see "bidirectional_promoter_lncrna" all in lower case so is_valid_biotype() fails. Should the function is case insensitive ?
The text was updated successfully, but these errors were encountered:
I assumed that the biotypes would be identical across different GTF (including case), but apparently I was wrong. Do you think I should make is_valid_biotype case insensitive?
Alternatively, I can just get rid of all this biotype checking code and let it be a string without structure.
I'm not an expert in biotypes, but isn't that what is checked to know if a variant affects a coding transcript ? I have no clue how often biotypes are likely to change. Looking at what they mean, I think case insensitivity makes sense. In case then there is an unknown biotype, it's best to just log a warning and ignore the transcript during annotation. That way, pyensembl does not break and can be gracefully updated later.
I have the following VCF file
and am using release 84 of the mouse reference genome
So, a simple testcase is
and I get the exception
Looking at pyensembl/biotypes.py I see
"bidirectional_promoter_lncrna"
all in lower case sois_valid_biotype()
fails. Should the function is case insensitive ?The text was updated successfully, but these errors were encountered: