Mikado util stats error on NCBI gff3 #226
Comments
Dear @bbista, looking at the GFF file in more detail, there honestly seem to be a lot of similar problems, such as coding genes without mRNAs, or tRNAs without a gene parent. All of these break the gene ontology and Mikado's model of what a GFF should look like. I also tried using the GTF, cleaning it up first with GffRead, but to no avail. The only solutions are
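To see how widespread these inconsistencies are in a given file, a short standalone script can list them. The following is only a minimal sketch and is not part of Mikado: it assumes a standard nine-column GFF3 with `ID` and `Parent` attributes, and the function names, the problem labels and the set of transcript-like feature types are all hypothetical choices made for illustration. It flags exons or CDS attached directly to a gene (which looks like the situation behind the `gene-LOC112059410` assertion, where the gene had no transcripts), features whose parent is never defined, and transcript-like features with no parent at all.

```python
# Hypothetical helper, not part of Mikado: scans GFF3 lines for
# parent/child relationships that a gene -> transcript -> exon model rejects.

# Assumption: feature types treated as "transcript-like" for this check.
TRANSCRIPT_TYPES = {"mRNA", "transcript", "tRNA", "rRNA", "lnc_RNA", "ncRNA"}
CHILD_TYPES = {"exon", "CDS"}


def parse_attributes(field):
    """Parse the ninth GFF3 column ("key=value;key=value") into a dict."""
    attrs = {}
    for item in field.strip().split(";"):
        if "=" in item:
            key, value = item.split("=", 1)
            attrs[key.strip()] = value.strip()
    return attrs


def find_gff3_problems(lines):
    """Return a list of (problem, feature_id, parent_id) tuples."""
    types = {}    # feature ID -> feature type
    records = []  # (feature type, feature ID, list of parent IDs)
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines, comments and directives
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9:
            continue  # skip anything that is not a feature line (e.g. FASTA)
        attrs = parse_attributes(cols[8])
        feature_id = attrs.get("ID")
        parents = attrs["Parent"].split(",") if "Parent" in attrs else []
        if feature_id is not None:
            types[feature_id] = cols[2]
        records.append((cols[2], feature_id, parents))

    problems = []
    for feature_type, feature_id, parents in records:
        if feature_type in TRANSCRIPT_TYPES and not parents:
            # e.g. a tRNA with no gene parent
            problems.append(("no_parent", feature_id, None))
        for parent in parents:
            if parent not in types:
                # Parent ID never defined anywhere in the file
                problems.append(("missing_parent", feature_id, parent))
            elif feature_type in CHILD_TYPES and types[parent] == "gene":
                # exon/CDS hanging directly off a gene, with no transcript
                problems.append(("child_of_gene", feature_id, parent))
    return problems
```

It can be run on an open file handle, for example `find_gff3_problems(open("GCF_000241765.genomic.gff"))`, and the returned tuples can then be counted per problem label. The whole file is read before checking, so a parent defined after its children is not reported as missing.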
Dear @bbista, I have started fixing the problems you found. With the latest commit, Changes will be reflected in Mikado2 (and live in Mikado 2.0rc6). Kind regards
…rectly. Tests to be implemented.
…lly broke the parsing for other strange GFF3 cases)
Current status:
I was trying to look at the stats for a gff3 file I downloaded off NCBI. I get this error message.
mikado util stats GCF_000241765.genomic.gff genomic.stats
/home/bbista/.local/lib/python3.6/site-packages/Mikado/configuration/configurator.py:529: YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe. Please read https://msg.pyyaml.org/load for full details.
  scoring = yaml.load(scoring_file)
2019-10-02 19:04:43,336 - main - __init__.py:124 - ERROR - main - MainProcess - Mikado crashed, cause:
2019-10-02 19:04:43,336 - main - __init__.py:125 - ERROR - main - MainProcess - gene-LOC112059410
{}
Traceback (most recent call last):
  File "/home/bbista/.local/lib/python3.6/site-packages/Mikado/__init__.py", line 110, in main
    args.func(args)
  File "/home/bbista/.local/lib/python3.6/site-packages/Mikado/subprograms/util/stats.py", line 711, in launch
    calculator()
  File "/home/bbista/.local/lib/python3.6/site-packages/Mikado/subprograms/util/stats.py", line 335, in __call__
    self.parse_input()
  File "/home/bbista/.local/lib/python3.6/site-packages/Mikado/subprograms/util/stats.py", line 324, in parse_input
    current_gene.add_exon(record)
  File "/home/bbista/.local/lib/python3.6/site-packages/Mikado/loci/reference_gene.py", line 165, in add_exon
    raise AssertionError("{}\n{}".format(parent, self.transcripts, row))
AssertionError: gene-LOC112059410
{}
Do you have any idea what is going wrong?
Best,
Basanta