Error: exon_lines[row.id]["tid"] = row.transcript KeyError: None #196

alelim-bio · 2019-07-18T17:33:38Z

Hello Mikado,

I have been working through your pipeline and have run into an error. I noticed it has been reported previously, but I have been unable to solve the problem. I was wondering if I could get your help?

Some additional information:

Using the recent 2.0 version of Mikado in order to run the pipeline.
CentOS Linux 7
Running a conda environment
Python version 3.6.8
Mikado test passed.

This is an excerpt of what seems to be the main error:

File "/pylon5/mc5fr6p/alelim/CONDA/anaconda3/envs/bio/lib/python3.6/site-packages/Mikado/preparation/annotation_parser.py", line 548, in load_from_gtf
    exon_lines[row.id]["tid"] = row.transcript
KeyError: None
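
For readers unfamiliar with this kind of traceback: the statement first looks up `exon_lines[row.id]` and only then assigns the `"tid"` entry, so `KeyError: None` means the lookup key itself was `None`. The following is a minimal sketch of that behaviour, assuming `exon_lines` acts like a plain dictionary. The objects below are hypothetical stand-ins built for illustration, not Mikado's real data structures.

```python
from types import SimpleNamespace

# Hypothetical stand-ins for illustration only; not Mikado's real objects.
exon_lines = {"tx1": {}}
good_row = SimpleNamespace(id="tx1", transcript="tx1")
bad_row = SimpleNamespace(id=None, transcript=None)


def store_tid(table, row):
    # Same shape as the failing statement: the lookup table[row.id]
    # happens before the assignment of the "tid" entry.
    table[row.id]["tid"] = row.transcript


store_tid(exon_lines, good_row)  # works: "tx1" is already a key

caught = None
try:
    store_tid(exon_lines, bad_row)  # fails: None is not a key
except KeyError as err:
    caught = err
    print(f"KeyError: {err}")
```

In other words, the error points at a row whose identifier could not be determined, rather than at the `"tid"` field.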

I have attached the .log file and the toy samples I have been using. If there is anything else I can provide, please let me know.

Kind Regards,

Alex

prepare.log
St.toy_sample.zip


lucventurini · 2019-07-19T00:03:36Z

Dear Alex,
thank you for your bug report. I will try to solve it as quickly as I can. It looks like a problem in parsing, hopefully it will not be too long to fix.

Kind regards,

Luca

lucventurini · 2019-07-19T14:12:15Z

Dear @AsclepiusDoc,
unfortunately I could not reproduce the bug with the toy data (as a reference genome, I used chromosome 1 of G. raimondii, having inferred from the log that you were analysing cotton - please let me know whether this is incorrect). However, I made a slight modification to the offending section of the code which could provide a fix.

May I also ask you to please check whether any line in the GTF files is missing a "transcript_id"? I think that Mikado might be crashing because of that.
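
One way to run the check suggested above is a short standalone script. This is a sketch only: the function name `lines_missing_transcript_id` and the sample records are made up for illustration, and it assumes standard tab-separated GTF with attributes in the ninth column.

```python
import re

# Matches an attribute such as: transcript_id "g1.1";
TRANSCRIPT_ID = re.compile(r'transcript_id\s+"([^"]+)"')


def lines_missing_transcript_id(lines):
    """Yield (line_number, line) for feature lines without a transcript_id."""
    for number, line in enumerate(lines, start=1):
        stripped = line.rstrip("\r\n")
        # Blank lines and "#" header/comment lines are not feature lines.
        if not stripped or stripped.startswith("#"):
            continue
        fields = stripped.split("\t")
        # Fewer than nine columns means there is no attribute column at all.
        if len(fields) < 9 or not TRANSCRIPT_ID.search(fields[8]):
            yield number, stripped


# Made-up sample records, for illustration only.
sample = [
    "# header line written by the assembler\n",
    'Chr01\tStringTie\ttranscript\t100\t900\t.\t+\t.\tgene_id "g1"; transcript_id "g1.1";\n',
    'Chr01\tStringTie\texon\t100\t400\t.\t+\t.\tgene_id "g1"; transcript_id "g1.1";\n',
    'Chr01\tStringTie\texon\t500\t900\t.\t+\t.\tgene_id "g1";\n',
]

problems = list(lines_missing_transcript_id(sample))
for number, text in problems:
    print(f"line {number}: {text}")
```

For a real file, the same function can be fed an open file handle instead of the `sample` list. Note that gene-level feature lines in some GTF flavours carry no transcript_id, so a hit is not necessarily an error.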

Kind regards

alelim-bio · 2019-07-19T17:55:03Z

Hello @lucventurini ,

Thank you for your reply. You are correct that we are working with cotton.

Also, I have looked through the GTF files and didn't seem to find any lines that were missing a transcript_id except for the two header lines in the stringtie file. Would it be crashing because of these headers?

Kind Regards,

Alex

lucventurini · 2019-07-19T21:43:12Z

Dear @AsclepiusDoc ,
the header should not pose a problem, but would you be able to send it here so that I can check?
Mikado should ignore lines that start with "#". If it does not, that is definitely the bug to be solved.
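
The rule stated above (lines starting with "#" are skipped) can be sketched as a small classifier. This is an illustration of the expected behaviour under simple assumptions, not Mikado's actual parser; `classify_gtf_line` is a hypothetical helper. It can help spot a header that is not recognised as one, for example because it is indented.

```python
def classify_gtf_line(line):
    """Return 'blank', 'comment', 'record' or 'malformed' for one GTF line."""
    text = line.rstrip("\r\n")
    if not text.strip():
        return "blank"
    # Only a "#" in the very first column marks a comment here,
    # so an indented header is NOT treated as a comment.
    if text.startswith("#"):
        return "comment"
    # A GTF feature line has exactly nine tab-separated columns.
    if len(text.split("\t")) == 9:
        return "record"
    return "malformed"


# Made-up example lines, for illustration only.
examples = [
    "# header line written by the assembler\n",
    "  # indented header\n",
    'Chr01\tStringTie\texon\t100\t400\t.\t+\t.\tgene_id "g1"; transcript_id "g1.1";\n',
    "\n",
]
kinds = [classify_gtf_line(line) for line in examples]
print(kinds)
```

Any line reported as malformed would be passed on to the record parser under this rule, which is where a missing identifier could surface.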

lucventurini · 2019-07-24T13:24:56Z

Dear @AsclepiusDoc , any news? Have you managed to try the amended version? If you could send me another snippet of the GTF, I can have a go.

alelim-bio · 2019-07-24T18:47:07Z

Hello @lucventurini ,

Apologies for the late reply. To update you: I have tried using the header and have now run into a new issue with the amended version. I have attached the prepare.log.

Additionally, I have attached another set of GTF files.

Kind Regards,

Alex

prepare.log
St.toy_sample (2).zip

lucventurini · 2019-07-25T09:42:17Z

Dear @AsclepiusDoc ,
many thanks for the updated files. I am now able to reproduce the bug on my workstation, so I should be able to track the problem down and resolve it soon.

Thank you for your patience and collaboration.

Kind regards

lucventurini · 2019-07-25T10:12:17Z

Dear @AsclepiusDoc , it should now be fixed in 3bcae56 (see previous commit, ffc6ec3, under Mikado/preparation/annotation_parser.py, for the actual bug fix). If you could pull from the branch and test on your data, we can close the issue and merge back into master.

Thank you again for reporting this bug. It was quite nasty! If you had not reported it, it would have required a hasty patch right after releasing the next version!

alelim-bio · 2019-07-25T20:02:00Z

Hello @lucventurini ,

I have updated my branch and tested with my full dataset, and it has gone through with no errors! I will continue running the pipeline to see if there are any other issues. Thank you so much for all your help!

Kind Regards,

Alex

lucventurini · 2019-07-25T21:27:25Z

Dear @AsclepiusDoc ,
excellent news! I will merge the fix back into master and update to v2.0rc2.

Thank you again for reporting the bug!

lucventurini self-assigned this Jul 19, 2019

lucventurini added this to the 2.0 milestone Jul 19, 2019

lucventurini added a commit that referenced this issue Jul 23, 2019

Potential fix for #196 (#9)

643558b

lucventurini closed this as completed Jul 25, 2019

lucventurini mentioned this issue Jul 31, 2019

sqlite3.OperationalError: database is locked #205

Closed

lucventurini added this to Closed in Version 2 Oct 15, 2020

lucventurini added a commit to lucventurini/mikado that referenced this issue Feb 11, 2021

Potential fix for EI-CoreBioinformatics#196 (#9)

97be3ae
