New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Getting sensitivity and specificity scores at the mRNA level #190
Comments
This should work smoothly. I guess one of this file is problematic. |
Oh. Both files get parsed properly. No problems found. The comparison is taking a long time. The transcript alignment file is 27MB and the Reference annotation file is 196MB |
Then, there is probably a bug lying around. Would it be possible to send me your files? |
Hi Jacques, I am using AGAT v 0.6.2. Attached is the reference annotation and transcript alignment files. I'll rerun with the verbose flag and let you know. Thanks |
Hi Jacques, I reran with the verbose flag. The code appears to get stuck at the following location in the reference annotation
That reference gene has three mRNAs each with identical start and end coords for the mRNA but small differences in the exon and CDS coords. It corresponds to the following location in the query transcript annotation
The above gene alignment has a single mRNA. The tail end of the verbose stdout continuously produces the attached text. The file is over 215G in size, so not attaching the same. |
It is definitly a bug, it goes into an infinite loop. We need to fix that. |
Oh I see. Thank you for looking into it. Will this be corrected in the next AGAT release? -Abhijit |
Hi Jacques, Can I replace the sensitivity_specificity code in my copy of AGAT from the master branch and use the same? Thanks |
Yes but you will have to call the script with the path to be sure to use that one: Otherwise follow those instruction: https://agat.readthedocs.io/en/latest/troubleshooting.html#how-to-use-a-version-of-agat-from-a-specific-branch |
Hi Jacques, I tried the code with the same sample files I sent you. It worked!!! Thanks. I get an output as below.
I was hoping to get the sensitivity and specificity values on a per mRNA basis. Something like below
Can this be achieved by modifying this code, or piecing together a few other AGAT tools? Thanks |
Per mRNA basis is something difficult because you need to be sure to compare the same gene in the different annotation, while genes can overlap a lot, and name are rarely synchornised. |
Ok... Thank you Jacques for the quick turnaround on this. I have another small request. I'll put it up as a new issue. Best Regards |
Hi Jacques, I got a chance to try the new "agat_sp_sensitivity_specificity.pl" after the update. I am attaching my test files. The Sensitivity for the exon and cds are above one.
|
Thank you for pointing it, indeed there was a bug. |
Hi Jacques, Sorry to bother you again. But the issue seems to be occurring again in a few cases. Attached are 3 reference and 3 query files and their respective outputs. I cannot pin down the cause since in the ref3.gff3 and query3.gff3 case, both the start and end of the query lie within the bounds of the reference gene coords. But in the other two examples, just one of the query predictions has its end outside the gene bounds of the reference. Thanks |
No worries, |
This is working very well. No issues so far in several thousand runs. Thank you Jacques Best Wishes |
Good to hear. Thank you for your feedbacks. |
Hi Jacques,
I would like to get the sensitivity and specificity values using agat_sp_sensitivity_specificity.pl at the mRNA level of the reference prediction. My command is as follows
agat_sp_sensitivity_specificity.pl --gff1 Reference_Prediction.gff --gff2 transcript_alignment.gff -o Output.txt
However, I notice that this script is taking a very long time. I waited for 24 hours and then killed the process. So I was wondering if the tool was at all designed to do this. Or was it designed to compare just the final annotations.
If it was designed to compare two final annotations, I am wondering if it can be repurposed to do a reference to transcript alignment comparison and generate the Sn and Sp values.
Thanks
Abhijit
The text was updated successfully, but these errors were encountered: