galaxyproject / tools-iuc Public
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Stringtie 1.3.3 errors when the option to output Deseq2/EdgeR is used #1322
Comments
|
When used at http://usegalaxy.org, there is another small warning that comes up. Not sure if related to current server issues. If "Ballgown" output is selected (same run as above), this is printed in the info comments for the green results datasets: |
|
This error occurred on jobs run yesterday. I can't reproduce it today. Maybe was a cluster issue. Closing for now, but will reopen if can figure out what triggers it. |
|
Reports of this error have now come in. I can also now reproduce the error again in a current Galaxy RNA-seq tutorial history, using the latest version of the tool. The last four stringtie jobs were run with the four different related options - three pass, one fails ("additional output files = DEseq/EdgeR"). This seems important to fix with priority. @MoHeydarian @davebx @martenson @natefoo |
|
Specific problem: Stringtie-merge performs summary calculations and produces new "transcript" lines in the result (and resets the "Source" and removes other content lines). Running just iGenomes/external GTF through the Stringtie-merge tool will groom it and allow a successful job. Work-around for now to avoid the If using an external GTF source with Stringtie, and not currently using Stringtie merge transcripts, running just the external GTF dataset through the Stringtie merge transcripts tool (as an "input_gtf", leaving the input for "guide_gff" with no dataset selected) will produce a groomed version of the reference annotation that can be used by Stringtie. How to resolve? One or more of these? Some are @MoHeydarian ideas and other solutions are likely possible.
ping @bgruening There is a related error when using a reference GTF and the DEseq2/EdgeR outputs, but it has not been reproduced. Tagging it here for reference.
|
|
One of the issues is that the |
|
Still errors. @bgruening Do you know if a correction for this is in progress or planned for? ETA? Or is cycling/prepping through strigtie-merge the solution? If so, I can add that to the help. Thanks! |
|
Upps, sorry missed this completely, will look into it tomorrow. |
|
Hi - any updates? In short, the tool fails with external GTF files unless process through Stringtie merge first. Also - any chance that we could modify Stringtie merge to include the "merge" in the output dataset names (and have merge in bold as part of the core tool name) when other changes are made? The same change probably impacts both. Would help a great deal in distinguishing between the assembly vs merge tools in tool panel plus (more importantly for usage) the different output datasets in the history. Thanks!!! |
|
@jennaj with Could you provide some text for 1) and 2) and I can get this into the tool. |
|
Sorry for delay. Yes, after Dave's detective work, 3 & 4 are confirmed to be beyond our control. So, our options are to trap the error and provide help about usage. Current test history for an example: https://usegalaxy.org/u/jen/h/test-history-stringtie-and-stringtie-merge-133 Suggested text (is what I send to users):
2a. "StringTie transcript assembly and quantification" tool form
2b. "Stringtie merge transcripts" tool form
-- This has to be detailed because the GTF are often the reference GTF. When doing a merge of 2 or more GTFs, and a reference GTF is used, it is entered in a different form field ("guide_gff"). This is entirely different usage and goes against the tool name, so is a bit confusing. The tool could be renamed to something like: "Stringtie merge and prepare transcripts". It needs a rename anyway, and this has been discussed and agreed upon (gitter?), because both Stringtie tools have just the "Stringtie" part of the tool name highlighted as a link - Galaxy UI, Tool Shed, Workflow editor, etc. Core tool names are intended to be unique across tools. If you want to coordinate, I could still do the form text edits, but I wouldn't know quite how to do the error trapping/message or changing the tool name (if we even want to, I vote yes still). Lmk. ping @bgruening Thank you!!! |
|
Hi everyone! We had the same issue with 't_id' not defined in prepDE.py. I think I can present some solution here but I don't know how to send commits. So maybe someone else can add the changes to the 'stringtie.xml'-file and test if that resolves the problem. We patched the command-section and the output-section of the stringtie.xml as follows:` ` ` ` ` ` |
|
Hi @NCEichner I only learnt how to make commits here pretty recently and I have some notes I started making for our in-house documentation (on how to test Galaxy tools and submit to Github). I've attached them here in case they're any use to you, if you wanted to try making commits yourself sometime (it sounds like you've got good suggestions!). GAD-TestingatoolusingPlanemo-121017-1229-18.pdf |
|
Oops, I just noticed something I had added to the wrong place in that planemo doc, corrected version is attached here: |
This is my current version of the updated stringtie.xml that should fix the error described in galaxyproject#1322. Compared to my first post in galaxyproject#1322 I've added some 'cosmetic' changes regarding the temp-files
|
Tests pass: https://usegalaxy.org/u/jen/h/test-history-stringtie-and-stringtie-merge-133 data 187-188. Many thanks! |
Switching to no additional output or to Ballgown works fine. All inputs appear to meet specification and this is reproducible across different input datasets/target genomes.
Error produced:
Example job info
cc @davebx
The text was updated successfully, but these errors were encountered: