-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use multiple genomes and annotation for transcriptome mapping #46
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looking really good! 👍
Very good timing too! I just started implementing this yesterday - you really saved me a whole lot of headache with this one so thank you very much! 😁
Just a few things here and there from me that might make it a bit more robust/speed up certain processes.
Also was thinking it might be good to alter the output sub directories to reflect whether a sample was aligned to genome or txome - as this will not show up in the file name. What do you reckon?
Also, I'll go through and change the samplesheet now so we can get tests passing. Happy to implement some of the suggestions below too after a merge!
main.nf
Outdated
// def fix_channel(ArrayList ch) { | ||
// | ||
// if (ch.size() == 7) { | ||
// return [ file(ch[0]), file(ch[1]), ch[2], ch[5], ch[6] ] // [ fasta, gtf, bed, sizes, is_transcripts ] | ||
// } else if (ch.size == 5) { | ||
// return [ file(ch[0]), false, false, ch[3], ch[4] ] | ||
// } | ||
// } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we still need this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It should have been removed 🤔
Ok. Ive tested all of the commands in Thanks for adding the I dont think we should complicate things by having |
Would be good if you can double-check the logic I have used for |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Logic for minimap2
and graphmap2
call look good!
Will have a solid test with lots of different references formats and combos once we're merged! 👍
Many thanks for contributing to nf-core/nanoseq!
Please fill in the appropriate checklist below (delete whatever is not relevant). These are the most common things requested on pull requests (PRs).
PR checklist
nextflow run . -profile test,docker
).nf-core lint .
).docs
is updatedCHANGELOG.md
is updatedREADME.md
is updatedLearn more about contributing: https://github.com/nf-core/nanoseq/tree/master/.github/CONTRIBUTING.md
transcriptome
entry tosamplesheet.csv
input that can either be agtf
,fasta
or will be resolved ifiGenomes
reference.if
statements withwhen
where possible to make it easier to read the pipelinegtf2bed
conversion process and containerIm not going to have much time to work on this for the rest of the week because Ive spent the past few days pretty much glued to getting this solved. Some things left to do:
genome
andtranscriptome
, and by using the different--skip
flags to see if the channels are all defined properly.samplesheet.csv
ontest-datasets
doesnt have atranscriptome
entryIf someone fancies addressing the remaining points then its probably wise to update the
samplesheet.csv
on test-datasets, re-trigger the Travis tests here, merge after passing (hopefully will work first time, if not I can push a commit or two if required) and then some heavy testing 👍 😅