-
Notifications
You must be signed in to change notification settings - Fork 430
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add fastp tool to IUC #1763
Add fastp tool to IUC #1763
Conversation
tools/fastp/.shed.yml
Outdated
A tool designed to provide fast all-in-one preprocessing for FastQ files. This tool is developed in C++ with multithreading supported to afford high performance. | ||
remote_repository_url: https://github.com/bgruening/galaxytools/tree/master/tools/fastp | ||
name: fastp | ||
owner: rnateam |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
iuc :)
@nsoranzo I increased the Galaxy version against which we are testing here. I don't know why this matters for the test but it worked in Can we merge it with this change, as 18.01 will be out "soon"? |
@bgruening I re-ran the last Travis build using 17.09 (i.e. the one for commit e8a18b1) and it passed this time, but I see no problem running tests on 18.01 at this point of the release cycle. |
Danke Nicola! |
tools/fastp/.shed.yml
Outdated
homepage_url: https://github.com/OpenGene/fastp | ||
long_description: | | ||
A tool designed to provide fast all-in-one preprocessing for FastQ files. This tool is developed in C++ with multithreading supported to afford high performance. | ||
remote_repository_url: https://github.com/bgruening/galaxytools/tree/master/tools/fastp |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tools/fastp/.shed.yml
Outdated
name: fastp | ||
owner: iuc | ||
type: unrestricted | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove empty line
tools/fastp/fastp.xml
Outdated
@@ -0,0 +1,357 @@ | |||
<tool id="fastp" name="fastp" version="0.12.4"> | |||
<description>Fast all-in-one preprocessing for FastQ files</description> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
s/FastQ/FASTQ/
tools/fastp/fastp.xml
Outdated
<requirements> | ||
<requirement type="package" version="0.12.4">fastp</requirement> | ||
</requirements> | ||
<version_command>fastp --version | tail -1</version_command> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
tail -n 1
, -1
is not POSIX
tools/fastp/fastp.xml
Outdated
</requirements> | ||
<version_command>fastp --version | tail -1</version_command> | ||
<command><![CDATA[ | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove empty line
tools/fastp/fastp.xml
Outdated
</tests> | ||
<help> | ||
<![CDATA[ | ||
A tool designed to provide fast all-in-one preprocessing for FastQ files. This tool is developed in C++ with multithreading supported to afford high performance. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
s/FastQ/FASTQ/
tools/fastp/fastp.xml
Outdated
|
||
10. support long reads (data from PacBio / Nanopore devices). | ||
|
||
This tool is being intensively developed, and new features can be implemented soon if they are considered useful. If you have any additional requirement for fastp, please file an issue:https://github.com/OpenGene/fastp/issues/new |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd remove this line.
tools/fastp/fastp.xml
Outdated
<![CDATA[ | ||
A tool designed to provide fast all-in-one preprocessing for FastQ files. This tool is developed in C++ with multithreading supported to afford high performance. | ||
|
||
**features** |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Capital F
tools/fastp/.shed.yml
Outdated
@@ -0,0 +1,11 @@ | |||
categories: | |||
- Sequence Analysis | |||
description: Fast all-in-one preprocessing for FastQ files |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
s/FastQ/FASTQ/
tools/fastp/.shed.yml
Outdated
description: Fast all-in-one preprocessing for FastQ files | ||
homepage_url: https://github.com/OpenGene/fastp | ||
long_description: | | ||
A tool designed to provide fast all-in-one preprocessing for FastQ files. This tool is developed in C++ with multithreading supported to afford high performance. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
s/FastQ/FASTQ/
Thanks for the review @nsoranzo ! I've made those changes. |
tools/fastp/fastp.xml
Outdated
</section> | ||
</when> | ||
<when value="paired"> | ||
<param name="in1" argument="-i" type="data" format="fastq" label="Input 1" help="Input FASTQ file #1."/> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we should be able to add fastq.gz here and for the other inputs, right ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Well I wondered that and so tested it with a fastq.gz file and it ran fine so it didn't seem necessary to add 😕. But what do you think? Should I instead add all possibilities- fastq, fastq.gz, fastqsanger, fastqsanger.gz?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fastqsanger, fastqillumina etc are all subclasses of fastq, so that's fine.
fastqsanger.gz, fastqillumina.gz etc, are subclasses of fastq.gz, and should normally require implicit conversion to their uncompressed equivalent if you want to use them with a fastq input.
Is that perhaps why it worked for you ? The disadvantage of not specifying fastq.gz is the additional, unnecessary uncompressed dataset that would be created.
tools/fastp/fastp.xml
Outdated
</inputs> | ||
|
||
<outputs> | ||
<data name="out1" format="fastq" label="${tool.name} on ${on_string}: Read 1 Output"/> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we make this format_source="in1"
(and below) ?
tools/fastp/fastp.xml
Outdated
<test> | ||
<param name="in1" value="R1.fq"/> | ||
<param name="single_paired_selector" value="single"/> | ||
<output name="out1" file="out1.fq" ftype="fastq"/> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this would be fastqsanger now, unless you change <param name="in1" value="R1.fq"/>
to <param name="in1" value="R1.fq" ftype="fastq"/>
tools/fastp/fastp.xml
Outdated
|
||
#if $single_paired.adapter_trimming_options.disable_adapter_trimming: | ||
$single_paired.adapter_trimming_options.disable_adapter_trimming | ||
#end if |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The #if
and #end if
are not needed since you're defining truevalue
and falsevalue
in the <param>
. Same for most other boolean params below.
@mblue9 Thanks a lot for the changes! There are still 3 standing comments from my previous review at https://github.com/galaxyproject/tools-iuc/pull/1763/files#diff-12b0b82edec68d4ab4e953daf7087da6 , if you have time. |
@bgruening The merge of |
So done. |
tools/fastp/fastp.xml
Outdated
|
||
fastp | ||
|
||
-i input1.${in1.ext} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a doubt: does fastp support all our weird ext, like fastqillumina.gz
or fastqsolexa
, which are all subtypes of fastq/fastq.gz?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Most likely not and if we want to support this we porbably need to add --phred64
as well.
tools/fastp/fastp.xml
Outdated
#set $ext = 'fastq' | ||
#else: | ||
#set $ext = $in1.ext | ||
#end if |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this should be:
#if $in1.is_of_type('fastq.gz')
#set ext = 'fastq.gz'
#else
#set ext = 'fastq'
#end if
tools/fastp/fastp.xml
Outdated
#end if | ||
|
||
|
||
## Run fastp | ||
|
||
fastp | ||
|
||
-i input1.${in1.ext} | ||
-o first.${in1.ext} | ||
#if $in1.ext in ['fastqillumina', 'fastqsolexa', 'fastqillumina.gz', 'fastqsolexa.gz']: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
#if $in1.is_of_type('fastqillumina', 'fastqsolexa', 'fastqillumina.gz', 'fastqsolexa.gz')
Sorry @nsoranzo I missed those other changes! I've added your suggestions now and if you think I should make more changes please let me know. |
:( I don't know why it's failing here on the 9th test with "No output has been received in the last 20m0s", it passes fine locally 😕 |
It's green after a restart! Thanks @mblue9 and @bgruening! |
Thanks a lot @nsoranzo ! I'm trying it now and it's looking good! 👍 |
fastp - a tool for fastq processing, trimming, QC etc (website: https://github.com/OpenGene/fastp)
has been wrapped by @HassanAmr (hooray! 🎉) bgruening/galaxytools#691
and this PR is to move it to IUC