updated TrimGalore for multi-core support #65

drewjbeh · 2019-11-21T14:26:40Z

Many thanks for contributing to nf-core/atacseq!

Please fill in the appropriate checklist below (delete whatever is not relevant). These are the most common things requested on pull requests (PRs).

PR checklist

This comment contains a description of changes (with reason)
Ensure the test suite passes (nextflow run . -profile test,docker).
Documentation in docs is updated
CHANGELOG.md is updated

Learn more about contributing: https://github.com/nf-core/atacseq/tree/master/.github/CONTRIBUTING.md

drpatelh

Perfect! Thanks @drewjbeh. Just some very pedantic changes so I can sleep at night 😄

drpatelh · 2019-11-21T14:30:16Z

CHANGELOG.md

@@ -13,6 +13,7 @@ and this project adheres to [Semantic Versioning](http://semver.org/spec/v2.0.0.

 * [#118](https://github.com/nf-core/chipseq/issues/118) - Running on with SGE
 * Make executables in `bin/` compatible with Python 3
+* [#63](https://github.com/nf-core/atacseq/issues/63) - Added multicore support for Trim Galore!


Maybe put this in the Added section instead?

drpatelh · 2019-11-21T14:34:35Z

Also, if you feel inspired at any point, it would be great if you could make a similar PR to nf-core/chipseq. It should be pretty much the same.

drewjbeh · 2019-11-21T15:05:54Z

All done! Thanks for your help - and it was a pleasure.

Will look at nf-core/chipseq tomorrow!

drpatelh · 2019-11-21T15:43:41Z

Great. I'll wait for the Travis CI tests to pass and then merge 👍

ewels · 2019-11-23T20:25:55Z

Hi chaps,

I think that this change may break things on strict systems... --cores 2 uses more than 2 cpus, see https://github.com/FelixKrueger/TrimGalore/blob/master/Docs/Trim_Galore_User_Guide.md#full-list-of-options-for-trim-galore

Actual core usage: It should be mentioned that the actual number of cores used is a little convoluted. Assuming that Python 3 is used and pigz is installed, --cores 2 would use 2 cores to read the input (probably not at a high usage though), 2 cores to write to the output (at moderately high usage), and 2 cores for Cutadapt itself + 2 additional cores for Cutadapt (not sure what they are used for) + 1 core for Trim Galore itself. So this can be up to 9 cores, even though most of them won't be used at 100% for most of the time. Paired-end processing uses twice as many cores for the validation (= writing out) step. --cores 4 would then be: 4 (read) + 4 (write) + 4 (Cutadapt) + 2 (extra Cutadapt) + 1 (Trim Galore) = 15, and so forth.
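As a rough check of the arithmetic in the User Guide excerpt above, here is a minimal Python sketch (Python is used only for illustration; the pipeline itself is Nextflow/Groovy). The function name `documented_cpu_estimate` is hypothetical, and the breakdown assumes Python 3 and pigz are available, as the excerpt states.

```python
def documented_cpu_estimate(cores: int) -> int:
    """Upper-bound CPU count for `--cores <cores>` per the quoted User Guide text.

    Hypothetical helper: it only restates the documented breakdown and is
    not part of Trim Galore or the pipeline.
    """
    read = cores          # cores used to read the input
    write = cores         # cores used to write the output
    cutadapt = cores      # cores for Cutadapt itself
    cutadapt_extra = 2    # additional Cutadapt cores mentioned in the guide
    trim_galore = 1       # Trim Galore itself
    return read + write + cutadapt + cutadapt_extra + trim_galore


for n in (2, 4):
    print(f"--cores {n}: up to {documented_cpu_estimate(n)} CPUs")
```

Under this reading the total grows as 3 × cores + 3, which is why a task given only 2 CPUs could be oversubscribed on a strict scheduler.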

drpatelh · 2019-11-23T21:08:23Z

Hmmm...yes, I breezed over that hoping for the best! Maybe we need to do some funky maths based on the value of $task.cpus...

drpatelh · 2019-11-23T21:13:54Z

God forbid we specify an even number of cores 😅

ewels · 2019-11-23T21:34:34Z

Haha, yes.. It’s the same for the Bismark suite in the methylseq pipeline too. Once you know how many cpus are used per core you should just be able to copy and paste from there.

drpatelh · 2019-11-23T22:35:13Z

Bit confused by the difference in usage between SE and PE. Aside from the 2 extra cutadapt cores and the 1 for TrimGalore it seems that the calculation would be the same for SE and PE?

e.g.
SE and --cores 3 = 3 (read) + 3 (write) + 3 (cutadapt) + 2 (cutadapt extra) + 1 (TrimGalore) = 12 total
PE and --cores 3 = 3 (read) + 3 (write) + 3 (cutadapt) + 2 (cutadapt extra) + 1 (TrimGalore) = 12 total

I had a look at the methylseq pipeline, but TrimGalore shouldn't be as fussy about memory as Bismark is, because read trimming is independent of the size of the genome? It has run fine with 1 CPU and 7GB of RAM for large fastqs up until now. Maybe there is a lower limit...

@FelixKrueger any advice before we construct the logic would be peachy 😄

FelixKrueger · 2019-11-23T22:50:08Z

Hi Harshil,

I would agree that SE and PE should have the same requirements in terms of CPU. And memory usage should indeed not be an issue. I could check next time I run it, but I don't think it will ever exceed 1 or 2 GB. I have never tested this thoroughly in multicore mode though. Does this help?

drpatelh · 2019-11-23T22:55:23Z

Thanks for the rapid response @FelixKrueger 😎

I have updated my representative example for 3 cores for clarity. Is it still correct? If so, then I think that should be enough to work out --cores from the specified CPUs.

ewels · 2019-11-25T11:51:32Z

> I would agree that SE and PE should have the same requirements in terms of CPU.

@FelixKrueger but your documentation says that PE needs more cores?

> Paired-end processing uses twice as many cores for the validation (= writing out) step.

https://github.com/FelixKrueger/TrimGalore/blob/master/Docs/Trim_Galore_User_Guide.md

drpatelh · 2019-11-25T11:57:18Z

Yes, it seems that either the documentation or the logic isn't right. I assumed it was the documentation based on @FelixKrueger's comment.

FelixKrueger · 2019-11-25T13:47:42Z

Sorry I was stuck in a meeting. As a very detailed breakdown I would say the core usage is the following:

[process: cores / load]:

Trimming process:

trim_galore: 1 (high)
gunzip stream: 1 (low)
cutadapt: # cores given with -j (high)
write to gzip stream: 1 (low)

So in default mode (--cores 1) I would expect 2 cores at high load, and up to 2 cores for (de-)compression, albeit at negligible load, for both SE and PE. --cores 2 would then formally require 5 cores, --cores 3 would require 6, --cores 4 would require 7, etc.

For paired-end only, there is a validation process:

trim_galore: 1 (high)
gunzip stream (decompression): 2 (low)
gzip stream (compression): 2 (medium) (I/O dependent)

So if you are extremely strict then this would amount to 5 cores, although in terms of load this is hardly fair as it is really reading/writing 2 files as a single pass. (If the user chooses to also output unpaired reads there might be another 2 write streams on low load in addition to that).

Maybe the very strict formula you would want to be using would then be:
single-end: ( 3 + # cores [given by -j/--cores])
paired-end: ( 4 + # cores [given by -j/--cores])
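The strict formula above can be written out as a small Python sketch (for illustration only; the pipeline itself is Nextflow/Groovy). The name `strict_cpus_required` is hypothetical, and the fixed overheads of 3 and 4 are taken directly from the formula in this comment.

```python
def strict_cpus_required(cores: int, single_end: bool) -> int:
    """Total CPUs a job would need for `--cores <cores>` under the strict formula.

    Hypothetical helper restating: single-end = 3 + cores, paired-end = 4 + cores.
    """
    overhead = 3 if single_end else 4
    return overhead + cores


# Single-end matches the trimming breakdown: --cores 2 -> 5, --cores 4 -> 7.
print(strict_cpus_required(2, single_end=True))
print(strict_cpus_required(4, single_end=True))
# Paired-end adds one more CPU on top of the single-end figure.
print(strict_cpus_required(4, single_end=False))
```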

I have just run a test run on 2 million paired-end reads, the memory consumption was 551M.

drpatelh · 2019-11-25T17:52:55Z

Thanks @FelixKrueger. Based on that breakdown, maybe the formula I used isn't correct then:

tcores = (((task.cpus as int) - 3) / 3) as int

We will have to come up with an alternative such as the below?

def cores = 1
if (task.cpus) {
    def tcores = ((task.cpus as int) - 3) as int
    if (!params.single_end) {
        tcores = ((task.cpus as int) - 4) as int
    }
    if (tcores > 1) {
        cores = tcores
    }
}

In any case, I'm wondering whether we should cap it so that the maximum value of --cores is 4, just so that we are somewhat guarded against providing a large number of cores and I/O becoming the bottleneck?

FelixKrueger · 2019-11-26T10:40:00Z

I think limiting the number of cores to 4 is a good idea, as in my tests I found that:

It seems that --cores 4 could be a sweet spot, anything above has diminishing returns.

(from trim_galore --help)

ewels · 2019-12-10T14:44:17Z

I just had a go at doing this in nf-core/methylseq#137 - I used a slightly refactored version of your code @drpatelh:

cores = 1
if(task.cpus){
    cores = (task.cpus as int) - 4
    if (params.single_end) cores = (task.cpus as int) - 3
    if (cores < 1) cores = 1
    if (cores > 4) cores = 4
}
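To see how the refactored logic above behaves across CPU allocations, here is a Python rendering of the same steps (a sketch for illustration; the real code is the Groovy snippet above). The function name `trim_galore_cores` and the `max_cores` parameter are hypothetical; the overheads of 3 and 4 and the cap of 4 come from the thread.

```python
def trim_galore_cores(task_cpus, single_end: bool, max_cores: int = 4) -> int:
    """Pick a `--cores` value from the CPUs allocated to the task.

    Mirrors the Groovy snippet: subtract the fixed overhead (3 for
    single-end, 4 for paired-end), then clamp the result to [1, max_cores].
    """
    if not task_cpus:
        # No CPU allocation known: fall back to single-core mode.
        return 1
    overhead = 3 if single_end else 4
    return max(1, min(max_cores, int(task_cpus) - overhead))


for cpus in (2, 6, 8, 16):
    pe = trim_galore_cores(cpus, single_end=False)
    se = trim_galore_cores(cpus, single_end=True)
    print(f"task.cpus={cpus}: paired-end --cores {pe}, single-end --cores {se}")
```

With this mapping a paired-end task needs at least 8 CPUs before `--cores` reaches the cap of 4, and anything with 5 or fewer CPUs stays at `--cores 1`.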

drpatelh · 2019-12-12T10:56:35Z

@FelixKrueger I just want to double-check the implementation below is ok before I roll it out to the chipseq and atacseq pipelines too:
https://github.com/nf-core/methylseq/blob/4de49c2780de070bafe6b94ae0c709b5af5aa735/main.nf#L491-L497

Thank you!

FelixKrueger · 2019-12-12T12:04:47Z

Hi Harshil,

That looks exactly like what you want to be doing. Good luck!

drpatelh · 2019-12-12T12:45:39Z

Great! Thanks @FelixKrueger 👍

updated TrimGalore for mult-core support

bb42dee

drpatelh requested changes Nov 21, 2019

drewjbeh and others added 4 commits November 21, 2019 15:56

Update CHANGELOG.md

32c7afa

Co-Authored-By: Harshil Patel <drpatelh@users.noreply.github.com>

Update main.nf

593233a

Co-Authored-By: Harshil Patel <drpatelh@users.noreply.github.com>

Apply suggestions from code review

fbdef55

Co-Authored-By: Harshil Patel <drpatelh@users.noreply.github.com>

Update CHANGELOG.md

3b77b99

drpatelh approved these changes Nov 21, 2019

drpatelh merged commit 60b1f68 into nf-core:dev Nov 21, 2019

drpatelh mentioned this pull request Nov 22, 2019

Possibility for new Trim Galore using cutadapt 2 and mult-cores #63

Closed

phue mentioned this pull request Nov 23, 2019

Use n/3 cores for methXtract multicore nf-core/methylseq#117

Merged

drpatelh mentioned this pull request Nov 24, 2019

Fix TrimGalore core logic and update MultiQC nf-core/chipseq#125

Merged

8 tasks

phue mentioned this pull request Nov 24, 2019

trim galore multicore nf-core/methylseq#120

Merged

5 tasks

drpatelh mentioned this pull request Nov 24, 2019

Fix TrimGalore core logic and update MultiQC #66

Merged

8 tasks

drpatelh mentioned this pull request Dec 6, 2019

Dev > Master, v1.5 release nf-core/methylseq#133

Merged

ewels added a commit to ewels/nf-core-methylseq that referenced this pull request Dec 10, 2019

Refactored how --cores is decided for TrimGalore!

5b66ad6

See nf-core/atacseq#65 (comment)

apeltzer mentioned this pull request Jun 9, 2020

TrimGalore Error due to python version nf-core/sarek#215

Closed

FelixKrueger mentioned this pull request Jun 11, 2020

using --cores FelixKrueger/TrimGalore#94

Closed

j-andrews7 mentioned this pull request Feb 11, 2021

Update docs to mention trimgalore core usage nuances nf-core/rnaseq#567

Closed
