Customize Preprocessing based on each tool #830

berguner · 2022-11-11T09:38:02Z

Description of feature

Hi,
It seems like the CNVkit workflow uses cram_recalibrated files as input here:

sarek/subworkflows/nf-core/variantcalling/cnvkit/main.nf

Lines 8 to 12 in bcd7bf9

    
           workflow RUN_CNVKIT { 
        
               take: 
        
                   cram_recalibrated   // channel: [mandatory] cram 
        
                   fasta               // channel: [mandatory] fasta

. As far as I remember, recalibrated files of WES or panel samples don't contain off-target reads because base recalibration is applied over the intervals only. It would be better using CRAM files containing all the reads (cram_markduplicates ?) for CNVkit analysis for utilizing off-target reads. This is especially important for custom panels where there are fewer target regions compared to WES.

The text was updated successfully, but these errors were encountered:

FriederikeHanssen · 2022-11-11T09:40:53Z

Hi! You can always achieve this by setting the parrameter --skip_tools baserecalibrator . I will add some docs on this.

berguner · 2022-11-11T09:48:06Z

But wouldn't that make the pipeline skip recalibration for SNV/indel calling also? I usually run the pipeline with --tools "mutect2,vep,cnvkit".

FriederikeHanssen · 2022-11-11T14:19:56Z

Yes, currently it is only possible to do one "type" of pre-processing.

I would transfer this to a bigger feature requests:

For scenarios such as above, it would be nice to allow different types of preprocessing. This would require tool based preprocessing steps, that ideally would still be customizable.

Such as:

md+ bqsr + haplotypecaller
no md + bqsr + deepvariant
md + no bqsr + cnvkit

(examples are completely made up)

This would llikely entail quite a massive change in how we manage data flow at the moment

FriederikeHanssen · 2022-11-11T14:22:33Z

Other current options as a work around:

Utilize the --step functions to run the one tool that needs different preprocessing on the respective csv file that is available in results/csv to avoid duplicate mapping for example and save time & resources

berguner added the enhancement New feature or request label Nov 11, 2022

FriederikeHanssen changed the title ~~Utilize off-target reads in CNVkit analysis of WES samples~~ Customize Preprocessing based on each tool Nov 11, 2022

maxulysse added this to the 3.2 milestone Feb 21, 2023

maxulysse modified the milestones: 3.2, 3.3 Jun 22, 2023

maxulysse modified the milestones: 3.3, 3.4, 3.5 Feb 8, 2024

FriederikeHanssen removed this from the 3.5 milestone Aug 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Customize Preprocessing based on each tool #830

Customize Preprocessing based on each tool #830

berguner commented Nov 11, 2022

FriederikeHanssen commented Nov 11, 2022

berguner commented Nov 11, 2022

FriederikeHanssen commented Nov 11, 2022 •

edited

Loading

FriederikeHanssen commented Nov 11, 2022

Customize Preprocessing based on each tool #830

Customize Preprocessing based on each tool #830

Comments

berguner commented Nov 11, 2022

Description of feature

FriederikeHanssen commented Nov 11, 2022

berguner commented Nov 11, 2022

FriederikeHanssen commented Nov 11, 2022 • edited Loading

FriederikeHanssen commented Nov 11, 2022

FriederikeHanssen commented Nov 11, 2022 •

edited

Loading