Join GitHub today
GitHub is home to over 20 million developers working together to host and review code, manage projects, and build software together.
Generic pre-processing and GVCF generation WDLs #104
Conversation
vdauwera
changed the title from
Generic pre-processing and GVCF generation WDL to Generic pre-processing and GVCF generation WDLs
Apr 22, 2017
vdauwera
requested a review
from cjllanwarne
Apr 22, 2017
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
vdauwera commentedApr 20, 2017
•
edited
This PR adds two WDLs:
A generic, stripped-down version of the Broad production / GOTC single sample pipeline (no QC steps included). Starts with unmapped bams per readgroup and ends with HaplotypeCaller GVCF generation. Inputs requirements are relaxed to be reference-agnostic (two jsons provided, for hg38 and b37) and can be applied to exomes instead of genomes by swapping out intervals files and specifying interval padding (which is important for all exome target lists but unnecessary for the Broad's WGS intervals list).
Same as above but without the GVCF generation, to serve as common pre-processing script for all variant discovery pipelines.
Note that the BQSR tasks have been tweaked to use all GATK3 tools instead of the more performant GATK4 tools, for benchmarking purposes.