aselewa/cancerWGS_pipeline: A Snakemake pipeline for calling SSMs and CNVs from matched normal-tumor WGS data

Introduction

This repo contains a Snakemake pipeline written for the processing of whole-genome sequencing data of matched normal-tumor samples. The pipeline takes as input Illumina FASTQ files and will output:

Germline SNPs and tumor BAF at these positions (HaplotypeCaller)
Simple somatic mutations (Mutect2)
Somatic CNAs (TitanCNA)

The pipeline uses a combination of GATK4 and TitanCNA for calling somatic mutations and copy number alterations.

Environment

The environment is handled by the conda package manager. Use the given environment.yaml file to create the environment.

conda env create --file environment.yaml

Load environment

source activate biotools

Running the pipeline

Please edit configfile.yaml and follow the instructions. You will need to add the absolute paths to your project directory and reference genome file (along with its index and dict), as well as the information required for TitanCNA.

Your working directory should look like:

my_project/
└── fastq
    ├── patientID_normal_R1.fastq.gz
    ├── patientID_normal_R2.fastq.gz
    ├── patientID_tumor_R1.fastq.gz
    └── patientID_tumor_R2.fastq.gz
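
Before launching the pipeline, it can help to confirm that the fastq directory follows the layout above. The following is a minimal sketch, not part of the repository: the function name `find_incomplete_patients` and the regular expression are assumptions based only on the file names shown in the tree, and the Snakefile itself may match files differently.

```python
import re
from pathlib import Path

# Assumed naming scheme, taken from the tree above:
#   <patientID>_<normal|tumor>_<R1|R2>.fastq.gz
FASTQ_PATTERN = re.compile(
    r"^(?P<patient>.+)_(?P<sample>normal|tumor)_(?P<read>R[12])\.fastq\.gz$"
)

# Every patient needs both read files for both the normal and the tumor sample.
EXPECTED = {
    ("normal", "R1"),
    ("normal", "R2"),
    ("tumor", "R1"),
    ("tumor", "R2"),
}


def find_incomplete_patients(project_dir):
    """Return {patientID: [missing (sample, read) pairs]} for my_project/fastq.

    Hypothetical helper: patients with all four files are omitted, and
    files that do not match the assumed naming scheme are ignored.
    """
    fastq_dir = Path(project_dir) / "fastq"
    found = {}
    for path in sorted(fastq_dir.glob("*.fastq.gz")):
        match = FASTQ_PATTERN.match(path.name)
        if match is None:
            continue
        key = (match["sample"], match["read"])
        found.setdefault(match["patient"], set()).add(key)
    return {
        patient: sorted(EXPECTED - have)
        for patient, have in found.items()
        if have != EXPECTED
    }
```

Calling `find_incomplete_patients("my_project")` returns an empty dictionary when every patient has all four files; otherwise it lists what is missing per patient.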

Command line

After filling out the config file, run the pipeline with:

snakemake --configfile configfile.yaml

SLURM

If running on the cluster, please edit the cluster.json config file to match your cluster configuration. Once complete, submit the given sbatch file to the cluster.

sbatch snakemake.sbatch

DAG of Snakemake protocol

A more detailed graph can be obtained using Snakemake's --dag option.
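
As a rough sketch of how such a graph might be generated: Snakemake's `--dag` option prints the job graph in Graphviz dot format, which the `dot` tool can render to SVG. The helper names below (`build_dag_commands`, `render_dag`) and the output path are assumptions for illustration; the repository does not state how its DAG.svg was produced, and Graphviz must be installed separately.

```python
import shutil
import subprocess


def build_dag_commands(configfile="configfile.yaml"):
    """Return the two commands of the pipe: snakemake --dag | dot -Tsvg."""
    snakemake_cmd = ["snakemake", "--configfile", configfile, "--dag"]
    dot_cmd = ["dot", "-Tsvg"]
    return snakemake_cmd, dot_cmd


def render_dag(configfile="configfile.yaml", out_path="DAG.svg"):
    """Write the job graph to out_path (hypothetical helper)."""
    snakemake_cmd, dot_cmd = build_dag_commands(configfile)
    # Fail early with a clear message if either tool is missing.
    for tool in (snakemake_cmd[0], dot_cmd[0]):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} not found on PATH")
    # snakemake prints the graph in dot format on stdout.
    dag = subprocess.run(
        snakemake_cmd, check=True, capture_output=True, text=True
    )
    # dot reads the graph on stdin and writes SVG to stdout.
    svg = subprocess.run(
        dot_cmd, input=dag.stdout, check=True, capture_output=True, text=True
    )
    with open(out_path, "w") as handle:
        handle.write(svg.stdout)
    return out_path
```

Run `render_dag()` from the directory containing the Snakefile, with the conda environment active.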
