Skip to content
This repository has been archived by the owner on Jan 8, 2024. It is now read-only.

Nextflow pipeline to analyze SARS-Cov2 Illumina SISPA and/or amplicons data

Notifications You must be signed in to change notification settings

BU-ISCIII/SARS_Cov2_consensus-nf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

logo

SARS_Cov2_consensus-nf

Nextflow install with bioconda

Introduction

BU-ISCIII/SARS_Cov2_consensus-nf is a bioinformatics analysis pipeline used to analyze SARS-Cov2 Illumina data. The approach followed by this pipeline is to create a consensus genome through mapping, variant calling and consensus genome generation.

The workflow processes raw data from FastQ inputs (FastQC, Trimmomatic), maps the reads (Bowtie2 and Samtools) against the host and the viral reference genome. Optionally, if the data has been obtaine through amplicon sequencing you can use iVar to trim the primers. Then the pipeline calls for variants and annotates them (VarScan, SnpEff), generates a genome sequence consensus (Bgzip and BCFtools) with the variants. Finally we generate a stats report with MultiQC See the output documentation for more details of the results.

The pipeline is built using Nextflow, a bioinformatics workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker / singularity containers making installation trivial and results highly reproducible.

Documentation

The BU-ISCIII/SARS_Cov2 pipeline comes with documentation about the pipeline, found in the docs/ directory:

  1. Installation
  2. Running the pipeline
  3. Output and how to interpret the results

CREDIT: We inspired in nf-core templates