Skip to content
This repository has been archived by the owner on Jan 8, 2024. It is now read-only.

Latest commit

 

History

History
25 lines (17 loc) · 2.3 KB

README.md

File metadata and controls

25 lines (17 loc) · 2.3 KB

logo

SARS_Cov2_consensus-nf

Nextflow install with bioconda

Introduction

BU-ISCIII/SARS_Cov2_consensus-nf is a bioinformatics analysis pipeline used to analyze SARS-Cov2 Illumina data. The approach followed by this pipeline is to create a consensus genome through mapping, variant calling and consensus genome generation.

The workflow processes raw data from FastQ inputs (FastQC, Trimmomatic), maps the reads (Bowtie2 and Samtools) against the host and the viral reference genome. Optionally, if the data has been obtaine through amplicon sequencing you can use iVar to trim the primers. Then the pipeline calls for variants and annotates them (VarScan, SnpEff), generates a genome sequence consensus (Bgzip and BCFtools) with the variants. Finally we generate a stats report with MultiQC See the output documentation for more details of the results.

The pipeline is built using Nextflow, a bioinformatics workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker / singularity containers making installation trivial and results highly reproducible.

Documentation

The BU-ISCIII/SARS_Cov2 pipeline comes with documentation about the pipeline, found in the docs/ directory:

  1. Installation
  2. Running the pipeline
  3. Output and how to interpret the results

CREDIT: We inspired in nf-core templates