sanger-pathogens/QC-training

QC Training

QC training developed by Pathogen Informatics at Wellcome Sanger Institute.

Content

Introduction

This is a training module for QC in the form of Jupyter notebooks. Begin with the index notebook, which will guide you through the rest of the training.

Key aims of the training include:

  • Describe the different NGS data formats available (FASTQ, SAM/BAM, CRAM, VCF/BCF)
  • Perform conversions between the different data formats
  • Perform a QC assessment of high throughput sequence data
  • Identify possible contamination in high throughput sequence data
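
As a small illustration of the QC aim above, the following sketch reads FASTQ records and computes a mean base quality per read. It is not taken from the training notebooks: the function names (`parse_fastq`, `phred_scores`, `mean_quality`) and the example reads are hypothetical, and it assumes four-line FASTQ records with Phred+33 quality encoding.

```python
from statistics import mean

# Hypothetical example data: two four-line FASTQ records (Phred+33 assumed).
EXAMPLE = """@read1
ACGT
+
IIII
@read2
ACGTN
+
II5!#
"""


def parse_fastq(lines):
    """Yield (read_id, sequence, quality) from an iterable of FASTQ lines.

    Assumes each record occupies exactly four lines (no wrapped sequences).
    """
    stripped = [line.rstrip("\n") for line in lines if line.strip()]
    if len(stripped) % 4 != 0:
        raise ValueError("truncated FASTQ: line count is not a multiple of 4")
    for i in range(0, len(stripped), 4):
        header, seq, plus, qual = stripped[i:i + 4]
        if not header.startswith("@") or not plus.startswith("+"):
            raise ValueError(f"malformed FASTQ record at line {i + 1}")
        if len(seq) != len(qual):
            raise ValueError(f"sequence/quality length mismatch in {header}")
        # The read ID is the header text up to the first whitespace.
        yield header[1:].split()[0], seq, qual


def phred_scores(quality, offset=33):
    """Convert a quality string to a list of integer Phred scores."""
    return [ord(char) - offset for char in quality]


def mean_quality(quality, offset=33):
    """Return the arithmetic mean Phred score of a quality string."""
    return mean(phred_scores(quality, offset))


if __name__ == "__main__":
    for read_id, seq, qual in parse_fastq(EXAMPLE.splitlines()):
        print(read_id, len(seq), mean_quality(qual))
```

A read with a low mean quality, such as `read2` here, is the kind of record a QC step would typically flag or trim. Dedicated tools report far richer statistics than this sketch does.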

Feedback/Issues

Please report any issues on the issues page or by email to path-help@sanger.ac.uk.

Software used

The tutorial runs in a container with the following software installed:

  • bcftools 1.10.2
  • bwa 0.7.17
  • kraken 1.1.1
  • perl 5.30.0
  • picard 2.22.2
  • samtools 1.10