Skip to content

TrisKast/Cas_mediated_enrichment_read_mapping_pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cas mediated enrichment read mapping pipeline

Nextflow

Docker Repository on Dockerhub

Introduction

This pipeline maps long DNA sequence reads to a reference genome, and evaluates the performance of a Cas9 based target enrichment strategy. The workflow is suitable for Oxford Nanopore fastq sequence collections and requires a reference genome and a BED file of target coordinates. The program is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker / singularity containers making installation trivial and results highly reproducible.

The current workflow consists of:

  1. Mapping of the reads onto a reference genome
  2. Handling of generated sam files and transformation into bam files
  3. Evaluation of the performance of the enrichment
  4. Separation of reads into different files according to their mapping status

Documentation

The pipeline comes with documentation, found in the docs/ directory:

  1. Installation
  2. Pipeline configuration
  3. Running the pipeline
  4. Output

Credits

This pipeline was written by Tristan Kast (tristankast) at DZNE, using R scripts from the nanoporetech ont_tutorial_cas9 github repo (https://github.com/nanoporetech/ont_tutorial_cas9).

DZNE

About

Evaluation of read-mapping characteristics from a Cas-mediated PCR-free enrichment

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published