GitHub - czbiohub-sf/rnaseq: RNA sequencing analysis pipeline using STAR or HISAT2, with gene counts and quality control

Introduction

czbiohub/rnaseq is a bioinformatics analysis pipeline used for RNA sequencing data.

The workflow processes raw data from FastQ inputs (FastQC, fastp), aligns the reads (STAR or HISAT2), generates gene counts (htseq-count, StringTie), and performs extensive quality control on the results (RSeQC, dupRadar, Preseq, edgeR, MultiQC). See the output documentation for more details of the results.

Additionally, the pipeline has been extended to quantify expression at the transcript, exon, alternative splicing, and TxRevise levels. See optional quantification methods for details.

The pipeline is built using Nextflow, a workflow tool that runs tasks across multiple compute infrastructures in a portable manner. It comes with Docker / Singularity containers, which make installation straightforward and results highly reproducible.
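As a rough illustration of how a Nextflow pipeline like this one is typically launched with a container profile, the sketch below assembles a `nextflow run` command line. `nextflow run <pipeline>` and `-profile` are core Nextflow options. Everything else is an assumption made for illustration: the helper `build_nextflow_command` is hypothetical, the `docker` profile name and the `reads` parameter follow common Nextflow pipeline conventions and are not confirmed for this pipeline, and the input path is a placeholder. Check the docs/ directory for the options this pipeline actually accepts.

```python
import shlex


def build_nextflow_command(pipeline="czbiohub/rnaseq", profile="docker", params=None):
    """Assemble a `nextflow run` command as a list of arguments (hypothetical helper)."""
    # `nextflow run <pipeline>` and `-profile` are core Nextflow options.
    # The default profile name "docker" is an assumption, not confirmed for this pipeline.
    cmd = ["nextflow", "run", pipeline, "-profile", profile]
    # Pipeline-level parameters are passed with a double dash.
    # The parameter names supplied by the caller are placeholders.
    for name, value in (params or {}).items():
        cmd += [f"--{name}", str(value)]
    return cmd


# Assumed parameter name and placeholder path, for illustration only.
cmd = build_nextflow_command(params={"reads": "data/*_R{1,2}.fastq.gz"})

# shlex.join quotes the glob so the shell passes it to Nextflow unexpanded.
print(shlex.join(cmd))
```

Building the command as a list rather than a single string keeps each argument separate, so it can be passed to `subprocess.run` without shell quoting problems.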

Documentation

The czbiohub/rnaseq pipeline comes with documentation, found in the docs/ directory.

General overview

The schema shown below represents the high-level structure of the pipeline.

Credits

These scripts were originally written for use at the National Genomics Infrastructure, part of SciLifeLab in Stockholm, Sweden, by Phil Ewels (@ewels) and Rickard Hammarén (@Hammarn). They have since taken on a life of their own at Chan Zuckerberg Biohub where they are maintained by Olga Botvinnik.

Many thanks to others who have helped out along the way too, including (but not limited to): @Galithil, @pditommaso, @orzechoj, @apeltzer, @colindaven.

