EpiDiverse-WGBS Pipeline

EpiDiverse/wgbs is a bioinformatics analysis pipeline for aligning whole genome bisulfite sequencing data from non-model plant species.

The workflow processes raw data from FastQ inputs (FastQC, cutadapt), aligns the reads (erne-bs5 or segemehl), and performs extensive quality-control on the results using custom scripts and Picard MarkDuplicates. Methylation calling and mbias correction is performed with Methyldackel.

See the output documentation for more details of the results.

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker containers making installation trivial and results highly reproducible.

Quick Start

Install nextflow
Install one of docker, singularity or conda
Start running your own analysis!

NXF_VER=20.07.1 nextflow run epidiverse/wgbs -profile <docker|singularity|conda> \
--input /path/to/reads/directory --reference /path/to/reference.fasta

See the usage documentation for all of the available options when running the pipeline.

Test data

A minimal example dataset for testing purposes can be found in the EpiDiverse/datasets repository. You can either download the files manually and run the pipeline above as intended, or you can directly run the pipeline using the test profile option which will automatically download the data for you:

NXF_VER=20.07.1 nextflow run epidiverse/wgbs -profile test,<docker|singularity|conda>

Wiki Documentation

The EpiDiverse/wgbs pipeline is part of the EpiDiverse Toolkit, a best practice suite of tools intended for the study of Ecological Plant Epigenetics. Links to general guidelines and pipeline-specific documentation can be found below:

Credits

These scripts were originally written for use by the EpiDiverse European Training Network, by Adam Nunn (@bio15anu) and Nilay Can (@nilaycan).

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 764965

Citation

If you use epidiverse/wgbs for your analysis, please cite it using the following doi:

Name		Name	Last commit message	Last commit date
Latest commit History 122 Commits
assets		assets
bin		bin
config		config
data		data
docs		docs
env		env
lib		lib
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
main.nf		main.nf
nextflow.config		nextflow.config

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

assets

assets

bin

bin

config

config

data

data

docs

docs

env

env

lib

lib

.gitignore

.gitignore

CHANGELOG.md

CHANGELOG.md

LICENSE

LICENSE

README.md

README.md

main.nf

main.nf

nextflow.config

nextflow.config

Repository files navigation

EpiDiverse-WGBS Pipeline

Quick Start

Test data

Wiki Documentation

Credits

Citation

About

Releases 2

Packages

Languages

License

EpiDiverse/wgbs

Folders and files

Latest commit

History

Repository files navigation

EpiDiverse-WGBS Pipeline

Quick Start

Test data

Wiki Documentation

Credits

Citation

About

Resources

License

Stars

Watchers

Forks

Languages