Nucleic Acid Observatory Viral Metagenomics Pipeline

This Nextflow pipeline is designed to process metagenomic sequencing data, characterize overall taxonomic composition, and identify and quantify reads mapping to viruses infecting certain host taxa of interest. It was developed as part of the Nucleic Acid Observatory project.

The pipeline currently consists of four workflows:

- INDEX: Creates the indices and reference files used by the RUN and RUN_VALIDATION workflows¹.
- RUN: Performs the main analysis, including QC, viral identification, taxonomic profiling, and optional BLAST validation.
- RUN_VALIDATION: Performs the part of the RUN workflow dedicated to validating taxonomic classification with BLAST².
- DOWNSTREAM: Performs downstream analysis of the results from the RUN workflow³.

Documentation

Installation and usage:
Workflow details:
Configuration and output:
- Configuration files
- Pipeline outputs
Other:

¹ The INDEX workflow is intended to be run first, after which many instantiations of the RUN workflow can use the same index output files.
² The RUN_VALIDATION workflow is intended to be run after the RUN workflow if the optional BLAST validation was not selected during the RUN workflow. Typically, this workflow is run on a subset of the host viral reads identified in the RUN workflow, to evaluate the sensitivity and specificity of the viral identification process.
³ The DOWNSTREAM workflow is designed to handle tasks that require cross-read comparisons, including potentially across multiple runs, e.g., marking duplicate reads.
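
Two ideas from the notes above can be made concrete with a minimal Python sketch: scoring viral identification against a reference (sensitivity and specificity) and marking duplicate reads across runs. This is illustrative only and is not the pipeline's implementation. The function names, the input layouts, the use of BLAST results as reference labels, and the choice of exact sequence identity as the duplicate criterion are all assumptions made for the example.

```python
def sensitivity_specificity(pipeline_calls, reference_calls):
    """Score viral calls against reference labels (assumed here to come from BLAST).

    Both arguments map read ID -> bool, where True means "viral".
    Reads missing from the reference are skipped.
    """
    tp = fp = tn = fn = 0
    for read_id, called_viral in pipeline_calls.items():
        if read_id not in reference_calls:
            continue
        truly_viral = reference_calls[read_id]
        if called_viral and truly_viral:
            tp += 1
        elif called_viral:
            fp += 1
        elif truly_viral:
            fn += 1
        else:
            tn += 1
    # sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


def mark_duplicates(reads):
    """Flag reads whose sequence has already been seen, across all runs.

    `reads` is an iterable of (run_id, read_id, sequence) tuples (hypothetical layout).
    Returns a list of (run_id, read_id, is_duplicate) in input order.
    Assumption: two reads are duplicates only if their sequences are identical.
    """
    seen = set()
    marked = []
    for run_id, read_id, sequence in reads:
        marked.append((run_id, read_id, sequence in seen))
        seen.add(sequence)
    return marked
```

For example, with calls `{"r1": True, "r2": True, "r3": False, "r4": False}` and reference labels `{"r1": True, "r2": False, "r3": True, "r4": False}`, `sensitivity_specificity` returns `(0.5, 0.5)`. Duplicate marking needs the reads from all runs in one pass, which is why a cross-read step like this fits a separate downstream stage rather than per-sample processing.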



Repository: securebio/nao-mgs-workflow
