This repository contains scripts and manuals for whole genome sequencing based analysis of non-tuberculous mycobacteria. Questions? Ask mdiricks@fz-borstel.de
The wrapper script starter_NTMseq.sh runs on linux and accepts FastQ (illumina) and FastA files as input. It includes:
- Quality control of raw sequence reads [Input: FastQ files; Required tools: FastQC and multiQC]
- Contamination detection [Input: FastQ files or FastA files; Required tools: kraken2]
- Preprocessing of raw sequence reads (e.g. adapter removal) [Required tools: fastp]
- Mycobacterium (sub)species and resistance prediction [Input: FastQ files; Required tools: NTMprofiler]
- Multi-locus sequence typing (MLST) [Input: FastQ files; Required tools: srst2; Required database: pubMLST (only available for M. abscessus)]
- Assembly of sequence reads [Input: FastQ files; Required tools: Shovill; Output: FastA files]
- Fast phylogenetic analysis using Mashtree [Input: FastQ files or FastA files; Required tools: Mashtree]
- Detection of known plasmids [Input: FastQ files; Required tools: srst2 and seqkit; Required database: PLSDB or custom]
- De novo prediction of plasmid contigs [Input: FastQ files; Required tools: platon and/or plasmidspades]
- Resistance and virulence gene prediction [Input: FastA files; Required tools: AMRfinder+]
For complete installation instructions of NTMseq, description and usage examples please send a mail to mdiricks@fz-borstel.de.
Diricks et al., 2022: https://genomemedicine.biomedcentral.com/articles/10.1186/s13073-022-01017-x --> see folder scripts/2022_Diricks_abscessus