Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
ZetaHunter is a command line script designed to assign user-supplied small subunit ribosomal RNA (SSU rRNA) gene sequences to OTUs defined by a reference sequence database.
By default, ZetaHunter uses a curated database of full-length, non-chimeric, Zetaproteobacteria SSU rRNA gene sequences derived from arb SILVA (release 128) and Zetaproteobacteria genomes from JGI's Integrated Microbial Genomes (IMG). OTU definitions are the same as those suggested by McAllister et al. (2011) at 97% identity, with novel OTUs discovered since that publication named ZetaOTU29 and higher (curated OTUs only). Infiles aligned by the arb SILVA SINA web aligner are masked using the same 1282 bp mask used in McAllister et al. (2011) to obtain reproducible OTU calls through closed reference OTU binning. User sequences that represent novel Zetaproteobacteria OTUs are de novo binned into NewZetaOTUs, numbered by abundance.
ZetaHunter also supports user-provided curated OTU databases for sequence OTU binning of any SINA-aligned SSU rRNA sequences.
- Stable SSU rRNA gene OTU binning to a curated database
- Supports import of multiple files for easy comparison of NewZetaOTUs across samples
- Database and mask management options
- Multi-threaded processing
- Chimera checking
- Flags for sequences not related to the curated database (i.e. not Zetaproteobacteria)
If you find ZetaHunter useful in your research, please cite:
McAllister, S. M., Moore, R. M., and Chan, C. S. ZetaHunter, a Reproducible Taxonomic Classification Tool for Tracking the Ecology of the Zetaproteobacteria and Other Poorly Resolved Taxa. Aug 2018, 7 (7) e00932-18; DOI: 10.1128/MRA.00932-18.