Skip to content

Software References

Gavin Douglas edited this page Sep 5, 2017 · 14 revisions

Under Construction

This page aims to list many useful bioinformatic tools for microbial ecology and related areas. This is not an exhaustive list, but should be useful to newcomers to the field.

Some other lists you may be interested in are:

Some of the below tools are highlighted by asterisks, which correspond to:

* Tools that are in the Microbiome Helper virtual box.
** Tools that we are looking into and may add soon.

If you there are tools you think we are missing from this list then please let us know!

Amplicon Sequence Processing Toolkits

Pre-processing

Quality Control

  • FastQC* (website) - Java program that provides simple graphical user interface (GUI) or command-line options to quickly generate summary graphs and tables describing raw sequence files.
  • fastqp (website) - Python alternative to FastQC that produces similar summaries.

Read Filtering and/or Trimming

  • BBDuk* (website, helpful seqanswers thread) - Java program implemented in the BBMap package, which includes bioinformatic tools for a number of diverse purposes. BBDuk's focus is on filtering, trimming, or masking reads based on matches to user-specified sequences/kmers, but can also be used for other purposes such as filtering reads by length and quality cut-off.
  • FASTX-Toolkit* (website) - Many different C and C++ command-line programs for filtering and transforming sequence files.

Chimera Filtering

  • CATCh (paper, website)
  • ChimeraSlayer (website)
  • UCHIME (paper, website) - Uses either a de novo or reference-based approach by analyzing the abundance of amplicon sequences.
  • VSEARCH* (website, paper) - open-source implementation of the UCHIME algorithm.

Error Correction

Amplicon Functional Prediction

  • PanFP (paper, website) - Perl program that infers functions by pooling together all genes at a given OTU's taxonomic lineage. This package comes with a database of prokaryotic genomes functionally annotated by KEGG.
  • PAPRICA (paper, website, blog post) - Python package that uses phylogenetic placement approach to infer MetaCyc pathway abundances.
  • PICRUSt* (paper, website, online version, command-line tutorial) - Python package that performs ancestral reconstruction to infer functional abundances. Currently limited to OTUs within the Greengenes database.
  • Piphillin (paper, website) - Web-interface tool that is not dependent on a phylogeny and instead uses a nearest-neighbour approach.
  • SINAPS (pre-print) - Fast kmer approach that is not dependent on alignments or trees.
  • Tax4Fun (paper, website) - R package that estimates the metabolic profile of 16S sequences based on the SILVA database.
  • Vikodak (paper, broken website?) - Includes extended features not covered by the above tools.

Tree Building

  • BEAST
  • FastTree*
  • MEGA
  • RAxML**

Phylogenetic Placement

Tree Analyzing

Clone this wiki locally