Skip to content
No description, website, or topics provided.
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.

Quick Start

Download the JSON version of the database at data/microbe-directory.json. The keys of the database are the MetaPhlAn2 species' names, which makes it easy to use the database downstream of taxonomic classifications.


This directory contains the database itself, as well as the scripts used to generate different versions of the database.

The main database is microbe-directory.sql. The JSON and CSV versions of the database, microbe-directory.json and microbe-directory.csv, are generated from the sql database using and, respectively.

We supplemented our manual curation by parsing the Microbe-Wiki for common keywords that could indicate particular features. We found that we could extract useful data for pathogenicity, biofilm formation, microbe shape, halophilicity, spore formation, and metabolism. We were able to extract some subset of these features for 331 of the microbes that had been manually curated. This supplemental data can be found in CSV format here: binomial_scraped.csv.


This directory contains a quick python example,, which shows users how to use the JSON version of the database given a MetaPhlAn2 output file.


  1. The Microbe Directory:
  2. Data GitHub:
  3. Website BitBucket:
You can’t perform that action at this time.