Skip to content
This repository has been archived by the owner on Nov 9, 2023. It is now read-only.


Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time
December 10, 2014 12:34
April 23, 2015 18:19
December 10, 2014 12:34
December 10, 2014 18:48
July 11, 2015 12:23
June 4, 2015 15:56


Build Status Coverage Status

qiime-default-reference, canonically pronounced chime default reference, is a Python package containing default reference data files for use with QIIME. The current default reference data is compiled from the Greengenes 16S rRNA database version 13_8. Please see the Attribution section below for more details.


To install qiime-default-reference:

pip install qiime-default-reference

Running the tests

To run qiime-default-reference's unit tests:

nosetests --with-doctest qiime_default_reference


The reference data distributed in this Python package were copied from the Greengenes 16S rRNA database:

An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea. McDonald D, Price MN, Goodrich J, Nawrocki EP, DeSantis TZ, Probst A, Andersen GL, Knight R, Hugenholtz P. ISME J. 2012 Mar;6(3):610-8. doi: 10.1038/ismej.2011.139.

The Greengenes reference data is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.

The default template alignment column mask (i.e., "Lane mask") is derived from:

Lane,D.J. (1991) 16S/23S rRNA sequencing. In Stackebrandt,E. and Goodfellow,M. (eds), Nucleic Acid Techniques in Bacterial Systematics. John Wiley and Sons, New York, pp. 115-175.

Lane mask was originally available in ARB.

Lane mask taken from here.


qiime-default-reference makes it very easy to access the Greengenes 97% OTUs and some other key pieces of Greengenes from within Python. After installing, you can do the following:

import qiime_default_reference

# Get the absolute filepath to the reference sequences in fasta format.

# Get the absolute filepath to the PyNAST template alignment in fasta format.

# Get the absolute filepath to the reference phylogenetic tree in newick format.

# Get the absolute filepath to the reference taxonomy in tab-separated text format.

# Get the alignment column mask (currently the Lane mask) as 1s and 0s.
# This will be str/bytes in Python 2 (they're the same), and bytes in Python 3.

Getting Help

Please post your questions about this repository/package on the QIIME Forum.