
scikit-bio is an open-source, BSD-licensed Python 3 package providing data structures, algorithms and educational resources for bioinformatics.

To view scikit-bio's documentation, visit

Note: scikit-bio is no longer compatible with Python 2. scikit-bio is compatible with Python 3.4 and later.

scikit-bio is currently in beta. We are very actively developing it, and backward-incompatible interface changes can and will arise. To avoid these types of changes being a surprise to our users, our public APIs are decorated to make it clear to users when an API can be relied upon (stable) and when it may be subject to change (experimental). See the API stability docs for more details, including what we mean by stable and experimental in this context.
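
As a rough illustration of how decorator-based stability labels can work, the sketch below defines hypothetical `stable` and `experimental` decorators that record a stability state and the version in which it was assigned. The names `_stability`, `as_of`, and `double` are assumptions made for this example only. This is not scikit-bio's actual implementation, which is described in the API stability docs.

```python
import functools


def _stability(state):
    """Build a decorator factory that tags a callable with `state`."""
    def factory(as_of):
        def decorator(func):
            @functools.wraps(func)
            def wrapper(*args, **kwargs):
                return func(*args, **kwargs)
            # Record the label so that tools and users can inspect it.
            wrapper.stability = state
            wrapper.as_of = as_of
            # Append a human-readable note to the docstring.
            note = "State: {} as of {}.".format(state.capitalize(), as_of)
            wrapper.__doc__ = "{}\n\n{}".format(func.__doc__ or "", note).strip()
            return wrapper
        return decorator
    return factory


# Hypothetical decorators, named after the two states described above.
stable = _stability("stable")
experimental = _stability("experimental")


@experimental(as_of="0.1.0")
def double(x):
    """Return twice the input."""
    return 2 * x
```

With labels like these, a caller can check `double.stability` or read the note at the end of `double.__doc__` before depending on the function.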


Installing

The recommended way to install scikit-bio is via the conda package manager, which is available in Anaconda or Miniconda.

To install the latest release of scikit-bio:

conda install -c scikit-bio

Alternatively, you can install scikit-bio using pip:

pip install numpy
pip install scikit-bio

You can verify your installation by running the scikit-bio unit tests:

python -m skbio.test
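
For a quicker check that does not run the full test suite, a small script along the following lines can confirm that the interpreter meets the Python 3.4 minimum and that the package can be imported. The helper `check_install` and its `min_python` parameter are hypothetical names introduced here, and the sketch assumes the package exposes a `__version__` attribute.

```python
import importlib
import importlib.util
import sys


def check_install(package_name, min_python=(3, 4)):
    """Return (python_ok, installed, version) for `package_name`."""
    python_ok = sys.version_info >= min_python
    # find_spec returns None when a top-level package cannot be located.
    if importlib.util.find_spec(package_name) is None:
        return python_ok, False, None
    module = importlib.import_module(package_name)
    # Not every package defines __version__, so fall back to None.
    return python_ok, True, getattr(module, "__version__", None)
```

Calling `check_install("skbio")` returns a tuple whose second element is `False` if the package cannot be found in the current environment.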

For users of Debian, skbio is in the Debian software distribution and may be installed using:

sudo apt-get install python3-skbio python-skbio-doc

Getting help

To get help with scikit-bio, use the skbio tag on Stack Overflow (SO). Before posting a question, check out SO's guide on how to ask a question. The scikit-bio developers regularly monitor the skbio SO tag.

Projects using scikit-bio

Some of the projects we know of that use scikit-bio are:

If you're using scikit-bio in your own projects, feel free to issue a pull request to add them to this list.

scikit-bio development

If you're interested in getting involved in scikit-bio development, see

See the list of scikit-bio's contributors.


Licensing

scikit-bio is available under the new BSD license. See COPYING.txt for scikit-bio's license, and the licenses directory for the licenses of third-party software that is (either partially or entirely) distributed with scikit-bio.

The pre-history of scikit-bio

scikit-bio began from code derived from PyCogent and QIIME, and the contributors and/or copyright holders have agreed to make the code they wrote for PyCogent and/or QIIME available under the BSD license. The contributors to PyCogent and/or QIIME modules that have been ported to scikit-bio are: Rob Knight (@rob-knight), Gavin Huttley (@gavin-huttley), Daniel McDonald (@wasade), Micah Hamady, Antonio Gonzalez (@antgonza), Sandra Smit, Greg Caporaso (@gregcaporaso), Jai Ram Rideout (@jairideout), Cathy Lozupone (@clozupone), Mike Robeson (@mikerobeson), Marcin Cieslik, Peter Maxwell, Jeremy Widmann, Zongzhi Liu, Michael Dwan, Logan Knecht (@loganknecht), Andrew Cochran, Jose Carlos Clemente (@cleme), Damien Coy, Levi McCracken, Andrew Butterfield, Will Van Treuren (@wdwvt1), Justin Kuczynski (@justin212k), Jose Antonio Navas Molina (@josenavas), Matthew Wakefield (@genomematt) and Jens Reeder (@jensreeder).


scikit-bio's logo

scikit-bio's logo was created by Alina Prassas.