Data sets for machine learning in Python
Latest commit fb08e82 Jul 3, 2015 @jaberg Merge pull request #67 from lmjohns3/master
More changes to enable support for py3k.
Failed to load latest commit information.
skdata Merge pull request #67 from lmjohns3/master Jul 3, 2015
.gitignore ENH: ignore build dir Jun 21, 2012 MANIFEST and requirements Jun 21, 2012
requirements.txt MANIFEST and requirements Jun 21, 2012 BLD: Don't remove the import fixer in 2to3 Jul 2, 2015


skdata (scikit-data)

Skdata is a library of data sets for machine learning and statistics. This
module provides standardized Python access to toy problems as well
as popular computer vision and natural language processing data sets.

The project is hosted at github:


There are several options for installation:

  * From scratch:

      * pip install --user

  * From a fresh git checkout:

      * python develop

      * python install




Join the mailing list:!forum/skdata


Github maintains an up-to-date list of direct contributors:

A special thanks goes to David Cox, who provided inspiration and design
guidance, and generally got this project started.

This work was supported in part by the National Science Foundation (IIS-0963668).