Skip to content
An archive of datasets distributed with R
HTML Other
Latest commit 1218370 Apr 13, 2015 @vincentarelbundock lme4
Failed to load latest commit information.
csv lme4 Apr 13, 2015
doc lme4 Apr 13, 2015
license 2nd initial commit Jul 12, 2012
Makefile add: HSAUR, psych Feb 5, 2013
README.rst lme4 Apr 13, 2015
Rdatasets.R lme4 Apr 13, 2015
datasets.csv lme4 Apr 13, 2015
datasets.html lme4 Apr 13, 2015 ENH: Walk html docs and generate rst Nov 15, 2012


What is this?

Rdatasets is a collection of 758 datasets that were originally distributed alongside the statistical software environment R and some of its add-on packages. The goal is to make these data more broadly accessible for teaching and statistical software development.

What is included?

The list of available datasets (csv and docs) is available here:

On the github repository you will also find:

  • Rdatasets.R: R script to download CSV copies and HTML docs for all datasets distributed in Base R and a list of R packages.
  • docs2rst: A python script which calls pandoc to convert the HTML documentation to human readable text files.
  • Plain text versions (reStructuredText) of all doc files

Adding data

Many R packages ship with associated datasets, but the script included here only downloads data from packages that are installed locally on the machine where it is run. If you spot interesting data in a package distributed on CRAN, let me know. I will try to install that package on my computer and I will re-run the download script to see if the data can be added to this repository. Requests should be filed on the Github issue tracker.


The R documentation which I copied to the Rdatasets html folder is licensed under GPL. You will find a copy of the GPL in the license folder. I made a good faith effort to determine the license under which the actual data (i.e. rows/columns of numbers) were distributed, but I was unable to find a definitive answer. My understanding is that these datasets are free to re-distribute. However, if you own the rights to data that are included here and you object to their inclusion in Rdatasets, send me an email at I will promptly remove the data in question and will make sure that all traces are erased from the git revision history.

Something went wrong with that request. Please try again.