Skip to content


Repository files navigation


r-cmd-check CRAN Status StackOverflow Mattermost

A small collection of interesting and educational machine learning data sets which are used as examples in the mlr3 book, the mlr3 gallery, or in other examples of mlr3 packages. All data sets are properly preprocessed and ready to be analyzed by most machine learning algorithms. Currently contains the following data sets:

  1. Housing prices in Kings County [link]
  2. Titanic passenger survival data [link]
  3. Optical recognition of handwritten digits data [link]
  4. Major League Baseball statistics 1962-2012 [link]
  5. Indian Liver Patient Data [link]
  6. Bike Sharing Demand [link]