
This package represents a community effort to provide a common interface for accessing common Machine Learning datasets. In contrast to other data-related Julia packages, the focus of MLDatasets.jl is specifically on downloading, unpacking, and accessing benchmark datasets. Functionality for data processing or visualization is provided only where it is specific to a particular dataset.

This package is a part of the JuliaML ecosystem. Its functionality is built on top of the package DataDeps.jl.

Available Datasets

Datasets are grouped into different categories. Click on the links below for a full list of datasets available in each category.

  • Graphs - Datasets with an underlying graph structure: Cora, PubMed, CiteSeer, ...
  • Misc - Datasets that do not fall into any of the other categories: Iris, BostonHousing, ...
  • Text - Datasets for language models.
  • Vision - Vision related datasets such as MNIST, CIFAR10, CIFAR100, ...
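
As a rough illustration of how a dataset from one of these categories might be accessed, here is a minimal sketch using MNIST. It assumes the dataset-type interface of MLDatasets v0.7 or later (a `split` keyword, indexing, and `features`/`targets` fields); older releases expose a different API, so check the documentation for the version you have installed. The first call downloads the data through DataDeps.jl and may ask for confirmation.

```julia
using MLDatasets

# Assumption: MLDatasets v0.7 or later is installed.
# Constructing the dataset triggers a one-time download via DataDeps.jl.
trainset = MNIST(split=:train)

# Indexing returns a named tuple holding the features and targets
# of the selected observations.
X, y = trainset[1:5]
println(size(X))   # image dimensions followed by the number of observations
println(y)         # the five corresponding digit labels

# The full arrays are also available as fields of the dataset.
println(size(trainset.features))
println(length(trainset.targets))
```

Other datasets in the categories above are expected to follow the same pattern, though the available keyword arguments differ from dataset to dataset.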


Installation

To install MLDatasets.jl, start up Julia and type the following code snippet into the REPL. It makes use of the native Julia package manager.

import Pkg
Pkg.add("MLDatasets")

Contributing to MLDatasets

Pull requests contributing new datasets are warmly welcome. See the source code of any of the implemented datasets for examples.

Other data repositories for Julia

If you cannot find the dataset you are looking for here, please let us know by opening an issue. You can also check out these other packages to find what you need:


License

This code is free to use under the terms of the MIT license.