komartom/MIDatasets.jl

Multiple Instance Datasets (MIDatasets.jl)

Available datasets include Musk1, Musk2, Fox, Tiger, Elephant, ...

Datasets are taken from http://www.miproblems.org/datasets/

Each dataset is stored as a Vector{Matrix{Float32}}: every matrix holds one bag, and the rows of a matrix correspond to the instances in that bag.
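As a rough illustration of this layout, the sketch below builds a tiny dataset by hand instead of loading one from the package. The values and the variable names (`n_bags`, `instances`, `bag_ids`) are made up for the example and are not part of MIDatasets.jl.

```julia
# Toy data in the same shape as a MIDatasets.jl dataset (values are made up).
# Each matrix is one bag; each row is one instance; columns are features.
X = Matrix{Float32}[
    Float32[0.1 0.2 0.3; 0.4 0.5 0.6],                # bag 1: 2 instances
    Float32[1.0 1.1 1.2; 1.3 1.4 1.5; 1.6 1.7 1.8],   # bag 2: 3 instances
]

n_bags = length(X)                           # number of bags
n_instances = [size(bag, 1) for bag in X]    # instances per bag
n_features = size(X[1], 2)                   # features per instance

# Stack all instances into one matrix and remember which bag each row came from.
instances = reduce(vcat, X)
bag_ids = reduce(vcat, [fill(i, size(bag, 1)) for (i, bag) in enumerate(X)])
```

Flattening the bags like this can be handy for instance-level preprocessing such as feature scaling, while `bag_ids` keeps enough information to regroup the rows afterwards.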

Installation

You can download the datasets using Julia's package manager:

] add https://github.com/komartom/MIDatasets.jl.git

List of all available datasets

using MIDatasets
datasets = midatasets()

Load a dataset

X, y = midataset("Musk1");

Indexes for cross-validation

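Each dataset also ships with indexes for 10-fold cross-validation repeated 5 times, so that experiments are reproducible. In the example below, folds[1][2] is used as a Boolean mask over the bags: the selected entries form the training set and the negated mask gives the testing set.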

X, y, folds = midataset("Musk1", folds=true);

# Training samples and labels for the second fold of the first repetition
Xtrain = X[folds[1][2]]
ytrain = y[folds[1][2]]

# Testing samples and labels for the second fold of the first repetition
Xtest = X[.!folds[1][2]]
ytest = y[.!folds[1][2]]
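To run a full evaluation you would typically loop over every repetition and fold. The sketch below shows one way to do that. It uses hand-made stand-ins for `X`, `y` and `folds` so that it runs without the package; the helper `toy_folds`, the Boolean labels and the 2 x 3 fold layout are assumptions for illustration only, and the only thing taken from the example above is that `folds[r][f]` can be used as a Boolean mask selecting the training bags.

```julia
# Toy stand-ins (hypothetical values) for the X, y and folds returned by midataset.
X = [rand(Float32, 3, 2) for _ in 1:6]        # 6 bags, 3 instances each, 2 features
y = [true, false, true, false, true, false]   # one label per bag (type is an assumption)

# folds[r][f] is a Boolean mask over bags: true = training, false = testing.
# Here: 2 repetitions of 3-fold cross-validation instead of 5 x 10.
function toy_folds(n_bags, n_folds, shift)
    [[mod(i + shift, n_folds) + 1 != f for i in 1:n_bags] for f in 1:n_folds]
end
folds = [toy_folds(6, 3, r) for r in 1:2]

# Count how often each bag lands in a test set, per repetition.
test_counts = zeros(Int, length(folds), length(X))

for (r, repetition) in enumerate(folds), (f, trainmask) in enumerate(repetition)
    Xtrain, ytrain = X[trainmask], y[trainmask]
    Xtest, ytest = X[.!trainmask], y[.!trainmask]
    # Fit a model on (Xtrain, ytrain) and evaluate it on (Xtest, ytest) here.
    test_counts[r, :] .+= .!trainmask
end
```

With the toy masks, every bag is tested exactly once per repetition, which is the property that `test_counts` checks. With the real data, replace the three stand-ins by the values returned from `midataset("Musk1", folds=true)`.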
