Missing data experiments

This repository contains a framework to generate, impute and analize missing data and imputation bias on different datasets. We have developed different missing data mechanisms: both MCAR and MAR; and different imputation methods: median and SVD. Our goal is to study the risk/benefit tradeoff of missing value imputation in the context of feature selection. We caution against using imputation methods that may yield false positives: features not associated to the target becoming dependent as a result of imputation. We also investigate situations in which imputing missing values may be beneficial to reduce false negatives. We have used a dataset called Gissette as a base example to perform the experimentation. Due to the size of the files we have only been able to provide the original dataset (without missingess and imputed datasets).

Repository content

data: Folder to store the original, missingned and imputed datasets.
graph: Folder to store the resulting graphs obtained fruit of the experimentation.
results: Folder to store the resulting data obtained.
src: Contains the experimentation source (only Matlab implementation available at the moment).

Experimental reproduction

Add all the project folders to Matlab path.
Execute the example function main_mcar_example or main_mar_example depending on the missingness type that we want to generate on Gisette dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 276 Commits
data		data
graphs		graphs
results		results
src		src
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Missing data experiments

Repository content

Experimental reproduction

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Missing data experiments

Repository content

Experimental reproduction

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages