Skip to content
Switch branches/tags
Go to file

Latest commit


Git stats


Failed to load latest commit information.


This repository contains the implementation of our Heterogeneous Incomplete Variational Autoendoder model (HI-VAE). It has been written in Python, using Tensorflow.

The details of this model are included in this paper. Please cite it if you use this code for your own research.

Databse description

There are three different datasets considered in the experiments (Wine, Adult and Default Credit). Each dataset has each own folder, containing:

  • data.csv: the dataset
  • data_types.csv: a csv containing the types of that particular dataset. Every line is a different attribute containing three paramenters:
    • type: real, pos (positive), cat (categorical), ord (ordinal), count
    • dim: dimension of the variable
    • nclass: number of categories (for cat and ord)
  • Missingxx_y.csv: a csv containing the positions of the different missing values in the data. Each "y" mask was generated randomly, containing a "xx" % of missing values.

You can add your own datasets as long as they follow this structure.

Files description

  • A script with a simple example on how to run the models.
  • Contains the main code for the HIVAE models.
  • loglik_ models_ In this file, the different likelihood models for the different types of variables considered (real, positive, count, categorical and ordinal) are included.
  • model_ Contains the HI-VAE with input dropout encoder model.
  • model_ Contains the HI-VAE with factorized encoder model


Alfredo Nazabal:

Code Pre-requisites


$ git clone
$ pip install virtualenv
$ cd mace
$ virtualenv -p python3 _venv
$ source _venv/bin/activate
$ pip install -r pip_requirements.txt
$ chmod +x

Then, run

$ ./


No description, website, or topics provided.




No releases published


No packages published