DCI-ES

This is the official repo of the paper DCI-ES: An Extended Disentanglement Framework with Connections to Identifiability

The paper explores the idea of analyzing the properties of a learned representation from the point of view of ease-of-use. The main tool to do this is the loss-capacity curves, as explained in the paper. For a given representation, we train a set of probes with increasing capacity that are learned (in a supervised way) to predict the latent factors of variations for a dataset. After training the set of probes, we can compute the proposed Explicitness score, as shown in the paper.

Computing the DCI-ES scores involves 4 steps:

get pretrained representations or train representations on evaluation datasets
compute the representations of the evaluation dataset and save the results
train the sets of probes
compute DCI-ES scores.

install dependencies

pip install -r requirements_pip.txt

# install liftoff, a small framework to manage the configuration of experiments
pip install git+https://github.com/tudor-berariu/liftoff.git#egg=liftoff

Datasets

We use MPI3D dataset for evaluation in this repo.

Train and save basic representations of the evaluation dataset

We can use the train_unsupervised_model.py to train a beta-VAE model on the evaluation datasets using the default parameters from default_config_dev.yaml.

liftoff train_unsupervised_model.py ../configs/default_config_dev.yaml

The previous script can be adapted to include any pretrained representation that we might want to test. As long as we save the vector representations in the same format as the previous script, we can use them for computing the DCI-ES scores.

Train a single probe

Train a single probes with default parameters (default_config_dev.yaml):

liftoff train_probe.py ../configs/default_config_dev.yaml

We can change the hyperparameters of the probe, defined in the config file, and train a new probe. The most important hyperparameter is the type of probe: MLPs, Random Fourier Features + Learned linear layer, Random Forest. This is selected by the flag: probe_type [MLP / RFF / RandomForest].

Train set of probes

We use liftoff to generate config files for different runs, and for managing a queue of experiments.

First, we create a set of config files, each one for training one probe. We define in ../configs/random_forest/default.yaml the default parameters used in all experiments of a set, and in ../configs/random_forest/config.yaml the hyperparameters that we vary.

liftoff-prepare ../configs/random_forest/ --runs-no 10 --results-path results/ --do

The previous command creates 10 runs (10 different seeds) for each hyperparameter configuration defined in ../configs/random_forest/config.yaml and saves them in a results folder.

We run all experiments in the queue using the command:

liftoff train_probe.py  ./results/date_random_forest/  --gpus 0 --per-gpu 4 --procs-no 4

The previous command starts 4 runs in parallel on one GPU.

Compute Explicitness score

After we have trained all probes of a certain type, we can compute the DCI-E scores using the following script:

python simple_gather.py --results_dir=results/date_random_forest

In this repo we make use of the following projects:

SAGE: https://github.com/iancovert/sage
loaders for MPI3D from: https://github.com/bethgelab/InDomainGeneralizationBenchmark
Random Fourier Features implementation: https://github.com/jmclong/random-fourier-features-pytorch

Citation

Please use the following BibTeX to cite our work.

@inproceedings{
eastwood2022dcies,
title={{DCI}-{ES}: An Extended Disentanglement Framework with Connections to Identifiability},
author={Eastwood, Cian and Nicolicioiu, Andrei Liviu and von K{\"u}gelgen, Julius and Keki{\'c}, Armin and Tr{\"a}uble, Frederik and Dittadi, Andrea and Sch{\"o}lkopf, Bernhard},
booktitle={International Conference on Learning Representations },
year={2023},
url={https://openreview.net/forum?id=462z-gLgSht}
}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
InDomainGeneralizationBenchmark/src/lablet_generalization_benchmark		InDomainGeneralizationBenchmark/src/lablet_generalization_benchmark
PyTorchVAE		PyTorchVAE
configs		configs
metrics		metrics
sage		sage
src		src
LICENSE		LICENSE
README.md		README.md
requirements_pip.txt		requirements_pip.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

InDomainGeneralizationBenchmark/src/lablet_generalization_benchmark

InDomainGeneralizationBenchmark/src/lablet_generalization_benchmark

PyTorchVAE

PyTorchVAE

configs

configs

metrics

metrics

sage

sage

src

src

LICENSE

LICENSE

README.md

README.md

requirements_pip.txt

requirements_pip.txt

Repository files navigation

DCI-ES

install dependencies

Datasets

Train and save basic representations of the evaluation dataset

Train a single probe

Train set of probes

Compute Explicitness score

In this repo we make use of the following projects:

Citation

About

Releases

Packages

Languages

License

andreinicolicioiu/DCI-ES

Folders and files

Latest commit

History

Repository files navigation

DCI-ES

install dependencies

Datasets

Train and save basic representations of the evaluation dataset

Train a single probe

Train set of probes

Compute Explicitness score

In this repo we make use of the following projects:

Citation

About

Resources

License

Stars

Watchers

Forks

Languages