Variational Gaussian Mixture model Cross Validation resampling of Bayesian and Frequest Neural Networks

VAE code is adapted from this project https://github.com/hwalsuklee/tensorflow-generative-model-collections.git
Bayesian CNN and Frequenst ones are adapted from the following project https://github.com/felix-laumann/Bayesian_CNN https://github.com/kumar-shridhar/PyTorch-BayesianCNN

Dependencies

pip install Cython
pip install pot
if you have conda, install pot with: conda install -c conda-forge pot
pip install -r requirements.txt

Tips on generating dependencies file

pip freeze https://medium.com/python-pandemonium/better-python-dependency-and-package-management-b5d8ea29dff1

How to run or reproduce the experiment

Preparation

clone a new repository and go to the root directory
make build (30mins for the first run on fujitsu-celcius) # equivalent to python main.py --cluster True (train vae on all data and cluster), results could be stored in results/VAE_fashion-mnist_64_62/L-1 for example
make label (1 hours on fujitsu-celcius) # equivalent to python main.py --labeled True --cluster True (train vae according to label and cluster each label, then merge), results could be stored in results/VAE_fashion-mnist_64_62/L0 unutil results/VAE_fashion-mnist_64_62/L9 for example
results for the two steps are stored in results/VAE_fashion-mnist_64_62 for example, where 62 is the latent space dimension of VAE (see configuration file named config.py), while data is stored in /data/FashionMNIST for example

Evaluate Neural Network

change directory to refactor_Bayesian_CNN
make rand frand|vgmm|fvgmm_alexnet

statistic

before you run this command, you should previously run make build and make label
change directory to root folder
make wasser_cv_emd : compute wasserstein distance for random cross validation
make wasser_vgmm_emd: compute wasserstein distance for vgmm-vae cross validation
make t-SNE: generate t-SNE plot for all data divided by vgmm-vae (results could be stored in /results/VAE_fashion-mnist_64_62 for example)
make distribution_y: plot the histogram of class distribution for each cluster, result is store in distribution_y.txt

Plotting

go to /plots and use the R code to generate the beautiful ggplot

Guide to the code

Configuration

in root folder and refactor_Bayesian_CNN, files start with config stores global configuration parameters.

arguments for main.py in project root

python main.py --cluster <True,False (default)> --dataset <'mnist', 'fashion-mnist' (default)> --z_dim <1-inf,62(default)> --labeled <True,False (default)>

Misc Resources

Semi-supervised vae

python make

https://pypi.org/project/py-make/
https://snakemake.readthedocs.io/en/stable/
https://sacred.readthedocs.io/en/latest/apidoc.html decorator for reproducible experiment

parallel job in python

to avoid no space on device problem when run parallel in pytorch

The problem was resolved by setting the following env variable in our Dockerfile: ENV JOBLIB_TEMP_FOLDER=/tmp.
https://stackoverflow.com/questions/44664900/oserror-errno-28-no-space-left-on-device-docker-but-i-have-space
docker run --shm-size=512m
docker system prune -af
https://stackoverflow.com/questions/40115043/no-space-left-on-device-error-while-fitting-sklearn-model
It seems, that your are running out of shared memory (/dev/shm when you run df -h). Try setting JOBLIB_TEMP_FOLDER environment variable to something different: e.g., to /tmp.

%env JOBLIB_TEMP_FOLDER=/tmp

Name		Name	Last commit message	Last commit date
Latest commit History 322 Commits
Bayesian_CNN @ 8c05f58		Bayesian_CNN @ 8c05f58
Pytorch-Utils @ 962b9c2		Pytorch-Utils @ 962b9c2
paper/icml_udl_2019_style		paper/icml_udl_2019_style
paper_2019_distribution_shift_resample @ 8f09ca0		paper_2019_distribution_shift_resample @ 8f09ca0
plots		plots
refactor_Bayesian_CNN		refactor_Bayesian_CNN
results/VAE_fashion-mnist_64_62		results/VAE_fashion-mnist_64_62
tensorflow-generative-model-collections @ 3abde8a		tensorflow-generative-model-collections @ 3abde8a
.gitmodules		.gitmodules
ACGAN.py		ACGAN.py
Makefile		Makefile
README.md		README.md
VAE.py		VAE.py
VGMM.py		VGMM.py
config.py		config.py
config_manager.py		config_manager.py
convnet.py		convnet.py
copy_result.sh		copy_result.sh
data_generator.py		data_generator.py
demo_cross_validation.py		demo_cross_validation.py
docker-togo.sh		docker-togo.sh
getdir.py		getdir.py
main.py		main.py
ops.py		ops.py
prior_factory.py		prior_factory.py
requirements.txt		requirements.txt
requirements_cpu.txt		requirements_cpu.txt
run_all.sh		run_all.sh
run_all_test.sh		run_all_test.sh
scheduler.sh		scheduler.sh
statistic.py		statistic.py
system_requirement.sh		system_requirement.sh
test_gpu.py		test_gpu.py
utils_parent.py		utils_parent.py
visualization.py		visualization.py
wd_rand.txt		wd_rand.txt
wd_rand_matrix.txt		wd_rand_matrix.txt
wd_vgmm.txt		wd_vgmm.txt
wd_vgmm_matrix.txt		wd_vgmm_matrix.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Variational Gaussian Mixture model Cross Validation resampling of Bayesian and Frequest Neural Networks

Dependencies

Tips on generating dependencies file

How to run or reproduce the experiment

Preparation

Evaluate Neural Network

statistic

Plotting

Guide to the code

Configuration

arguments for main.py in project root

Misc Resources

Semi-supervised vae

python make

parallel job in python

to avoid no space on device problem when run parallel in pytorch

About

Releases

Packages

Languages

GAIMJKP/paper_2019_variationalResampleDistributionShift

Folders and files

Latest commit

History

Repository files navigation

Variational Gaussian Mixture model Cross Validation resampling of Bayesian and Frequest Neural Networks

Dependencies

Tips on generating dependencies file

How to run or reproduce the experiment

Preparation

Evaluate Neural Network

statistic

Plotting

Guide to the code

Configuration

arguments for main.py in project root

Misc Resources

Semi-supervised vae

python make

parallel job in python

to avoid no space on device problem when run parallel in pytorch

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages