"Correcting Model Misspecification via Generative Adversarial Networks": ArXiv
-
Download the following dependencies
- jupyter
- papermill
- scrapbook
- pytorch
- seaborn
- matplotlib
- sklearn
- numpy
- statistics
- statsmodel
- pandas
- catboost
-
Folder structure
- src : contains all the functions used in the experiment. These .py files are imported into the notebooks and the functions defined here are then called in the notebook.
- conf - The conf folder contains the configuration files needed and used if the experiments are run using hydra. The latest experiment flow does not use hydra and thus this can be ignored
- baselineModels - contains the following models along with their training and testing regime
- Vanilla Neural Network
- Catboost
- Random Forest
- Stats Model
- bostonDataset, californiaDataset, cpunishDataset, diabetestDataset, friedman1Dataset, friedman2Dataset, friedman3Dataset, linearDataset, RegressionDataset imports the respective datasets and does the required preprocessing on it.
- train_test.py and ABC_train_test.py contains the functions to train and test GAN and ABC GAN respectively
- dataset.py : contains the Dataset class that is used to form the Dataloader (Check pytorch documentation)
- network.py : contains all the distriminator and generator networks used for GAN and ABC GAN
- performanceMetrics.py : contains the functions to plot and compare performace metrics (MSE, MAE, Euclidean Distance,Minkowski Distance) of different models
- sanityChecks : contains functions that perform sanity check by comparing discriminator probability
- Notebooks1 - Contains notebooks for experiment 1 (where variance was varied)
- Notebooks2 - Contains notebooks for experiment 2 (where variance and bias was varied)
- src : contains all the functions used in the experiment. These .py files are imported into the notebooks and the functions defined here are then called in the notebook.