
⭐PAC-Bayes Information Bottleneck

This repo provides a demonstration of our recent paper on identifying the information stored in the weights of deep neural networks through the PAC-Bayes Information Bottleneck (PIB).
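
Schematically, PIB trades the empirical risk on the training set $S$ off against the information stored in the weights (a paraphrase; $\beta$ is a trade-off coefficient, and the paper gives the exact PAC-Bayes form):

$$\min_{w}\; \hat{L}_{S}(w) + \beta\, I(w; S)$$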

How do we train an MLP model and track the information in its weights during training? Just use the following code:

```python
from src.models import MLP
from src.dataset import load_data
from src.utils import plot_info_acc
from src import train_track_info, img_preprocess, train_prior

# pre-process the input MNIST data
x_tr, y_tr, x_va, y_va, x_te, y_te = load_data('mnist')
x_tr, y_tr = img_preprocess(x_tr, y_tr)
x_va, y_va = img_preprocess(x_va, y_va)
x_te, y_te = img_preprocess(x_te, y_te)

# initialize the model and fit the prior used for tracking information
model = MLP()
model.cuda()
train_prior(model, x_va, y_va)

# trigger training and view the information trajectory!
info_dict, loss_acc_list = train_track_info(model, list(range(len(x_tr))), x_tr, y_tr, x_va, y_va,
                                            num_epoch=20,
                                            batch_size=128,
                                            lr=1e-4,
                                            weight_decay=0,
                                            track_info_per_iter=-1,  # how many iterations between information measurements
                                            verbose=False)
plot_info_acc(info_dict, loss_acc_list, 'relu', './')
```

We then get the following image, which reproduces the experimental results in the paper.
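
For intuition, `train_prior` fits the network on the held-out split so its weights can serve as the PAC-Bayes prior. A minimal sketch of such a step, assuming a plain PyTorch loop (the names and hyperparameters below are illustrative, not the repo's implementation):

```python
import torch
import torch.nn.functional as F

def fit_prior(model, x_va, y_va, num_epoch=5, lr=1e-4, batch_size=128):
    """Hypothetical sketch: fit the model on held-out data so its weights
    can serve as the PAC-Bayes prior; returns a snapshot of the fitted
    parameters. The real logic lives in src.train_prior."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(num_epoch):
        perm = torch.randperm(len(x_va))
        for i in range(0, len(x_va), batch_size):
            idx = perm[i:i + batch_size]
            xb, yb = x_va[idx].cuda(), y_va[idx].cuda()
            loss = F.cross_entropy(model(xb), yb)
            opt.zero_grad()
            loss.backward()
            opt.step()
    # snapshot the prior mean for later KL / IIW computations
    return {k: v.detach().clone() for k, v in model.state_dict().items()}
```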

It is also easy to turn a model into a PAC-Bayes IB-augmented one! See the following code:

```python
from src.models import VGG
from src.dataset import load_data
from src import img_preprocess, train_prior, train_iiw

# load data
x_tr, y_tr, x_va, y_va, x_te, y_te = load_data('cifar10')
x_tr, y_tr = img_preprocess(x_tr, y_tr)
x_va, y_va = img_preprocess(x_va, y_va)
x_te, y_te = img_preprocess(x_te, y_te)

# initialize the model and fit the prior
model = VGG()
model.cuda()
train_prior(model, x_va, y_va)

# train with IIW regularization!
# specify the parameters used for computing and regularizing IIW,
# or set param_list=None to use all parameters for training
param_list = ['extract_feature.0.weight', 'extract_feature.2.weight']
info_dict, loss_acc_list = train_iiw(model, list(range(len(x_tr))),
                                     x_tr, y_tr, x_va, y_va,
                                     param_list=param_list,
                                     verbose=True)
```
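
Here IIW is the information in weights, $I(w;S)$, which `train_iiw` adds to the task loss as a regularizer. As a rough illustration only (a Fisher-trace-style proxy via squared gradients, not the estimator implemented in `src`), an IIW-penalized loss could look like:

```python
import torch
import torch.nn.functional as F

def iiw_penalized_loss(model, xb, yb, param_list=None, beta=0.1):
    """Cross-entropy plus a crude IIW proxy: the squared gradient norm
    of the selected parameters (a Fisher-trace-style surrogate, NOT the
    estimator implemented in src.train_iiw)."""
    ce = F.cross_entropy(model(xb), yb)
    params = [p for n, p in model.named_parameters()
              if param_list is None or n in param_list]
    # create_graph=True lets the penalty itself be backpropagated
    grads = torch.autograd.grad(ce, params, create_graph=True)
    iiw_proxy = sum(g.pow(2).sum() for g in grads)
    return ce + beta * iiw_proxy
```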

🔥For reproducing the phase-transition phenomenon captured by PAC-Bayes IB:

  1. Please refer to the Jupyter notebook `demo_track_information.ipynb`.

We can see that deeper layers accelerate the fitting and compression of deep neural networks!

🔥For reproducing the information in weights w.r.t. batch size and the generalization gap:

  1. Please refer to the Jupyter notebook `demo_batch_size_info.ipynb`.

We can see that the IIW tracks the generalization gap (train accuracy minus test accuracy) shown on the left-hand side!

🔥For comparing the performance of the PIB-based SGD algorithm:

  1. Download CIFAR-10 from http://www.cs.toronto.edu/~kriz/cifar.html (a helper for this is sketched below).
  2. Unzip the archive under `./data/cifar-10-python`.
  3. Run `python run_vgg_PIB.py`.

Or refer to `demo_track_pac_bayes_ib.ipynb` for a quick look.
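
A minimal sketch of steps 1-2, assuming the standard CIFAR-10 python tarball URL from the page above (the target directory matches the path in step 2):

```python
import os
import tarfile
import urllib.request

# fetch the CIFAR-10 python version archive (skip if already downloaded)
url = "https://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz"
os.makedirs("./data", exist_ok=True)
archive = "./data/cifar-10-python.tar.gz"
if not os.path.exists(archive):
    urllib.request.urlretrieve(url, archive)

# unpack under ./data/cifar-10-python as expected by run_vgg_PIB.py
with tarfile.open(archive, "r:gz") as tar:
    tar.extractall("./data/cifar-10-python")
```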

😄If you find this result interesting, please consider citing this paper:

```bibtex
@inproceedings{wang2022pacbayes,
    title={{PAC}-Bayes Information Bottleneck},
    author={Zifeng Wang and Shao-Lun Huang and Ercan Engin Kuruoglu and Jimeng Sun and Xi Chen and Yefeng Zheng},
    booktitle={International Conference on Learning Representations},
    year={2022},
    url={https://openreview.net/forum?id=iLHOIDsPv1P}
}
```
