GitHub - ShaharLutatiPersonal/OCD: Official PyTorch Implementation

OCD: Learning to Overfit with Conditional Diffusion Models 🪐
Official PyTorch Implementation

Paper

We present a dynamic model in which the weights are conditioned on an input sample x and are learned to match those that would be obtained by finetuning a base model on x and its label y. This mapping between an input sample and network weights is shown to be approximated by a linear transformation of the sample distribution, which suggests that a denoising diffusion model can be suitable for this task. The diffusion model we therefore employ focuses on modifying a single layer of the base model and is conditioned on the input, activations, and output of this layer. Our experiments demonstrate the wide applicability of the method for Image Classification, 3D Reconstruction, Tabular Data, Few-Shot NLP, and Speech Separation.

Updates

07.10.22 - tinyNeRF is online! 💥💥💥
11.10.22 - LeNet5 (MNIST) is online! 💯 💯 💯

Setup

Clone the repo to your local machine.
Install the dependencies listed in requirements.txt.

Special notes

You can either use the tinyNeRF example as provided in the code (and train it yourself) or use the LeNet5 model. See the LeNet5 model for a full explanation of how to export the latent input and output of the selected layer. The output of the base model should be: predicted_labels, h = base_model();
where h is I(x) as in the paper. It is very important to detach() the latent variables in the forward pass of the base model, and to use the deepcopy function from the copy module.
For tinyNeRF, the --precompute_all flag is recommended; for LeNet5 it is not.
Task-specific configs are in the configs folder, although the generic config from the paper will work too.
The specific configs are optimized for a small footprint so that low-end devices can run the model.
If you use a different network, make sure you change the name of the selected layer accordingly.
The diffusion process will work once the training objective (over the normalized diffusion model) plateaus at around 5E-4.
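
The export convention described in the notes above can be sketched roughly as follows. This is a minimal illustration, not the repo's LeNet5 code: `TinyBaseModel`, its layer names, and its dimensions are hypothetical, and h is assumed here to be the activation fed into the selected layer. The notes do not say what should be deep-copied, so copying the model before its weights are altered is also an assumption.

```python
import copy

import torch
import torch.nn as nn


class TinyBaseModel(nn.Module):
    """Hypothetical stand-in for a base model; not the repo's LeNet5."""

    def __init__(self, in_dim=8, hidden_dim=16, num_classes=4):
        super().__init__()
        self.features = nn.Linear(in_dim, hidden_dim)
        # Stand-in for the "selected layer" whose weights would be modified.
        self.selected_layer = nn.Linear(hidden_dim, num_classes)

    def forward(self, x):
        # Assumption: h is the latent fed into the selected layer.
        h = torch.relu(self.features(x))
        logits = self.selected_layer(h)
        # Detach so the exported latent carries no autograd graph.
        return logits, h.detach()


base_model = TinyBaseModel()
x = torch.randn(2, 8)
predicted_labels, h = base_model(x)

# Work on a deep copy so the original base weights stay untouched.
tuned_model = copy.deepcopy(base_model)
with torch.no_grad():
    tuned_model.selected_layer.weight.add_(0.1)
```

With this layout, changing the copy's selected layer leaves `base_model` unchanged, and `h` can be stored or passed on as a conditioning signal without keeping the base model's graph alive.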

Examples:

1. for training nerf-OCD

python run_func_OCD.py -e 0

2. for evaluating nerf

python run_func_OCD.py -e 1 -t 0 -pd ./checkpoints/model_ocd_tinynerf.pt -ps ./checkpoints/scale_model_tinynerf.pt

3. for training lenet5-OCD

python run_func_OCD.py -e 0 -pb ./checkpoints/checkpoint_lenet5.pth -pc ./configs/train_mnist.json -pdtr ./data/mnist -pdts ./data/mnist -dt mnist -prc 0

4. for evaluating lenet5-OCD - First you need to train the model(!)

python run_func_OCD.py -e 1 -t 0 -pb ./checkpoints/checkpoint_lenet5.pth -pc ./configs/train_mnist.json -pdtr ./data/mnist -pdts ./data/mnist -dt mnist -prc 0 -pd ./checkpoints/model_ocd_mnist.pt -ps ./checkpoints/scale_model_mnist.pt
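
The LeNet5 training and evaluation commands above differ only in the -e value, the extra -t 0, and the two checkpoint paths passed with -pd and -ps. A small helper can make that relationship explicit. This is only a sketch: `build_ocd_command` and the constant names are hypothetical and not part of the repo, and the flags are copied verbatim from the examples without interpreting them further.

```python
def build_ocd_command(evaluate, extra_args=()):
    """Return the argv list for run_func_OCD.py, mirroring the examples above."""
    args = ["python", "run_func_OCD.py", "-e", "1" if evaluate else "0"]
    if evaluate:
        # The evaluation examples additionally pass "-t 0".
        args += ["-t", "0"]
    return args + list(extra_args)


# Arguments shared by the LeNet5 training and evaluation examples.
MNIST_ARGS = [
    "-pb", "./checkpoints/checkpoint_lenet5.pth",
    "-pc", "./configs/train_mnist.json",
    "-pdtr", "./data/mnist",
    "-pdts", "./data/mnist",
    "-dt", "mnist",
    "-prc", "0",
]

# Evaluation also needs the trained checkpoints passed with -pd and -ps.
MNIST_EVAL_ARGS = MNIST_ARGS + [
    "-pd", "./checkpoints/model_ocd_mnist.pt",
    "-ps", "./checkpoints/scale_model_mnist.pt",
]

train_cmd = build_ocd_command(evaluate=False, extra_args=MNIST_ARGS)
eval_cmd = build_ocd_command(evaluate=True, extra_args=MNIST_EVAL_ARGS)
print(" ".join(train_cmd))
print(" ".join(eval_cmd))
```

Either list can be handed to `subprocess.run(cmd, check=True)` from the repository root, which avoids retyping the long argument list by hand.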

Acknowledgments

The tinyNeRF code base is from Krishna Murthy's repo (https://github.com/krrish94/nerf-pytorch), with adjustments to export the latent variables required to condition the diffusion process.
The diffusion model is largely adapted from Jiaming Song's repo (https://github.com/ermongroup/ddim).
