NeRN

LEARNING NEURAL REPRESENTATIONS FOR NEURAL NETWORKS

Link to paper: https://arxiv.org/abs/2212.13554

This is the official framework for "NeRN - Learning Neural Representations for Neural Networks". This framework can be used to recreate the experiments detailed in the paper, or be extended to support additional networks and tasks.

💿 Installation

The requirements for NeRN are listed in requirements.txt. To create a new conda environment with the required packages, run the following:

conda create --name nern python=3.7
conda activate nern
pip install -r requirements.txt

🏃 Training a NeRN

Requirements

A working python environment.
Pretrained CNN weights
- Current supported architectures are ResNet18, ResNet20 & ResNet56.
- Current supported benchmarks are CIFAR-10, CIFAR-100 & ImageNet.
- Below we list instructions for extending the framework to new networks and tasks.
A pyrallis configuration file (.yaml).
(Hopefully) A GPU.

Example Configuration File

logging:
  exp_name: cifar10_resnet20_smoothpes_permute_crossfilter_180  # Will be used for logging and checkpoints
  log_dir: outputs/resnet20/cifar10
original_model_path: trained_models/original_tasks/cifar10/resnet20_0_B.pt  # Path to original pretrained network
nern:
  num_blocks: 5
  hidden_layer_size: 180
  weights_batch_method: random_weighted_batch  # This is the magnitude-oriented batching
  embeddings:
    enc_levels: 40  # For three indices we get a positional embedding of size (40 * 3) * 2 = 240
    base: 0.76  # Smooth positional embeddings
    permutations:
      permute_mode: crossfilter
  weights_batch_size: 4096
  distillation_loss_type: StableKLDivLoss
task:
  task_name: cifar10
  original_model_name: ResNet20
optim:
  optimizer_type: ranger
  lr: 5e-3
  ranger_use_gc: False
epochs: 350  # Either set epochs or iterations
num_gpus: 1
num_workers: 12

Instructions

The NeRN framework supports both ClearML and TensorBoard for logging, and it's up to you to decide which one you prefer. The default is ClearML, but can be changed by adding the following to the configuration file:

logging:
    use_tensorboard: True

Finally, run:

python train.py --config_path <PATH_TO_CONFIG_YAML_FILE>

It is possible to override some fields in the configuration file in the execution line. For example:

python train.py --logging.use_tensorboard True --epochs 400 --nern.weights_batch_method all --config_path <PATH_TO_CONFIG_YAML_FILE>

📊 Evaluating a NeRN

Requirements:

A working python environment
Pretrained CNN weights (see section 'Training a NeRN')
Pretrained NeRN weights
The pyrallis configuration file (.yaml) used to train NeRN
(Hopefully) A GPU
(Optional) Precomputed permutations

Create a copy of the training configuration file, and add the following fields:

nern:
  init: checkpoint
  checkpoint_path: <PATH_TO_NERN_CHECKPOINT>

These will initialize the NeRN using pretrained weights (by the way, this can also be used for finetuning/transfer learning)

Then, run:

python evaluate.py --config_path <PATH_TO_EVAL_CONFIG_YAML_FILE>

Tip: the nern.embeddings.permutations.path parameter is used to load precomputed permutations, and can save some time during training/evaluation

🔧 Extending the Framework

Adding new benchmarks

Under the package tasks, create function implementation to retrieve the test and train datasets. This function will recieve train_kwargs, test_kwargs and an optional use_workers flag. The function will return the training dataset and testing dataset, for example:

def get_dataloaders(train_kwargs, test_kwargs, use_workers=True, **kwargs):
    normalize = transforms.Normalize(mean=[0.485, 0.456, 0.406],
                                     std=[0.229, 0.224, 0.225])
    train_loader = torch.utils.data.DataLoader(
        datasets.CIFAR10(root='./data', train=True, transform=transforms.Compose([
            transforms.RandomHorizontalFlip(),
            transforms.RandomCrop(32, 4),
            transforms.ToTensor(),
            normalize,
        ]), download=True),
        batch_size=train_kwargs["batch_size"], shuffle=True,
        num_workers=train_kwargs["num_workers"] if use_workers else 0, pin_memory=True)

    val_loader = torch.utils.data.DataLoader(
        datasets.CIFAR10(root='./data', train=False, transform=transforms.Compose([
            transforms.ToTensor(),
            normalize,
        ])),
        batch_size=128, shuffle=False,
        num_workers=test_kwargs["num_workers"] if use_workers else 0, pin_memory=True)
    return train_loader, val_loader

Then, under dataloader_factory.py, add your new benchmark in the DataLoaderFactory as such:

class DataloaderFactory:
    tasks_data = {
        ...
        "mybenchmark": {
            "loader": my_get_dataloaders,
            "input_shape": (3, 32, 32)
        },
        ...
    }

Adding new networks

Step 1:

Create an OriginalModel wrapper, and implement the following methods:

class MyNewModel(OriginalModel):
    def __init__(self, num_classes: int = 10, **kwargs):
        super(MyNewModel, self).__init__()
        ...

    def get_feature_maps(self, batch: torch.Tensor) -> Tuple[torch.Tensor, List[torch.Tensor]]:
        # Given an input batch, returns a list of feature maps from all layers of the network (before the activation function)
        pass

    def get_learnable_weights(self) -> List[torch.Tensor]:
        # Returns a list containing the weights of all the convolutional layers in the network which we wish to represent using NeRN
        pass

    def load(self, path: str, device: torch.device):
        # Loads weights from a given path to your model
        pass

Step 2:

Create a ReconstructedModel wrapper, and implement the __str__ method:

class ReconstructedMyNewModel(ReconstructedModel):
    def __init__(self, original_model: ResNet20, train_cfg: Config, device: str,
                 sampling_mode: str = None):
        super(ReconstructedResNet20, self).__init__(original_model, train_cfg, device,  sampling_mode)

    def __str__(self):
        # Returns the name of the model in a unique way. Will be used for caching purposes
        pass

Step 3:

Add your classes to the ModelFactory:

class ModelFactory:
    models = {
        ...
        "MyNewModel": (MyNewModel, ReconstructedMyNewModel),
        ...
    }

📦 Pretrained Models

The models (both original networks and pretrained NeRNs) can be found here (*)

We provide pretrained models for the following tasks:

Task	Original Network	NeRN Configuration
CIFAR10	ResNet20 (Accuracy 91.69%)	Infilter Permutations, Hidden Size 160, Accuracy 90.85% Infilter Permutations, Hidden Size 180, Accuracy 91.30% Crossfilter Permutations, Hidden Size 160, Accuracy 91.16% Crossfilter Permutations, Hidden Size 180, Accuracy 91.50%
CIFAR10	ResNet56 (Accuracy 93.52%)	Infilter Permutations, Hidden Size 280, Accuracy 92.48% Infilter Permutations, Hidden Size 320, Accuracy 92.75% Crossfilter Permutations, Hidden Size 280, Accuracy 92.54% Crossfilter Permutations, Hidden Size 320, Accuracy 92.96%
CIFAR100	ResNet56 (Accuracy 71.35%)	Crossfilter Permutations, Hidden Size 360, Accuracy 70.51% Crossfilter Permutations, Hidden Size 400, Accuracy 71.70%
ImageNet	ResNet18 (Top-1 Accuracy 69.76%)	Crossfilter Permutations, Hidden Size 1256, Top-1 Accuracy 68.77%

(*) Pretrained ResNet18 on ImageNet is taken from torchvision

Since computing the Crossfilter permutations on ResNet18 takes a few hours, we have also uploaded the precomputed permutations. Load the precomputed permutations using nern.embeddings.permutations.path

📃 Final Note

Under experiments, you will find all the relevant configuration files, used to generate the results for sections 4.1-4.3 in the paper. The NeRN framework is still under active development, we hope you have fun using it and welcome any feedbacks.

Citation

If you use this code or paper for your research, please cite the following:

@inproceedings{
ashkenazi2023nern,
title={Ne{RN}: Learning Neural Representations for Neural Networks},
author={Maor Ashkenazi and Zohar Rimon and Ron Vainshtein and Shir Levi and Elad Richardson and Pinchas Mintz and Eran Treister},
booktitle={The Eleventh International Conference on Learning Representations },
year={2023},
url={https://openreview.net/forum?id=9gfir3fSy3J}
}

Name		Name	Last commit message	Last commit date
Latest commit History 318 Commits
NeRN		NeRN
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NeRN

NeRN

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

NeRN

LEARNING NEURAL REPRESENTATIONS FOR NEURAL NETWORKS

💿 Installation

🏃 Training a NeRN

Requirements

Example Configuration File

Instructions

📊 Evaluating a NeRN

🔧 Extending the Framework

Adding new benchmarks

Adding new networks

Step 1:

Step 2:

Step 3:

📦 Pretrained Models

📃 Final Note

Citation

About

Releases

Packages

Contributors 5

Languages

License

maorash/NeRN

Folders and files

Latest commit

History

Repository files navigation

NeRN

LEARNING NEURAL REPRESENTATIONS FOR NEURAL NETWORKS

💿 Installation

🏃 Training a NeRN

Requirements

Example Configuration File

Instructions

📊 Evaluating a NeRN

🔧 Extending the Framework

Adding new benchmarks

Adding new networks

Step 1:

Step 2:

Step 3:

📦 Pretrained Models

📃 Final Note

Citation

About

Resources

License

Stars

Watchers

Forks

Languages