SamsungLabs/SyMPIE


A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation, ACM Multimedia Systems 2024.

This repository contains the code to reproduce the results presented in our paper and in the accompanying blog post on the Samsung Research website.

[Figures: comparison with the standard approach; SyMPIE architecture]

If you find our work useful for your research, please consider citing us:

@inproceedings{barbato2024modular,
  title={A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation},
  author={Barbato, Francesco and Michieli, Umberto and Yucel, Mehmet Karim and Zanuttigh, Pietro and Ozay, Mete},
  booktitle={Proceedings of the ACM Multimedia Systems Conference (MMSys)},
  doi={10.1145/3625468.3647623},
  year={2024}
}

Note that, due to CUDA-related non-determinism, the per-corruption numbers you obtain after training may not exactly match those shown in Table 2 of the paper, but the overall accuracy and improvement will.

Prerequisites

  1. Clone this repository.
  2. Create a new Python 3.10 environment using conda (Anaconda or Miniconda): conda create -n sympie python==3.10.14
  3. Install the dependencies using pip: pip install -r requirements.txt
  4. Download the pretrained checkpoints and place them into training/checkpoints:
    1. Segmentation pretraining
    2. SyMPIE pretrained
    3. AugMIX pretrained (optional)
    4. PRIME pretrained (optional)
    5. PIXMIX pretrained (optional)
  5. To train the architecture, download the following datasets (optional):
    1. ImageNet; the validation set needs to be reorganized into a PyTorch-friendly layout (one folder per class):
      1. Download and extract the dataset.
      2. Navigate to Data/CLS-LOC.
      3. Rename the val folder to val_raw.
      4. Create a new empty val folder.
      5. Copy the make_clas_val.py script provided in the root of this repo into the current folder.
      6. Run the script (a sketch of the reorganization it performs is shown after this list).
    2. Cityscapes
    3. ACDC
  6. To validate the architecture, download the following datasets:
    1. ImageNetC
    2. ImageNetC-Bar (optional)
    3. ImageNetC-Mixed (optional)
    4. CityScapes (optional)
    5. ACDC (optional)
    6. DarkZurich (optional)
  7. Move the text splits provided in data into the appropriate folder for each dataset you downloaded.
  8. Update the paths listed in config_paths.yaml with the ones for your datasets (an illustrative layout is shown after this list).
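
As a reference for step 5.1, the reorganization turns the flat val_raw folder into one subfolder per class. The snippet below is a hypothetical sketch of this kind of script, not the actual make_clas_val.py shipped in the repo root (which is the one you should run); it assumes a mapping file val_labels.txt that pairs each validation image with its WordNet class ID.

import os
import shutil

# Hypothetical mapping file: one "<image name> <WordNet ID>" pair per line,
# e.g. "ILSVRC2012_val_00000001.JPEG n01751748".
with open("val_labels.txt") as f:
    for line in f:
        img, wnid = line.split()
        dst = os.path.join("val", wnid)  # one folder per class
        os.makedirs(dst, exist_ok=True)  # create the class folder if missing
        shutil.copy(os.path.join("val_raw", img), dst)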
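
For step 8, config_paths.yaml maps each dataset to its root folder on disk. The keys below are illustrative only (the real ones are already defined in the file); only the paths should be changed:

# illustrative entries; keep the keys already present in config_paths.yaml
imagenet: /data/imagenet/Data/CLS-LOC
imagenet_c: /data/imagenet-c
cityscapes: /data/cityscapes
acdc: /data/acdc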

Training

To train our architecture with the default configuration, run the following command from classification/training:

torchrun --nproc-per-node=8 train.py

During training, TensorBoard logs are saved in classification/training/logs.
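
The --nproc-per-node value should match the number of GPUs available on your machine (e.g., torchrun --nproc-per-node=4 train.py on a 4-GPU node). To monitor a run, point TensorBoard at the log folder from the repository root:

tensorboard --logdir classification/training/logs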

For the evaluation, we will assume that, after training, the checkpoint file final.pth has been copied from your log folder to classification/training/checkpoints under the name sympie_rerun.pth.
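
For example, assuming your run wrote its logs to a folder named <your_run>:

cp classification/training/logs/<your_run>/final.pth classification/training/checkpoints/sympie_rerun.pth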

Evaluation

To run the evaluation, navigate to classification/eval. From there you can start the validation with:

python validate.py

The default configuration uses a ResNet50 classifier with no SyMPIE module enabled. This run is needed to compute the baseline against which the deltas generated by compare.py are computed.

After completing the baseline evaluation, you can get the accuracy for the default SyMPIE checkpoint with:

python validate.py --colm sympie

This will use the classification/training/checkpoints/sympie.pth checkpoint. To use your retrained checkpoint, simply change --colm sympie to --colm sympie_rerun.
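
Putting the runs together, a full evaluation pass from classification/eval looks like this:

python validate.py                      # baseline: plain ResNet50, no SyMPIE module
python validate.py --colm sympie        # pretrained SyMPIE checkpoint
python validate.py --colm sympie_rerun  # your retrained checkpoint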

To change the classifier, set the --clas argument, choosing from the following:

argument value | classifier used
---------------|----------------------------------
rn50_1         | ResNet50 V1
rn50_2         | ResNet50 V2
yucel          | ResNet50 HybridAugment++ (ICCV23)
rn18           | ResNet18 V1
rn34           | ResNet34 V1
vgg13bn        | VGG13 with BN
vgg13          | VGG13
vgg16bn        | VGG16 with BN
vgg16          | VGG16
mnet_1         | MobileNetV3-Large V1
mnet_2         | MobileNetV3-Large V2
vitb16         | ViT-B16
swint          | Swin-T
prime          | ResNet50 PRIME (ECCV22)
pixmix         | ResNet50 PIXMIX (CVPR22)
clip           | CLIP

To validate on ImageNetC-Bar, run the script with the --bar flag.
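
The flags above can be combined; for example (assuming the arguments compose as described), to evaluate Swin-T with the pretrained SyMPIE module, or a ResNet50 V2 baseline on ImageNetC-Bar:

python validate.py --clas swint --colm sympie
python validate.py --clas rn50_2 --bar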

SyMPIE (A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation) by Francesco Barbato, Umberto Michieli, Mehmet Yucel, Pietro Zanuttigh, Mete Ozay is licensed under CC BY-NC-SA 4.0
