Shifted Gaussian Noise (SGN)

The official implementation of our ICLR 2024 paper: Robust Classification via Regression for Learning with Noisy Labels.

Abstract

Deep neural networks and large-scale datasets have revolutionized the field of machine learning. However, these large networks are susceptible to overfitting to label noise, resulting in reduced generalization. To address this challenge, two promising approaches have emerged: i) loss reweighting, which reduces the influence of noisy examples on the training loss, and ii) label correction that replaces noisy labels with estimated true labels. These directions have been pursued separately or combined as independent methods, lacking a unified approach. In this work, we present a unified method that seamlessly combines loss reweighting and label correction to enhance robustness against label noise in classification tasks. Specifically, by leveraging ideas from compositional data analysis in statistics, we frame the problem as a regression task, where loss reweighting and label correction can naturally be achieved with a shifted Gaussian label noise model. Our unified approach achieves strong performance compared to recent baselines on several noisy labeled datasets. We believe this work is a promising step towards robust deep learning in the presence of label noise.

Keywords

Classification via Regression, Loss Reweighting, Label Correction, Robustness, Label Noise, Noisy Labels, Probabilistic Machine Learning, Compositional Data Analysis, Isometric Log-Ratio Transform, Gaussian Likelihood.

Environment Setup

The code uses and is based on the Uncertainty Baselines GitHub repository, please follow the installation instructions there.

Our experiments were run with TensorFlow 2.6.0, TensorFlow Probability 0.14.1 and Uncertainty Baselines 0.0.7, on A100 GPUs.

Running Experiments

For example, the SGN method on CIFAR-100 with 40% symmetric noise, can be run with the following command

python src/cifar/sgn.py --data_dir=/path/to/data/ \
                        --output_dir=/path/to/output_dir/ \
                        --dataset cifar100 \
                        --noisy_labels \
                        --corruption_type sym \
                        --severity 0.4

or on Clothing1M

python src/clothing1m/sgn.py --data_dir=/path/to/data/ \
                             --output_dir=/path/to/output_dir/

or on WebVision

python -u src/webvision/sgn.py --data_dir=/path/to/data/ \
                               --output_dir=/path/to/output_dir/

Reference

If you want to cite our work, you can do so with the following BibTex:

@inproceedings{
englesson2024robust,
title={Robust Classification via Regression for Learning with Noisy Labels},
author={Erik Englesson and Hossein Azizpour},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024},
url={https://openreview.net/forum?id=wfgZc3IMqo}
}

If you enjoyed this work, you might also find our previous work at TMLR interesting: Logistic-Normal Likelihoods for Heteroscedastic Label Noise.

Contact

If you have any questions, feel free to reach out to me at engless@kth.se.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Shifted Gaussian Noise (SGN)

Abstract

Keywords

Environment Setup

Running Experiments

Reference

Contact

About

Releases

Packages

Languages

ErikEnglesson/SGN

Folders and files

Latest commit

History

Repository files navigation

Shifted Gaussian Noise (SGN)

Abstract

Keywords

Environment Setup

Running Experiments

Reference

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages