
Latent Class-Conditional Noise Model

Abstract

Learning with noisy labels has become imperative in the Big Data era, as it saves expensive human labor on accurate annotations. Previous noise-transition-based methods have achieved theoretically grounded performance under the Class-Conditional Noise model (CCN). However, these approaches build upon an ideal but impractical anchor set that is assumed available to pre-estimate the noise transition. Even though subsequent works adapt the estimation as a neural layer, the ill-posed stochastic learning of its parameters in back-propagation easily falls into undesired local minima. We solve this problem by introducing a Latent Class-Conditional Noise model (LCCN) that parameterizes the noise transition under a Bayesian framework. By projecting the noise transition into the Dirichlet space, the learning is constrained on a simplex characterized by the complete dataset, instead of some ad-hoc parametric space wrapped by the neural layer. We then deduce a dynamic label regression method for LCCN, whose Gibbs sampler allows us to efficiently infer the latent true labels to train the classifier and to model the noise. Our approach safeguards the stable update of the noise transition, avoiding the previous practice of arbitrarily tuning it from a mini-batch of samples. We further generalize LCCN to different counterparts compatible with open-set noisy labels, semi-supervised learning, and cross-model training. A range of experiments demonstrate the advantages of LCCN and its variants over current state-of-the-art methods.
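For orientation, the Class-Conditional Noise model underlying LCCN can be written as below; the notation (a K x K transition matrix T with Dirichlet-distributed rows) is ours and follows the abstract, not a verbatim excerpt from the paper.

  % CCN: the observed noisy label \bar{y} depends on the input x only
  % through the latent true class y, via the noise transition T.
  p(\bar{y}=j \mid x) = \sum_{k=1}^{K} T_{kj}\, p(y=k \mid x),
  \qquad T_{kj} := p(\bar{y}=j \mid y=k)

  % LCCN keeps each row of T on the probability simplex by placing a
  % Dirichlet prior on it:
  T_{k\cdot} \sim \mathrm{Dirichlet}(\alpha), \quad k = 1,\dots,K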

Get Started

Environments

The project is tested under the following environment settings:

  • OS: Ubuntu 18.04.5
  • GPU: NVIDIA GeForce RTX 3090
  • Python: 3.7.10
  • PyTorch: 1.7.1
  • Torchvision: 0.8.2
  • Cudatoolkit: 11.0.221
  • Numpy: 1.21.2
  • Tensorflow: 1.8.0

Code Structure

We summarize the code in the following structure:

├── README.md
├──                                              # Tensorflow code for LCCN
├── data/cifar10/cifar-10-batches-bin/*          # data files
│── cifar10_input.py                             # read data
│── cifar10.py                                   # backbone (training and inference)
│── cifar10_train.py                             # CE         
│── cifar10_train_bootstrapping.py               # CE with the bootstrapped labels
│── cifar10_train_T.py                           # CE with the transition matrix
│── cifar10_train_varT.py                        # CE with the adapted transition matrix
│── cifar10_train_varC.py                        # CE with LCCN
│── cifar10_eval.py                              # evaluation
├──                                              # Torch code for DivideLCCN
│── PreResNet.py                                 # Resnet Backbone 1
│── resnet_model.py                              # Resnet Backbone 2
│── dataloader_cifar.py                          # read data
│── divide_train_varC.py                         # DivideLCCN

Construct the noisy dataset

  python dataset.py
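As a rough illustration of the symmetric noise this step injects (a hedged sketch under our own naming, not the contents of dataset.py), each label is flipped to a uniformly chosen other class with probability equal to the noise rate:

  import numpy as np

  def inject_symmetric_noise(labels, noise_rate, num_classes, seed=0):
      """Flip each label to a uniformly chosen *other* class with prob. noise_rate."""
      rng = np.random.RandomState(seed)
      labels = np.asarray(labels).copy()
      flip = rng.rand(len(labels)) < noise_rate
      for i in np.where(flip)[0]:
          # draw only from the K-1 incorrect classes
          candidates = [c for c in range(num_classes) if c != labels[i]]
          labels[i] = rng.choice(candidates)
      return labels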

Baselines

  • Train DNNs directly with the cross-entropy loss (CE).
  python cifar10_train.py --train_dir results/events_ce/cifar10_train --noise_rate 0.3 # You can train other models like this one
  • Train Bootstrapping
  python cifar10_train_bootstrapping.py --train_dir results/events_bootstrapping/cifar10_train --noise_rate 0.3 # You can train other models like this one
  • Train Forward (the loss correction it uses is sketched after this list)
  python cifar10_train_T.py --init_dir results/events_ce/cifar10_train --train_dir results/events_T/cifar10_train --noise_rate 0.3 # You can train other models like this one
  • Train S-adaptation
  python cifar10_train_varT.py --init_dir results/events_ce/cifar10_train --train_dir results/events_varT/cifar10_train --noise_rate 0.3 # You can train other models like this one
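The Forward baseline corrects the loss by passing the classifier's clean-label predictions through the transition matrix before computing cross-entropy. Below is a minimal PyTorch sketch of that correction; the function name, and the assumption that T has already been estimated, are ours:

  import torch
  import torch.nn.functional as F

  def forward_corrected_loss(logits, noisy_labels, T):
      """Cross-entropy on T-corrected predictions.

      logits:        (batch, K) classifier outputs for the clean label
      noisy_labels:  (batch,)   observed noisy labels
      T:             (K, K)     noise transition, T[k, j] = p(noisy=j | true=k)
      """
      clean_probs = F.softmax(logits, dim=1)   # p(y | x)
      noisy_probs = clean_probs @ T            # p(noisy y | x) = p(y | x) T
      return F.nll_loss(torch.log(noisy_probs + 1e-8), noisy_labels)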

Train LCCN

  • Train LCCN
  python cifar10_train_varC.py --init_dir results/events_ce/cifar10_train --train_dir results/events_varC/cifar10_train --noise_rate 0.3 # You can train other models like this one
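Dynamic label regression alternates between sampling the latent true labels with a Gibbs sampler and updating the noise transition from whole-dataset statistics. The following is a heavily simplified single-step sketch under our own assumptions (a Dirichlet concentration alpha and a running count matrix C of sampled (true, noisy) label pairs); the actual implementation is in cifar10_train_varC.py:

  import torch
  import torch.nn.functional as F

  def lccn_step(logits, noisy_labels, C, alpha=1.0):
      """One simplified Gibbs-style step: sample latent true labels z from
      p(z | x, noisy_y) ∝ p(z | x) * T[z, noisy_y], then update the counts
      that parameterize the transition under the Dirichlet prior."""
      # current transition estimate = Dirichlet posterior mean, row-wise
      T = (C + alpha) / (C + alpha).sum(dim=1, keepdim=True)
      post = F.softmax(logits.detach(), dim=1) * T[:, noisy_labels].t()  # (batch, K)
      post = post / post.sum(dim=1, keepdim=True)
      z = torch.multinomial(post, 1).squeeze(1)      # sampled latent true labels
      # accumulate the newly sampled (true, noisy) pairs into the counts
      C.index_put_((z, noisy_labels),
                   torch.ones_like(z, dtype=C.dtype), accumulate=True)
      # train the classifier against the sampled labels
      loss = F.cross_entropy(logits, z)
      return loss, C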

Train DivideLCCN

To train DivideLCCN on CIFAR-10/CIFAR-100 with different noise types/ratios, simply run the commands below (the loss-based sample division it inherits from DivideMix is sketched after the list):

  • Train DivideLCCN, sym noise, CIFAR-10
  python divide_train_varC.py --noise_mode sym --r 0.2 --dataset cifar10 --num_class 10 --data_dir ${data_dir} # You can train other models like this one
  • Train DivideLCCN, asym noise, CIFAR-10
  python divide_train_varC.py --noise_mode asym --r 0.1 --dataset cifar10 --num_class 10 --data_dir ${data_dir} # You can train other models like this one
  • Train DivideLCCN, sym noise, CIFAR-100
  python divide_train_varC.py --noise_mode sym --r 0.2 --dataset cifar100 --num_class 100 --data_dir ${data_dir} # You can train other models like this one
  • Train DivideLCCN, asym noise, CIFAR-100
  python divide_train_varC.py --noise_mode asym --r 0.1 --dataset cifar100 --num_class 100 --data_dir ${data_dir} # You can train other models like this one
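DivideLCCN inherits the sample-division idea of DivideMix: fit a two-component Gaussian mixture model on per-sample training losses and treat the small-loss component as probably clean. A minimal sketch of that division step, assuming per-sample losses have already been collected (the threshold and function name are ours):

  import numpy as np
  from sklearn.mixture import GaussianMixture

  def divide_by_loss(losses, clean_threshold=0.5):
      """Fit a 2-component GMM on normalized per-sample losses; return a boolean
      mask of samples likely drawn from the small-loss (clean) component."""
      losses = np.asarray(losses, dtype=np.float64).reshape(-1, 1)
      losses = (losses - losses.min()) / (losses.max() - losses.min() + 1e-8)
      gmm = GaussianMixture(n_components=2, max_iter=100, reg_covar=5e-4)
      gmm.fit(losses)
      clean_component = gmm.means_.argmin()          # component with smaller mean loss
      prob_clean = gmm.predict_proba(losses)[:, clean_component]
      return prob_clean > clean_threshold, prob_clean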

Results (please refer to the paper for more results)

Citation

@article{yao2023latent,
  title={Latent Class-Conditional Noise Model},
  author={Yao, Jiangchao and Han, Bo and Zhou, Zhihan and Zhang, Ya and Tsang, Ivor W},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2023},
  publisher={IEEE}
}

Acknowledgement

We borrow some code from LCCN and DivideMix.
