SED: Foster Adaptivity and Balance in Learning with Noisy Labels

Abstract: Label noise is ubiquitous in real-world scenarios, posing a practical challenge to supervised models due to its effect in hurting the generalization performance of deep neural networks. Existing methods primarily employ the sample selection paradigm and usually rely on dataset-dependent prior knowledge (e.g., a pre-defined threshold) to cope with label noise, inevitably degrading the adaptivity. Moreover, existing methods tend to neglect the class balance in selecting samples, leading to biased model performance. To this end, we propose a simple yet effective approach named SED to deal with label noise in a Self-adaptivE and class-balanceD manner. Specifically, we first design a novel sample selection strategy to empower self-adaptivity and class balance when identifying clean and noisy data. A mean-teacher model is then employed to correct labels of noisy samples. Subsequently, we propose a self-adaptive and class-balanced sample re-weighting mechanism to assign different weights to detected noisy samples. Finally, we additionally employ consistency regularization on selected clean samples to improve model generalization performance. Extensive experimental results on synthetic and real-world datasets demonstrate the effectiveness and superiority of our proposed method.

Pipeline

Installation

pip install -r requirements.txt

Datasets

We conduct noise robustness experiments on two synthetically corrupted datasets (i.e., CIFAR100N and CIFAR80N) and three real-world datasets (i.e., Web-Aircraft, Web-Car and Web-Bird. Specifically, we create the closed-set noisy dataset CIFAR100N and the open-set noisy dataset CIFAR80N based on CIFAR100. To make the open-set noisy dataset CIFAR80N, we regard the last 20 categories in CIFAR100 as out-of-distribution. We adopt two classic noise structures: symmetric and asymmetric, with a noise ratio $n \in (0,1)$.

You can download the CIFAR10 and CIFAR100 on this.

You can download the Clothing1M from here.

Training

An example shell script to run SED on CIFAR-100N :

CUDA_VISIBLE_DEVICES=0 python main.py --warmup-epoch 20 --epoch 100 --batch-size 128 --lr 0.05 --warmup-lr 0.05  --noise-type symmetric --closeset-ratio 0.2 --lr-decay cosine:20,5e-4,100  --opt sgd --dataset cifar100nc

An example shell script to run SED on CIFAR-80N :

CUDA_VISIBLE_DEVICES=0 python main.py --warmup-epoch 20 --epoch 100 --batch-size 128 --lr 0.05 --warmup-lr 0.05  --noise-type symmetric --closeset-ratio 0.2 --lr-decay cosine:20,5e-4,100  --opt sgd --dataset cifar80no

Here is an example shell script to run SED on Web-Aircraft :

CUDA_VISIBLE_DEVICES=0 python main_web.py --warmup-epoch 10 --epoch 110 --batch-size 32 --lr 0.005 --warmup-lr 0.005  --lr-decay cosine:10,5e-4,110 --weight-decay 5e-4 --seed 123 --opt sgd --dataset web-bird --SSL True --gpu 0 --pretrain True

Results on Cifar100N and Cifar80N:

Results on Web-Aircraft, Web-Bird, and Web-Car:

Effects of different components in test accuracy (%) on CIFAR100N (noise rate and noise type are 0.5 and symmetric, respectively)

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
model		model
utils		utils
Figure.png		Figure.png
README.md		README.md
Table1.png		Table1.png
Table2.png		Table2.png
Table3.png		Table3.png
main.py		main.py
main_web.py		main_web.py
requirements.txt		requirements.txt
run.sh		run.sh
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SED: Foster Adaptivity and Balance in Learning with Noisy Labels

Pipeline

Installation

Datasets

Training

Results on Cifar100N and Cifar80N:

Results on Web-Aircraft, Web-Bird, and Web-Car:

Effects of different components in test accuracy (%) on CIFAR100N (noise rate and noise type are 0.5 and symmetric, respectively)

About

Releases

Packages

Languages

NUST-Machine-Intelligence-Laboratory/SED

Folders and files

Latest commit

History

Repository files navigation

SED: Foster Adaptivity and Balance in Learning with Noisy Labels

Pipeline

Installation

Datasets

Training

Results on Cifar100N and Cifar80N:

Results on Web-Aircraft, Web-Bird, and Web-Car:

Effects of different components in test accuracy (%) on CIFAR100N (noise rate and noise type are 0.5 and symmetric, respectively)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages