This repository contains the code for the paper "Mitigating Spurious Correlations with Causal Logit Perturbation".
Deep learning has achieved widespread success in domains spanning science, industry, and society. However, it is widely acknowledged that deep models can be non-robust, relying on spurious correlations for their predictions. Addressing this limitation is of paramount importance and necessitates methods that can disentangle spurious correlations. This study implements causal models via logit perturbation and introduces a novel Causal Logit Perturbation (CLP) framework that trains classifiers with generated causal logit perturbations for individual samples, thereby mitigating spurious associations between non-causal attributes (e.g., image backgrounds) and classes. Our framework employs a perturbation network to generate sample-wise logit perturbations, taking a series of training characteristics of each sample as input. The whole framework is optimized by an online meta-learning-based algorithm and leverages human causal knowledge by augmenting metadata in both counterfactual and factual manners. Empirical evaluations on four typical biased learning scenarios, namely long-tail learning, noisy label learning, generalized long-tail learning, and subpopulation shift learning, demonstrate that CLP consistently achieves state-of-the-art performance. Moreover, visualization results confirm that the generated causal perturbations redirect model attention toward causal image attributes and dismantle spurious associations.
The overall structure of CLP consists of four main components: the metadata augmentation module, the backbone classifier, the training characteristics module, and the perturbation network, which generates sample-wise logit perturbations. The red and green lines in the figure indicate the learning loops of the backbone classifier and the perturbation network, respectively.
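For orientation, below is a minimal, hypothetical sketch of the core idea in PyTorch: a small perturbation network maps per-sample training characteristics to an additive logit offset, and the classifier is trained on the perturbed logits. All names here (`PerturbationNet`, `perturbed_loss`, the choice of characteristics) are illustrative assumptions, not the repository's actual interfaces.

```python
# Minimal sketch of sample-wise causal logit perturbation (illustrative only;
# the actual modules are defined in this repository).
import torch.nn as nn
import torch.nn.functional as F

class PerturbationNet(nn.Module):
    """Maps per-sample training characteristics to an additive logit offset."""
    def __init__(self, num_characteristics, num_classes, hidden=64):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(num_characteristics, hidden),
            nn.ReLU(),
            nn.Linear(hidden, num_classes),
        )

    def forward(self, characteristics):
        # characteristics: (batch, num_characteristics), e.g., per-sample loss
        # or margin statistics collected during training.
        return self.mlp(characteristics)  # (batch, num_classes) perturbations

def perturbed_loss(logits, characteristics, targets, pert_net):
    # The generated causal perturbation is added to the raw logits before the
    # cross-entropy loss, steering the classifier away from spurious cues.
    return F.cross_entropy(logits + pert_net(characteristics), targets)

# Training alternates between the two learning loops (red/green in the figure):
# (1) virtually update the classifier with the perturbed training loss,
# (2) update the perturbation network on the augmented-metadata loss,
# (3) apply the actual classifier update with the perturbed loss.
```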
- PyTorch >= 1.2.0
- Python 3
- torchvision
- PIL (Pillow)
- argparse
- numpy
For the long-tailed and noisy CIFAR data, the generation steps follow those in Meta-Weight-Net; a minimal sketch of the long-tail sampling profile follows this list.
- Long-tailed CIFAR10/100: the long-tailed version of CIFAR10/100. The detailed generation process can be found in "./data/cifar/CIFAR_process.py".
- Noisy CIFAR10/100: the noisy version of CIFAR10/100. The detailed generation process can be found in the same file, "./data/cifar/CIFAR_process.py".
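As a reference, the standard long-tailed CIFAR construction (following Meta-Weight-Net) subsamples each class along an exponential profile. The sketch below is illustrative; the function name and defaults are assumptions, and the authoritative code is "./data/cifar/CIFAR_process.py".

```python
# Sketch of the exponential class-imbalance profile for long-tailed CIFAR
# (illustrative; see ./data/cifar/CIFAR_process.py for the actual code).
def img_num_per_cls(num_classes=100, img_max=500, imb_factor=10):
    # Class i keeps img_max * (1/imb_factor)^(i / (num_classes - 1)) images,
    # so class 0 keeps img_max images and the last class img_max / imb_factor.
    return [int(img_max * (1.0 / imb_factor) ** (i / (num_classes - 1.0)))
            for i in range(num_classes)]

# CIFAR-100 with imbalance factor 10: 500 images for class 0 down to 50 for class 99.
counts = img_num_per_cls(num_classes=100, img_max=500, imb_factor=10)
```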
For the generalized long-tail benchmarks, ImageNet-GLT and MSCOCO-GLT, the generation steps follow those in IFL.
- ImageNet-GLT: the generalized long-tail version of ImageNet. The detailed generation process can be found in "./data/GLT/_ImageNetGeneration".
- MSCOCO-GLT: the generalized long-tail version of MSCOCO. The detailed generation process can be found in "./data/GLT/_COCOGeneration".
For the subpopulation shift benchmark, the acquisition process follows that of Chang et al.
- Waterbirds: detailed processing steps can be found in "./data/subpopulation/waterbirds_datasets.py".
We provide training examples using the CIFAR data:
Training on CIFAR-LT-10/100:
For CIFAR100-LT:
python CLP_train.py --imb_factor 10 --dataset cifar100 --num_classes 100
- The lambda value that controls the strength of the saliency regularization term can be changed in Line 44 of "CLP_train.py".
- The infilling manner for the causal (foreground) and non-causal (background) components of images can be changed in Lines 55-56 of "CLP_train.py" (see the sketch after this subsection).
- The volume of metadata can be changed in Line 38 of "CLP_train.py".
Or run the script:
sh train.sh
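As referenced in the list above, metadata is augmented in counterfactual and factual manners by infilling the causal (foreground) and non-causal (background) image components. Below is a hedged sketch of this idea with a constant-fill infilling chosen purely for illustration; the actual infilling manners are configured in Lines 55-56 of "CLP_train.py", and the function names here are assumptions.

```python
# Illustrative sketch of counterfactual/factual metadata augmentation via
# infilling (not the repository's actual implementation).
import torch

def infill(image, mask, fill_value=0.5):
    # Replace the masked region with a constant fill -- one simple infilling
    # choice; other manners (blur, noise, generative infilling) also fit here.
    return image * (1 - mask) + fill_value * mask

def augment_metadata(image, fg_mask):
    # image: (C, H, W); fg_mask: (1, H, W) with 1 on the causal foreground.
    counterfactual = infill(image, fg_mask)       # causal foreground removed
    factual = infill(image, 1 - fg_mask)          # non-causal background removed
    return counterfactual, factual
```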
Training on Noisy CIFAR10/100 with flip noise:
For Noisy CIFAR100:
python CLP_train.py --corruption_type flip2 --corruption_ratio 0.2 --dataset cifar100 --num_classes 100
Or run the script:
sh train-noise.sh
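For reference, "flip" noise conventionally flips a fraction of each class's labels to a small fixed set of other classes (two for `flip2`), as in Meta-Weight-Net. The sketch below is a hedged illustration of that construction; the repository's exact corruption code may differ, e.g., in how the two target classes are chosen.

```python
# Illustrative sketch of flip2 label noise (assumption: the two target
# classes per source class are chosen deterministically here for simplicity;
# the repository may sample them randomly).
import numpy as np

def flip2_labels(labels, num_classes, corruption_ratio=0.2, seed=0):
    rng = np.random.RandomState(seed)
    labels = np.asarray(labels).copy()
    for c in range(num_classes):
        idx = np.where(labels == c)[0]
        # Corrupt a `corruption_ratio` fraction of this class's samples.
        corrupt_idx = rng.choice(idx, int(corruption_ratio * len(idx)),
                                 replace=False)
        # Split the corrupted samples between two other classes.
        half = len(corrupt_idx) // 2
        labels[corrupt_idx[:half]] = (c + 1) % num_classes
        labels[corrupt_idx[half:]] = (c + 2) % num_classes
    return labels
```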
To evaluate a trained model, run:
python test.py
Results are reported for four biased learning scenarios: long-tail learning, noisy label learning, subpopulation shift learning, and generalized long-tail learning. For the full result tables, please refer to our paper.