
Targeted Supervised Contrastive Learning for Long-Tailed Recognition

This repository contains the implementation code for the paper:
Targeted Supervised Contrastive Learning for Long-Tailed Recognition
Tianhong Li*, Peng Cao*, Yuan Yuan, Lijie Fan, Yuzhe Yang, Rogerio Feris, Piotr Indyk, Dina Katabi
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022)
[Paper]

This repository also contains an unofficial reimplementation of the paper:
Exploring Balanced Feature Spaces for Representation Learning
Bingyi Kang, Yu Li, Sa Xie, Zehuan Yuan, Jiashi Feng
The Ninth International Conference on Learning Representations (ICLR 2021)

If you find this code or idea useful, please consider citing our work:

@article{li2021targeted,
  title={Targeted Supervised Contrastive Learning for Long-Tailed Recognition},
  author={Li, Tianhong and Cao, Peng and Yuan, Yuan and Fan, Lijie and Yang, Yuzhe and Feris, Rogerio and Indyk, Piotr and Katabi, Dina},
  journal={arXiv preprint arXiv:2111.13998},
  year={2021}
}


With a high imbalance ratio, the class centers learned by KCL exhibit poor uniformity, while the class centers learned by TSC remain uniformly distributed; as a result, TSC achieves better performance.
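
The uniformity of learned class centers can be checked directly from extracted features. The sketch below is not part of this repository (the function name and interface are illustrative): it computes the pairwise cosine similarities between L2-normalized class means, where uniformly spread centers yield low, tightly clustered similarities and collapsed centers yield similarities close to 1.

# Hypothetical sketch (not part of this repository): quantify how uniformly a
# set of class centers is spread on the unit hypersphere by inspecting the
# pairwise cosine similarities between L2-normalized class means.
import numpy as np

def class_center_uniformity(features, labels):
    """features: (num_samples, dim) array; labels: (num_samples,) int array."""
    classes = np.unique(labels)
    centers = np.stack([features[labels == c].mean(axis=0) for c in classes])
    centers = centers / np.linalg.norm(centers, axis=1, keepdims=True)
    sim = centers @ centers.T                        # pairwise cosine similarity
    off_diag = sim[~np.eye(len(classes), dtype=bool)]
    # Uniform centers -> low mean/max similarity; collapsed centers -> values near 1.
    return off_diag.mean(), off_diag.max()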

Preparation

Prerequisites

  • Download the ImageNet dataset and place it in your data_root. The long-tailed version will be created using the train/val splits (.txt files) in the corresponding subfolders under imagenet_inat/data/.
  • Change the data_root and save_folder in main_moco_supcon_imba.py and main_lincls_imba.py accordingly for ImageNet-LT.
  • All training runs were tested on 4 Titan X GPUs.

Dependencies

  • PyTorch (>= 1.6, tested on 1.6)
  • scikit-learn
  • tensorboard-logger

Usage

Training of KCL/TSC consists of two steps: first, the feature encoder (ResNet-50) is pre-trained using the KCL/TSC loss; second, a linear classifier is added on top of the feature encoder and fine-tuned using the cross-entropy loss.

Target generation:

To generate targets for TSC pre-training, use

python target_generation.py

The generated targets will be stored at optimal_{N}_{M}.npy, where N is the number of classes and M is the dimension of the output features.
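
For intuition, the following is a rough, hypothetical sketch of what such a target-generation step could look like (the actual target_generation.py may use a different objective and optimizer): N unit vectors in M dimensions are optimized so that their pairwise cosine similarities stay as small as possible, then saved in the optimal_{N}_{M}.npy format described above.

# Illustration of the idea behind target generation (the real target_generation.py
# in this repository may differ): spread N unit vectors in M dimensions by
# penalizing large pairwise cosine similarities, then save them to optimal_{N}_{M}.npy.
import numpy as np
import torch

def generate_uniform_targets(num_classes, feat_dim, steps=5000, lr=0.1):
    targets = torch.randn(num_classes, feat_dim, requires_grad=True)
    optimizer = torch.optim.SGD([targets], lr=lr)
    for _ in range(steps):
        optimizer.zero_grad()
        t = torch.nn.functional.normalize(targets, dim=1)
        sim = t @ t.t()                               # pairwise cosine similarity
        sim = sim - 2.0 * torch.eye(num_classes)      # push self-similarity out of the way
        loss = torch.logsumexp(sim.flatten(), dim=0)  # penalize the largest similarities
        loss.backward()
        optimizer.step()
    return torch.nn.functional.normalize(targets.detach(), dim=1).numpy()

if __name__ == "__main__":
    N, M = 1000, 128  # e.g. ImageNet-LT classes and the --moco-dim used below
    np.save(f"optimal_{N}_{M}.npy", generate_uniform_targets(N, M))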

1st stage pre-training:

KCL

python main_moco_supcon_imba.py --cos --epochs 200 --K 6 --moco-dim 128 --mlp \
  -a resnet50 --name [YOUR PREFERRED NAME] \
  --lr 0.1 --batch-size 256 \
  --dist-url 'tcp://localhost:10000' --multiprocessing-distributed --world-size 1 --rank 0 \
  --dataset imagenet

TSC

python main_moco_supcon_imba.py --cos --epochs 400 --K 6 --moco-dim 128 --mlp --targeted --tr 1 --sep_t --tw 0.2 \
  -a resnet50 --name [YOUR PREFERRED NAME] \
  --lr 0.1 --batch-size 256 \
  --dist-url 'tcp://localhost:10000' --multiprocessing-distributed --world-size 1 --rank 0 \
  --dataset imagenet

2nd stage fine-tuning:

python main_lincls_imba.py  --dataset imagenet  --pretrained [PRETRAINED MODEL PATH FROM 1ST STAGE] --epochs 40 --schedule 20 30 --seed 0 -b 2048
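
For reference, the core idea of the second stage is a linear probe: load the pretrained encoder, freeze it, and train only a fresh linear classifier with cross-entropy. The sketch below is an illustration under assumed checkpoint key names (the "state_dict" key and "module.encoder_q." prefix of MoCo-style checkpoints are assumptions); main_lincls_imba.py handles this end-to-end, including distributed training.

# Minimal sketch of the second-stage idea; the checkpoint layout assumed here
# ("state_dict" key, "module.encoder_q." prefix) is illustrative, not definitive.
import torch
import torchvision

def build_linear_probe(checkpoint_path, num_classes=1000):
    model = torchvision.models.resnet50()
    state = torch.load(checkpoint_path, map_location="cpu")["state_dict"]
    # Keep only the encoder weights from the MoCo-style checkpoint.
    encoder_state = {k.replace("module.encoder_q.", ""): v
                     for k, v in state.items() if k.startswith("module.encoder_q.")}
    model.load_state_dict(encoder_state, strict=False)
    for p in model.parameters():                    # freeze the pretrained encoder ...
        p.requires_grad = False
    model.fc = torch.nn.Linear(2048, num_classes)   # ... and add a fresh linear head
    return model

# Only model.fc.parameters() would then be passed to the optimizer and trained
# with torch.nn.CrossEntropyLoss on the long-tailed training set.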

Results

Here we show the accuracy of KCL and TSC on ImageNet-LT over 5 trials:

Method | Trial 1 | Trial 2 | Trial 3 | Trial 4 | Trial 5
KCL    | 51.2    | 51.6    | 51.5    | 51.2    | 51.3
TSC    | 52.3    | 53.0    | 52.6    | 52.1    | 52.2

NOTE: the many/medium/minor class accuracies can change significantly with a different learning rate or batch size in the second stage, while the overall accuracy remains roughly the same.

Acknowledgement

This code builds on code from MoCo and Classifier-Balancing.

Contact

If you have any questions, feel free to contact me through email (tianhong@mit.edu) or GitHub issues. Enjoy!
