OSP

The implementation of the CVPR 2023 paper "Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning".

Overview

Recent advances in robust semi-supervised learning (SSL) typically filter out-of-distribution (OOD) information at the sample level. We argue that an overlooked problem of robust SSL is its corrupted information at the semantic level, which practically limits the development of the field. In this paper, we take an initial step to explore and propose a unified framework termed OOD Semantic Pruning (OSP), which aims at pruning OOD semantics out of in-distribution (ID) features. Specifically, (i) we propose an aliasing OOD matching module to pair each ID sample with an OOD sample that has semantic overlap. (ii) We design a soft orthogonality regularization, which first transforms each ID feature by suppressing its semantic component that is collinear with the paired OOD sample, and then forces the predictions before and after this soft orthogonality decomposition to be consistent. Despite its simplicity, our method shows strong performance in OOD detection and ID classification on challenging benchmarks. In particular, OSP surpasses the previous state-of-the-art by 13.7% in accuracy for ID classification and 5.9% in AUROC for OOD detection on the TinyImageNet dataset. Code is available in this repository.
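
The soft orthogonality step can be summarized in a few lines. The sketch below is illustrative only (PyTorch-style code with our own naming, not the repository's implementation): it removes from each ID feature the component collinear with its paired OOD feature, then penalizes inconsistency between the predictions made before and after the projection; the aliasing OOD matching module and loss weighting are omitted.

import torch
import torch.nn.functional as F

def prune_ood_component(id_feat, ood_feat):
    # Subtract from id_feat its projection onto the paired OOD direction.
    ood_dir = F.normalize(ood_feat, dim=-1)                 # unit-norm OOD direction
    coeff = (id_feat * ood_dir).sum(dim=-1, keepdim=True)   # collinear magnitude
    return id_feat - coeff * ood_dir                        # orthogonalized ID feature

def soft_orthogonality_consistency(classifier, id_feat, ood_feat):
    # Consistency between predictions before and after pruning. KL divergence is
    # an illustrative choice here; the paper's exact consistency term may differ.
    logits_raw = classifier(id_feat)
    logits_pruned = classifier(prune_ood_component(id_feat, ood_feat))
    return F.kl_div(F.log_softmax(logits_pruned, dim=-1),
                    F.softmax(logits_raw, dim=-1),
                    reduction="batchmean")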

Quick Start

Dataset Preparation

Download the CIFAR-100 dataset and place it under the directory data.
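
If you prefer to fetch the dataset programmatically, the torchvision snippet below downloads CIFAR-100 into ./data; whether the repository's loaders read this exact layout is an assumption, so adjust the path if needed.

from torchvision.datasets import CIFAR100

# Download both splits into ./data (directory name taken from the instruction above).
CIFAR100(root="data", train=True, download=True)
CIFAR100(root="data", train=False, download=True)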

Training

Train the model on the CIFAR-100 dataset with 100 labeled images per class and 20,000 unlabeled samples. Here, the first 50 classes serve as ID categories and the last 50 classes as OOD categories. The mismatch ratio $\gamma$ (args.ratio) is 0.3 or 0.6.

export CUDA_VISIBLE_DEVICES="1"
# stage 1
python ours_stage1.py \
--dataset cifar100 \
--arch wideresnet \
--batch-size 64 \
--expand-labels \
--seed 5 \
--ratio 3 \
--total-steps 50000 \
--eval-step 500 \
--out results/cifar100@100_r3/ours_stage1

# stage 2
python cifar100_ours_stage2.py \
--dataset cifar100 \
--ood 1 \
--arch wideresnet \
--batch-size 64 \
--mu 5 \
--expand-labels \
--seed 5 \
--ratio 3 \
--total-steps 100000 \
--eval-step 1000 \
--resume results/cifar100@100_r3/ours_stage1/model_best.pth.tar \
--out results/cifar100@100_r3/ours_stage2

Acknowledgement

Our code is based on T2T.

Citation

@InProceedings{Wang_2023_CVPR,
    author    = {Wang, Yu and Qiao, Pengchong and Liu, Chang and Song, Guoli and Zheng, Xiawu and Chen, Jie},
    title     = {Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2023},
    pages     = {23849-23858}
}
