Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels? [ICCV 2023]

Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?
Cheng-En Wu, Yu Tian, Haichao Yu, Heng Wang, Pedro Morgado, Yu Hen Hu, Linjie Yang

Introduction

This repo is the official implementation of Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?.

Install

Set up a conda environment (recommended):

############ Conda Environment Installation ############

# Fetch the miniconda script
export HOME=$PWD
wget -q https://repo.anaconda.com/miniconda/Miniconda3-py37_4.12.0-Linux-x86_64.sh -O miniconda.sh
sh miniconda.sh -b -p $HOME/miniconda3
rm miniconda.sh
export PATH=$HOME/miniconda3/bin:$PATH

# Initialize conda
source $HOME/miniconda3/etc/profile.d/conda.sh
hash -r
conda config --set always_yes yes --set changeps1 yes
conda activate base  # the dassl environment is created and activated below

############ Dassl Installation ############

# Clone the Dassl repository
git clone https://github.com/KaiyangZhou/Dassl.pytorch.git
cd Dassl.pytorch/
git reset --hard ac6e44194b2f90e325f477aadd6d9bc3a92ce255

# Establish a new conda environment
conda create -n dassl python=3.7

# Activate the new environment
conda activate dassl

# Install the required dependencies
pip install -r requirements.txt

# Install PyTorch (version 1.11.0 or above) and torchvision
conda install pytorch==1.11.0 torchvision==0.12.0 cudatoolkit=11.3 -c pytorch

# Set up the Dassl library (No need to rebuild even if the source code changes)
python setup.py develop

############ PTNL Installation ############

# Navigate back to the parent directory
cd ..

# Clone the PTNL repository
git clone https://github.com/CEWu/PTNL
cd PTNL/

# Install necessary packages for CLIP
pip install -r requirements.txt

######## Note ########
# Two symbolic links, `data` and `temp_analyze_results_miltiple`, exist in the repository. It is recommended that these be pointed to locations with sufficient storage capacity.

rm data temp_analyze_results_miltiple # remove the existing links
ln -s ${your_data_path} ./data
ln -s ${your_temp_analyze_results_miltiple_path} ./temp_analyze_results_miltiple

# Installation complete
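
The relinking step in the note above can be wrapped in a small helper. This is a minimal sketch, not part of the repository: `relink` is a hypothetical function name, and the storage paths in the usage comments are placeholders to replace with your own locations.

```shell
# Hypothetical helper (not part of PTNL): point a symlink at a new target,
# creating the target directory first if it does not exist yet.
relink() {
    target="$1"   # directory on a volume with enough free space
    link="$2"     # link name inside the repository, e.g. ./data
    mkdir -p "$target"
    # -s: symbolic link, -f: replace an existing link,
    # -n: treat an existing link to a directory as a file instead of following it
    ln -sfn "$target" "$link"
}

# Example usage from inside PTNL/ (paths are placeholders):
# relink /path/to/large/disk/ptnl_data ./data
# relink /path/to/large/disk/ptnl_results ./temp_analyze_results_miltiple
```

Using `ln -sfn` replaces the separate `rm` step, so the helper can be re-run safely if the storage location changes.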

Datasets

Please follow the instructions at CoOp Datasets Instructions to prepare all datasets.

Training

Obtain Pseudo-labels and Save the Model's Logits

CUDA_VISIBLE_DEVICES=0 bash get_info.sh sscaltech101 anay_rn50 end 16 -1 False

Prompt Tuning with Noisy Labels

Train with noisy labels (2 of the 16 training shots per class are mislabeled):

CUDA_VISIBLE_DEVICES=0 bash upl_train.sh sscaltech101 rn50_ep50 end 16 16 False True rn50_random_init 2

9 arguments listed sequentially as follows:

dataset config (others in configs/datasets)
model config (only rn50_ep50)
class token position (end or middle)
number of context tokens
number of shots (1, 2, 4, 8, 16)
class-specific context (False or True)
CLASS_EQULE (True or False)
log tag (only rn50_random_init)
number of false positive training samples per class
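
Because the nine arguments are positional, it is easy to misplace one. The sketch below names each position explicitly and prints the resulting command instead of running it. `train_cmd` is a hypothetical wrapper, not a script shipped with the repository, and the noise levels in the loop are illustrative values only.

```shell
# Hypothetical wrapper (not part of PTNL): print the upl_train.sh command for a
# given dataset, shot count, and number of noisy samples per class.
# The other six arguments are fixed to the values used in the example above.
train_cmd() {
    dataset="$1"            # 1: dataset config
    model="rn50_ep50"       # 2: model config
    position="end"          # 3: class token position
    n_ctx=16                # 4: number of context tokens
    shots="$2"              # 5: number of shots
    csc="False"             # 6: class-specific context
    class_equal="True"      # 7: CLASS_EQULE
    tag="rn50_random_init"  # 8: log tag
    noisy="$3"              # 9: false positive training samples per class
    echo "bash upl_train.sh $dataset $model $position $n_ctx $shots $csc $class_equal $tag $noisy"
}

# Print the commands for several noise levels (illustrative values).
for n in 0 2 4 8; do
    train_cmd sscaltech101 16 "$n"
done
```

To launch a run rather than print it, the printed line can be prefixed with `CUDA_VISIBLE_DEVICES=0` and executed from the repository root.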

P.S. The scripts directory contains eight scripts (upl_train_*.sh) that share the 16 seeds among them, which speeds up training on 8 GPUs.

Ensemble Testing

Test with the existing files after prompt tuning with noisy labels (2 of the 16 training shots per class are mislabeled):

bash upl_test_existing_logits.sh sscaltech101 rn50_ep50 end 16 16 False True 2

8 arguments listed sequentially as follows:

dataset config (others in configs/datasets)
model config (only rn50_ep50)
class token position (end or middle)
number of context tokens
number of shots (1, 2, 4, 8, 16)
class-specific context (False or True)
CLASS_EQULE (True or False)
number of false positive training samples per class
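
Comparing the two argument lists, the test script takes the same arguments as the training script except for the log tag (training argument 8). A minimal sketch of that mapping follows. `test_cmd_from_train_args` is a hypothetical helper, not part of the repository, and it only prints the command.

```shell
# Hypothetical helper (not part of PTNL): given the nine upl_train.sh arguments,
# print the matching upl_test_existing_logits.sh command.
test_cmd_from_train_args() {
    if [ "$#" -ne 9 ]; then
        echo "expected 9 training arguments, got $#" >&2
        return 1
    fi
    # Arguments 1-7 are shared, argument 8 (log tag) is dropped, argument 9 is kept.
    echo "bash upl_test_existing_logits.sh $1 $2 $3 $4 $5 $6 $7 $9"
}

# Example usage with the training arguments shown earlier:
# test_cmd_from_train_args sscaltech101 rn50_ep50 end 16 16 False True rn50_random_init 2
```

Keeping the training arguments as the single source makes it harder for the test run to drift from the configuration that produced the saved logits.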

Citation

If you find our work beneficial for your research, please consider citing:

@inproceedings{wu2023ptnl,
    title={Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?},
    author={Wu, Cheng-En and Tian, Yu and Yu, Haichao and Wang, Heng and Morgado, Pedro and Hu, Yu Hen and Yang, Linjie},
    booktitle={ICCV},
    year={2023}
}

Acknowledgment

This repository is based on UPL.
