OpenPT: Towards Open-World Prompt Tuning

Note: Our paper is currently under review, and more details will be announced in due course.

Abstract

Prompt Tuning (PT) has emerged as a promising parameter-efficient paradigm for adapting pre-trained vision-language models (VLMs) to downstream tasks. However, largely built on closed-world assumptions, existing approaches simultaneously suffer from the Base-New Tradeoff (BNT), OOD Overconfidence, and Knowledge Evolution Deficiency, severely compromising their generalizability and reliability in dynamic, open-world environments. In this work, we present Open-World Prompt Tuning (OpenPT), a novel framework for achieving generalizable and reliable adaptation of VLMs in the open world. Specifically, we first reveal that the BNT problem stems from a channel bias issue, and present Decoupled Prompt Tuning with Simplex Equiangular Tight Frame (DePT++), which improves base-to-new generalization by decoupling base-specific and task-shared knowledge into two isolated feature spaces. Building upon DePT++, we then introduce Collaborative Energy-based OOD Detection (CE-OOD), which achieves precise OOD detection by integrating complementary energy scores from the two decoupled spaces. Finally, we develop Pseudo-Class Guided Class-Incremental Learning (PC-CIL) to facilitate the continual learning of new class knowledge by assigning pseudo-class names to OOD samples and enforcing strict geometric separation between novel and established classes. Remarkably, OpenPT can be used as a plugin to improve existing PT methods. Extensive results on a broad spectrum of baselines, datasets, and evaluation metrics demonstrate the effectiveness and flexibility of OpenPT.

Method

Specifically, we first identify channel bias as the structural origin of the BNT dilemma. To overcome this, we propose Decoupled Prompt Tuning with Simplex Equiangular Tight Frame (DePT++), a novel framework that resolves the dilemma by decoupling base-specific discrimination and task-shared semantics into two isolated feature spaces. Based on DePT++, we then devise the Collaborative Energy-based OOD Detection (CE-OOD) scheme, which achieves precise OOD detection by integrating complementary energy scores from the two decoupled spaces. Finally, we devise the Pseudo-Class Guided Class-Incremental Learning (PC-CIL) strategy that facilitates the continual learning of new classes by assigning pseudo-class names to OOD samples and distancing new class prototypes from old ones in a shared feature space.

Main Contributions

We reveal that the Base-New Tradeoff (BNT) problem in prompt tuning stems from a channel biasissue and propose DePT++ to overcome it from a feature decoupling perspective.

Based on DePT++, we present OpenPT, a novel and plug-and-play framework for achieving generalizable and reliable VLM adaptation in the open world.

We demonstrate OpenPT's effectiveness and flexibility using a broad spectrum of baselines, datasets, and evaluation metrics.

Flexibility and Effectiveness

Our OpenPT is orthogonal to both prompt tuning and adapter tuning approaches, therefore can be used as a plugin to improve all of them.

Installation

This codebase is tested on Ubuntu 20.04.2 LTS with python 3.8. Follow the below steps to create environment and install dependencies.

Setup conda environment (recommended).

Create a conda environment

conda create -y -n openpt python=3.8
conda activate openpt

Install torch (requires version >= 1.8.1) and torchvision

pip install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio==0.9.0 -f https://download.pytorch.org/whl/torch_stable.html

Install dassl

git clone https://github.com/KaiyangZhou/Dassl.pytorch.git
cd Dassl.pytorch/
pip install -r requirements.txt
python setup.py develop

Install OpenPT

cd ..

git clone https://github.com/heyhey24/OpenPT.git
cd OpenPT/

pip install -r requirements.txt
pip install setuptools==59.5.0

Data preparation

Please follow the instructions at DATASETS.md to prepare all datasets. In addition to preparing the data as required, you also need to copy the openset_ood_class.json file for each dataset from the ood class folder in the code repository to the corresponding dataset folder, for example, Caltech101:

caltech-101/
|–– 101_ObjectCategories/
|–– split_zhou_Caltech101.json
|–– openset_ood_class.json

Training and Evaluation

We provide parallel running script parallel_runner.py for each prompting variant including CoOp (w/ DePT). Make sure to configure the dataset paths in environment variable DATA and run the commands from the main directory.

Base to New Generalization

# Running CoOp (w/ DePT/DePT++)
python parallel_runner.py --cfg coop
python parallel_runner.py --cfg coop_dept
python parallel_runner.py --cfg coop_dept_etf

If the base model trained on the base-to-new generalization task has been obtained using the above command, you can then perform training and evaluation under OpenPT-bench with the following command.

Openset experiments

python parallel_runner.py --cfg baselines_coop   
python parallel_runner.py --cfg coop_openpt

After running, the output will be in the outputs/ directory, the results will be tallied in the results/ directory as csv, and a mail will be sent to email address.

If you want to add your own models, you'll need to write your models in the trainers/ directory and register them in dassl, then configure the settings in the configs/ directory and train.py file, and add your new tasks to the configs.py file. Then you can run python parallel_runner.py --cfg your_model to run our own model.

To later perform openset experiments on other baseline models (e.g., xxx), create two trainers: etf_xxx.py and openset_etf_xxx.py in trainers/, along with their corresponding config files in configs/. Then add two task configurations to configs.py: xxx_dept_etf and xxx_openpt. Then you can run python parallel_runner.py --cfg xxx_dept_etf, followed by python parallel_runner.py --cfg xxx_openpt, to run the models you just created.

Also, add baselines_xxx to configs.py to test the baseline under OpenPT‑bench. You can run python parallel_runner.py --cfg baselines_xxx.

Acknowledgements

Our code is based on DePT, CaPT and MODE-OOD repositories. If you use our model and code, please consider citing these works as well.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
clip		clip
clip_maple		clip_maple
configs		configs
datasets		datasets
examples		examples
ood_class		ood_class
tests		tests
trainers		trainers
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
configs.py		configs.py
parallel_runner.py		parallel_runner.py
requirements.txt		requirements.txt
templates.py		templates.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenPT: Towards Open-World Prompt Tuning

Abstract

Method

Main Contributions

Flexibility and Effectiveness

Installation

Data preparation

Training and Evaluation

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OpenPT: Towards Open-World Prompt Tuning

Abstract

Method

Main Contributions

Flexibility and Effectiveness

Installation

Data preparation

Training and Evaluation

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages