Zero-shot Improvement of Object Counting with CLIP

Ruisu Zhang*, Yicong Chen*, Kangwook Lee

This is the code repository for paper Zero-shot Improvement of Object Counting with CLIP (link) accepted by R0-FoMo Workshop in Neurips 2023.

Abstract

We focus on the object counting limitations of vision-language models, with a particular emphasis on Contrastive Language-Image Pre-Training (CLIP) models. We assess the counting performance of CLIP using a custom dataset, which uncovers significant variations across diverse objects. To address this, we introduce a zero-shot, training-free method aimed at improving counting accuracy by manipulating the text embedding space of CLIP. Through comprehensive experiments, we demonstrate that our method not only enhances the counting capabilities of CLIP but also boosts the performance of text-to-image generative models like Stable Diffusion, particularly in generating images with precise object counts.

Reproduce experiment results

Install required packaged

Create your own virtual environment using a python environment > 3.6

conda create -y -n ENV_NAME
conda activate ENC_NAME
cd $CODE_DIR
pip install -r requirements.txt

CLIP experiments

Run python run.py with specified configurations:

-m, --model: choose CLIP model from ["clip_base_32","clip_base_16","clip_large_14"];
-d, --dataset: choose dataset from ["custom","countbench"];
-t, --task: choose task from ["classification","image_retrievel"];
-r, --ref_obj: specify the name of reference object in a string format;

Stable Diffusion experiments

Run python run.py with specified configurations:

-m, --model: set as "stable_diffusion";
-t, --task: set as "image_gen";
-r, --ref_obj: specify the name of reference object in a string format;

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
clip.py		clip.py
clip_count_utils.py		clip_count_utils.py
data_aug.py		data_aug.py
requirements.txt		requirements.txt
run.py		run.py
sd.py		sd.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Zero-shot Improvement of Object Counting with CLIP

Abstract

Reproduce experiment results

Install required packaged

CLIP experiments

Stable Diffusion experiments

About

Releases

Packages

Contributors 3

Languages

License

UW-Madison-Lee-Lab/CLIP_Counting

Folders and files

Latest commit

History

Repository files navigation

Zero-shot Improvement of Object Counting with CLIP

Abstract

Reproduce experiment results

Install required packaged

CLIP experiments

Stable Diffusion experiments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages