Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering

CVPR 2024


The flow chart of Multi-MaP: Multi-MaP obtains multiple clustering results based on the high-level concepts from users and the reference words from GPT-4.

Requirements

We recommend Linux for performance and compatibility reasons.
1 NVIDIA GPUs. We developed and trained the model using RTX 2080 Ti (11GB).
PyTorch >= 1.11

Getting started

Datasets

Furit
Furit360
Cards

Please refer to http://faculty.washington.edu/juhuah/images/AugDMC_datasets.zip

Training and evaluation

Fruit dataset

python main.py --dataset fruit --lr 0.005 --alpha 0.3 --beta 0.4 --weight_decay 0.00005

Fruit360 dataset

python main.py --dataset fruit360 --lr 0.01 --alpha 0.1 --beta 0.3 --weight_decay 0.0

Cards dataset

python main.py --dataset cards --lr 0.005 --alpha 0.2 --beta 0.3 --weight_decay 0.00001

Bibtex

Please cite our paper if you use this code in your own work:

@article{yao2024multi,
  title={Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering},
  author={Yao, Jiawei and Qian, Qi and Hu, Juhua},
  journal={arXiv preprint arXiv:2404.15655},
  year={2024}
}

Acknowledgement

This research is supported in part by Advata Gift funding. All opinions, findings, conclusions and recommendations in this paper are those of the author and do not necessarily reflect the views of the funding agencies.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
clip		clip
README.md		README.md
main.py		main.py
parse.py		parse.py
teaser.jpg		teaser.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clip

clip

README.md

README.md

main.py

main.py

parse.py

parse.py

teaser.jpg

teaser.jpg

Repository files navigation

Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering

Requirements

Getting started

Datasets

Training and evaluation

Bibtex

Acknowledgement

About

Releases

Packages

Languages

Alexander-Yao/Multi-MaP

Folders and files

Latest commit

History

Repository files navigation

Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering

Requirements

Getting started

Datasets

Training and evaluation

Bibtex

Acknowledgement

About

Resources

Stars

Watchers

Forks

Languages