CrossMoST: Cross-Modal Self-Training: Aligning Images and Pointclouds to learn Classification without Labels

Official implementation of Cross-Modal Self-Training: Aligning Images and Point Clouds to learn Classification without Labels

What is CrossMoST

It is an optimization framework to improve the label-free classification performance of a zero-shot 3D vision model by leveraging unlabeled 3D data and their accompanying 2D views. We implement a student-teacher framework to simultaneously process 2D views and 3D point clouds and generate joint pseudo labels to train a classifier and guide cross-model feature alignment.

Pipeline

Instructions

[Install environments]

We trained our models on 4 Nvidia V100 GPUs, the code is tested with CUDA==11.0 and pytorch==1.10.1
conda create -n crossmost python=3.7.15
conda activate crossmost
conda install pytorch==1.10.1 torchvision==0.11.2 torchaudio==0.10.1 cudatoolkit=11.3 -c pytorch -c conda-forge
pip install -r requirements.txt

[Download datasets and initialize models, put them in the right paths.]

Download the used datasets and initialize models from here. For now, you ONLY need to download "modelnet40_normal_resampled", and "shapenet-55".
The data folder should have the following structure:

./data |
-- co3d |
-- modelnet40_rendered |
-- modelnet40_ply_hdf5_2048 |
-- redwood |
-- scanobjectnn |
.
.
-- [dataset].yaml
.
.
-- data_transforms.py 
-- dataset_3d.py 
-- dataset_catalog.json 
-- labels.json 
-- templates.json 
-- utils.py

Once you have downloaded and unzipped the datasets,

# Change the data paths in the config files
./data/[dataset].yaml

Then, download the Shapenet-pretrained backbones and the DVAE for the point-transformer.

./checkpoints |
-- dVAE.pth 
-- ulip-june11-checkpoint_best.pt

[Zero-shot evaluation of Shapenet-pretrained backbones]

Please change the script to accommodate your system accordingly, this script is used to train on 4 gpus by default. You can also modify the desired output folder in the script.

# the scripts are named by its correspoinding 3D backbone name.
bash ./run_zs_eval_modelnet.sh

adjust the bash script accordingly to run evaluations for other datasets.

[Training CrossMoST]

bash ./run_crossmost_train_modelnet.sh

You can also run the baseline-self training

bash ./run_baseline_train_modelnet.sh

adjust the bash script accordingly to run evaluations for other datasets.

Checkpoints for evaluating Baseline Self-training vs CrossMoST

You can download the checkpoints of the CrossMoST and our baselines from here and put them in the corresponding directories.

./checkpoints |
-- dVAE.pth 
-- ulip-june11-checkpoint_best.pt 
-- co3d_baseline |
    -- checkpoint-best.pth |
.
.
.
-- scobjwbg_crossmost |
    -- checkpoint-best.pth

To run the evaluation on the provided checkpoints,

bash ./run_crossmost_eval_modelnet.sh

You can also run the baseline-self training

bash ./run_baseline_eval_modelnet.sh

adjust the bash script accordingly to run evaluations for other datasets.

Acknowledgemets

Our code borrows heavily from MUST repository. If you use our model, please consider citing them as well.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Assets		Assets
checkpoints		checkpoints
configs		configs
data		data
models		models
outputs		outputs
utils		utils
README.md		README.md
ema.py		ema.py
engine_self_training.py		engine_self_training.py
engine_self_training_3D.py		engine_self_training_3D.py
optim_factory.py		optim_factory.py
requirements.txt		requirements.txt
run_baseline_eval_modelnet.sh		run_baseline_eval_modelnet.sh
run_baseline_train_modelnet.sh		run_baseline_train_modelnet.sh
run_crossmost_eval_modelnet.sh		run_crossmost_eval_modelnet.sh
run_crossmost_train_modelnet.sh		run_crossmost_train_modelnet.sh
run_zs_eval_modelnet.sh		run_zs_eval_modelnet.sh
train_3modal_ULIP.py		train_3modal_ULIP.py
train_CrossMoST_co3d.py		train_CrossMoST_co3d.py
train_CrossMoST_modelnet10.py		train_CrossMoST_modelnet10.py
train_CrossMoST_modelnet40.py		train_CrossMoST_modelnet40.py
train_CrossMoST_modelnet40depth.py		train_CrossMoST_modelnet40depth.py
train_CrossMoST_redwood.py		train_CrossMoST_redwood.py
train_CrossMoST_scanobject.py		train_CrossMoST_scanobject.py
train_CrossMoST_scanobject_hardest.py		train_CrossMoST_scanobject_hardest.py
train_CrossMoST_scanobject_withbg.py		train_CrossMoST_scanobject_withbg.py

theamaya/CrossMoST

Folders and files

Latest commit

History

Repository files navigation

CrossMoST: Cross-Modal Self-Training: Aligning Images and Pointclouds to learn Classification without Labels

What is CrossMoST

Pipeline

Instructions

[Install environments]

[Download datasets and initialize models, put them in the right paths.]

[Zero-shot evaluation of Shapenet-pretrained backbones]

[Training CrossMoST]

Checkpoints for evaluating Baseline Self-training vs CrossMoST

Acknowledgemets

Citation

About

Resources

Stars

Watchers

Forks

Languages