
OAMixer: Object-aware Mixing Layer for Vision Transformers

Official PyTorch implementation of "OAMixer: Object-aware Mixing Layer for Vision Transformers" (CVPRW 2022) by Hyunwoo Kang*, Sangwoo Mo*, and Jinwoo Shin.

Our code is heavily built upon the DeiT and timm repositories. We use a newer version of timm than DeiT does in order to borrow the updated mixer implementations. Our main contributions are in (a) the models directory, which defines the base masked model class and its specific instantiations for ViT, MLP-Mixer, and ConvMixer, and (b) the transforms directory, which defines the paired transformations of an image and its corresponding patch labels (e.g., BigBiGAN, ReLabel).
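To make the paired transformation idea concrete, here is a minimal sketch of how an image and its pixel-level label map can be cropped with the same sampled parameters so that patch labels stay spatially aligned. The class name and interface are illustrative, not the repository's actual API:

```python
import random
import torchvision.transforms.functional as TF

class PairedRandomCrop:
    """Illustrative paired transform (not the repo's actual API):
    sample crop parameters once and apply them to both the image and
    its pixel-level label map, so the patch labels stay aligned."""

    def __init__(self, size):
        self.size = size

    def __call__(self, image, label_map):
        # Sample a random square crop inside the image (PIL inputs assumed).
        top = random.randint(0, image.height - self.size)
        left = random.randint(0, image.width - self.size)
        image = TF.crop(image, top, left, self.size, self.size)
        # Apply the identical crop to the label map, so every output
        # pixel of the image still has its matching label.
        label_map = TF.crop(label_map, top, left, self.size, self.size)
        return image, label_map
```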

Installation

Install the required libraries.

pip install -r requirements.txt

Create patch labels

Create BigBiGAN patch labels. You can download the pretrained U-Net weights (e.g., trained on ImageNet) from the original repository, then place them in patch_models/pretrained.

python generate_mask.py --data-set [DATASET] --output_dir [OUTPUT_PATH]
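Conceptually, the pixel-level foreground mask is reduced to one label per patch of the Vision Transformer's input grid. A minimal sketch of such a reduction, assuming a binary mask and 16x16 patches (the function name and threshold are illustrative, not the repository's exact code):

```python
import torch
import torch.nn.functional as F

def mask_to_patch_labels(mask, patch_size=16, threshold=0.5):
    """Reduce a pixel-level binary mask (H, W) to per-patch labels by
    average-pooling over each patch and thresholding the foreground
    ratio. Returns a (H/patch_size, W/patch_size) tensor of {0, 1}."""
    mask = mask.float().unsqueeze(0).unsqueeze(0)       # (1, 1, H, W)
    ratio = F.avg_pool2d(mask, kernel_size=patch_size)  # foreground ratio per patch
    return (ratio.squeeze() > threshold).long()

# Example: a 224x224 mask becomes a 14x14 grid of patch labels.
mask = torch.zeros(224, 224)
mask[64:160, 64:160] = 1
patch_labels = mask_to_patch_labels(mask)  # shape (14, 14)
```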

Create ReLabel patch labels.

python3 generate_label.py [DATASET_PATH] [OUTPUT_PATH] --model dm_nfnet_f6 --pretrained --img-size 576 -b 32 --crop-pct 1.0
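Unlike the binary BigBiGAN masks, ReLabel provides dense, machine-annotated class-score maps, which are likewise pooled to the patch grid. A rough sketch of the idea with illustrative shapes (the actual ReLabel storage format, e.g., compressed top-k score maps, differs):

```python
import torch.nn.functional as F

def label_map_to_patch_labels(score_map, grid_size=14):
    """Resize a dense class-score map (C, H', W') to the patch grid and
    keep a soft score distribution per patch. Shapes and names are
    illustrative, not the actual ReLabel format."""
    score_map = score_map.unsqueeze(0)  # (1, C, H', W')
    patch_scores = F.interpolate(score_map, size=grid_size,
                                 mode='bilinear', align_corners=False)
    return patch_scores.squeeze(0)      # (C, grid_size, grid_size)
```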

Training

Train a baseline model.

python -m torch.distributed.launch --nproc_per_node=4 --use_env main.py \
 --model deit_t --batch-size 64 --data-set imagenet --output_dir [OUTPUT_PATH]

Apply OAMixer to the baseline model.

[BASE_CODE_ABOVE] --mask-attention --patch-label relabel
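Conceptually, the --mask-attention option reweights the token-mixing (attention) matrix by patch-label similarity, so tokens attend more strongly to patches of the same object. A heavily simplified sketch of this idea; the similarity function, temperature, and names are illustrative rather than the repository's exact implementation:

```python
import torch

def object_aware_reweight(attn, patch_labels, temperature=1.0):
    """attn: (B, heads, N, N) attention matrix over N patch tokens.
    patch_labels: (B, N, C) soft object-label distribution per patch.
    Builds a pairwise label-similarity mask, reweights the attention
    with it, then renormalizes each row. Illustrative sketch only."""
    # Pairwise similarity between patch-label distributions: (B, N, N).
    sim = torch.bmm(patch_labels, patch_labels.transpose(1, 2))
    mask = torch.exp(sim / temperature).unsqueeze(1)  # broadcast over heads
    attn = attn * mask
    return attn / attn.sum(dim=-1, keepdim=True)      # rows sum to 1 again
```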

Apply TokenLabeling (to both the baseline model and OAMixer).

[BASE_CODE_ABOVE] --token-label
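TokenLabeling supplements the usual class-token loss with a per-patch classification loss that uses the patch labels as dense targets. A minimal sketch, where the weighting coefficient lam is illustrative, not the repository's value:

```python
import torch.nn.functional as F

def token_labeling_loss(cls_logits, patch_logits, target, patch_labels, lam=0.5):
    """cls_logits: (B, C) class-token predictions; target: (B,) labels.
    patch_logits: (B, N, C) per-patch predictions; patch_labels: (B, N)
    per-patch targets. Combines the global loss with the mean per-patch
    loss. Illustrative sketch only."""
    global_loss = F.cross_entropy(cls_logits, target)
    patch_loss = F.cross_entropy(patch_logits.flatten(0, 1),
                                 patch_labels.flatten())
    return global_loss + lam * patch_loss
```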

Inference

python main.py --eval --model deit_t --data-set imagenet --resume [OUTPUT_PATH]/checkpoint.pth 
