This paper aims to develop a network that outperforms not only canonical transformers but also high-performance convolutional models. We propose a new transformer-based hybrid network that takes advantage of transformers to capture long-range dependencies and of CNNs to model local features. Furthermore, we scale it to obtain a family of models, called CMTs, which achieve much better accuracy and efficiency than previous convolution- and transformer-based models.
Paper: Jianyuan Guo, Kai Han, Han Wu, Chang Xu, Yehui Tang, Chunjing Xu, Yunhe Wang. CMT: Convolutional Neural Networks Meet Vision Transformers. Accepted at CVPR 2022.
A block of CMT is illustrated in fig/CMT.PNG.
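As described in the CMT paper, each block stacks a local perception unit (LPU: a 3×3 depthwise convolution with a residual connection), a lightweight multi-head self-attention, and an inverted residual FFN. Below is a minimal NumPy sketch of the LPU alone; the residual form `x + DWConv(x)` follows the paper, while the zero-padding detail is an assumption, and the actual implementation lives in src/cmt.py.

```python
import numpy as np

def lpu(x, w):
    """Local perception unit: x + DWConv3x3(x).

    x: feature map of shape (C, H, W)
    w: per-channel 3x3 depthwise kernels, shape (C, 3, 3)
    """
    C, H, W = x.shape
    # Zero-pad by 1 so the depthwise conv preserves spatial size (assumed).
    xp = np.pad(x, ((0, 0), (1, 1), (1, 1)))
    out = np.zeros_like(x)
    for c in range(C):          # depthwise: each channel has its own kernel
        for i in range(H):
            for j in range(W):
                out[c, i, j] = np.sum(xp[c, i:i + 3, j:j + 3] * w[c])
    return x + out              # residual connection
```

With an identity kernel (center weight 1), the depthwise conv returns the input unchanged, so the LPU output is exactly `2 * x` — a quick sanity check on the residual structure.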
Dataset used: [ImageNet2012]
- Dataset size: 224×224 color images in 1,000 classes
- Train: 1,281,167 images
- Test: 50,000 images
- Data format: JPEG
- Note: Data will be processed in dataset.py
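The exact pipeline is defined in src/dataset.py; as a hedged sketch, standard ImageNet evaluation preprocessing center-crops to 224×224 and normalizes with the usual ImageNet channel statistics. The mean/std constants below are the common ImageNet values and are an assumption about what dataset.py uses:

```python
import numpy as np

# Common ImageNet normalization constants (assumed; see src/dataset.py
# for the values actually used by this repo).
MEAN = np.array([0.485, 0.456, 0.406])
STD = np.array([0.229, 0.224, 0.225])

def center_crop(img, size=224):
    """Crop the central size x size patch from an (H, W, C) image."""
    h, w = img.shape[:2]
    top, left = (h - size) // 2, (w - size) // 2
    return img[top:top + size, left:left + size]

def preprocess(img):
    """Center-crop a uint8 (H, W, 3) image and normalize per channel."""
    img = center_crop(img)
    return (img / 255.0 - MEAN) / STD
```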
- Hardware (Ascend/GPU)
- Prepare hardware environment with an Ascend or GPU processor.
- Framework
- For more information, please check the resources below:
CMT
├── eval.py # inference entry
├── fig
│ └── CMT.PNG # the illustration of CMT network
├── readme.md # Readme
└── src
├── dataset.py # dataset loader
└── cmt.py # CMT network
After installing MindSpore via the official website, you can start evaluation as follows:
# CMT infer example
GPU: python eval.py --model cmt --dataset_path [DATASET_PATH] --platform GPU --checkpoint_path [CHECKPOINT_PATH]
The checkpoint can be downloaded at https://download.mindspore.cn/model_zoo/.
result: {'acc': 0.832} ckpt= ./cmt_s_ms.ckpt
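The reported `acc` is top-1 accuracy on the ImageNet validation set. As a minimal sketch (the hypothetical `top1_accuracy` helper below is for illustration, not a function from this repo), it is simply the fraction of samples whose highest-scoring logit matches the label:

```python
import numpy as np

def top1_accuracy(logits, labels):
    """Fraction of rows where argmax over classes equals the label.

    logits: (N, num_classes) array of model scores
    labels: (N,) array of integer class ids
    """
    preds = logits.argmax(axis=1)
    return float((preds == labels).mean())
```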
In dataset.py, we set the seed inside the "create_dataset" function. A random seed is also set in train.py.
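Fixing seeds typically looks like the sketch below; the `set_seed` helper is hypothetical (the actual seeding code lives in dataset.py and train.py), and MindSpore additionally provides `mindspore.set_seed` for framework-level randomness:

```python
import random

import numpy as np

def set_seed(seed):
    """Fix Python and NumPy RNG seeds for reproducibility (sketch).

    In a MindSpore script, mindspore.set_seed(seed) would also be
    called here to seed framework-level ops.
    """
    random.seed(seed)
    np.random.seed(seed)
```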
Please check the official homepage.