
Bag-of-Tricks-and-a-Strong-Baseline-for-Fungi-Fine-Grained-Classification

paper link: http://star.informatik.rwth-aachen.de/Publications/CEUR-WS/Vol-3180/paper-182.pdf


This work is sponsored by the Natural Science Foundation of China (62276242), the CAAI-Huawei MindSpore Open Fund (CAAIXSJLJJ-2021-016B), the Anhui Province Key Research and Development Program (202104a05020007), and the USTC Research Funds of the Double First-Class Initiative (YD2350002001).

1. Environment setting

1.0. Package

  • Several important packages

    • mindspore == 1.8.1
    • mindcv
  • Replace the folder mindcv/ with our mindcv/ folder (we made some changes to the original MindCV framework, such as adding TrivialAugment (TA))

    MindCV is an open-source toolbox for computer vision research and development based on MindSpore.

1.1. Dataset

In this project, we use the large fungi dataset (DF20/DF21) from the FGVC × FUNGI challenge to evaluate performance.

1.2. OS

  • Windows10
  • Ubuntu20.04
  • macOS (CPU only)

2. Train

  • Single GPU Training
  • DataParallel (single machine multi-gpus)
  • DistributedDataParallel

2.1. Data

The training and test data are organized as follows:

├── DF20/
│   ├── img20001.jpg
│   ├── img20002.jpg
│   └── ....
├── DF21/
│   ├── img21001.jpg
│   ├── img21002.jpg
│   └── ....
└──

The training and test sets each come with a CSV file of corresponding labels.
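The CSV labels can be loaded into a filename → class-id mapping before building the dataset. A minimal sketch using only the standard library; the column names `image_path` and `class_id` are assumptions, so check the challenge CSV headers for the real ones:

```python
import csv
from pathlib import Path

def load_labels(csv_path):
    """Map image filename -> integer class id from a label CSV.

    The column names 'image_path' and 'class_id' are illustrative
    assumptions; adapt them to the actual challenge CSV headers.
    """
    labels = {}
    with open(csv_path, newline="") as f:
        for row in csv.DictReader(f):
            labels[Path(row["image_path"]).name] = int(row["class_id"])
    return labels
```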

2.2. Configuration

You can directly modify the YAML files in ./configs/.
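The command-line flags used below (batch size, learning rate, decay rate, etc.) can also be set in the YAML file. The field names in this sketch are illustrative assumptions; check the actual files in ./configs/ for the real keys:

```yaml
# Illustrative config fragment -- key names are assumptions,
# see ./configs/swin_large_384.yaml for the real schema.
model: swin_large_patch4_window12_384
img_size: 384
batch_size: 32
lr: 0.01
decay_rate: 0.9
freeze_layer: 2
```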

2.3. Run

Take the Swin Transformer training process as an example.

  1. Train a basic model, splitting the data into a training set and a validation set at a 9:1 ratio
python train.py ./Fungidata/DF20 -c configs/swin_large_384.yaml \
        --freeze-layer 2 \
        --batch-size 32 \
        --lr 0.01 \
        --decay-rate 0.9 \
        --output ./output/Swin-TF/DF20/freeze_layer_2
  2. Add data augmentation and continue fine-tuning
python train.py ./Fungidata/DF20 -c configs/swin_large_384.yaml \
        --output ./output/Swin-TF/DF20/All_aug/freeze_layer_2 \
        --initial-checkpoint ./output/Swin-TF/DF20/freeze_layer_2/20220430-123449-swin_large_patch4_window12_384-384/Best_Top1-ACC.pth.tar \
        --freeze-layer 2 \
        --lr 0.001 \
        --batch-size 32 \
        --warmup-epochs 0 \
        --cutmix 1 \
        --color-jitter 0.4 \
        --reprob 0.25 \
        --aa trivial \
        --decay-rate 0.9
  3. Modify the loss function and continue fine-tuning
python train.py ./Fungidata/DF20 -c configs/swin_large_384.yaml \
        --output ./output/Swin-TF/DF20/new_loss/freeze_layer_2 \
        --initial-checkpoint ./output/Swin-TF/DF20/All_aug/freeze_layer_2/20220430-123449-swin_large_patch4_window12_384-384/Best_Top1-ACC.pth.tar \
        --freeze-layer 2 \
        --lr 0.001 \
        --batch-size 32 \
        --warmup-epochs 0 \
        --cutmix 1 \
        --color-jitter 0.4 \
        --reprob 0.25 \
        --aa trivial \
        --decay-rate 0.9 \
        --Focalloss
  4. Fine-tune on the full dataset
python train_all.py ./Fungidata/DF20 -c configs/swin_large_384.yaml \
         --batch-size 32 \
         --img-size 384 \
         --output ./output/Swin-TF/DF20/All_data/swin_large_384 \
         --freeze-layer 2 \
         --initial-checkpoint ./output/Swin-TF/DF20/new_loss/freeze_layer_2/20220502-114033-swin_large_patch4_window12_384-384/Best_Top1-ACC.pth.tar \
         --lr 0.001 \
         --cutmix 1 \
         --color-jitter 0.4 \
         --reprob 0.25 \
         --aa trivial \
         --decay-rate 0.1 \
         --warmup-epochs 0 \
         --epochs 24 \
         --sched multistep \
         --checkpoint-hist 24 \
         --Focalloss
  5. Two-stage training
python train_all.py ./Fungidata/DF20 -c configs/swin_large_384.yaml \
         --batch-size 32 \
         --img-size 384 \
         --output ./output/Swin-TF/DF20/two_stage/swin_large_384 \
         --freeze-layer 2 \
         --initial-checkpoint ./output/Swin-TF/DF20/All_data/freeze_layer_2/20220504-115867-swin_large_patch4_window12_384-384/Best_Top1-ACC.pth.tar \
         --lr 0.001 \
         --cutmix 1 \
         --color-jitter 0.4 \
         --reprob 0.25 \
         --aa trivial \
         --decay-rate 0.1 \
         --warmup-epochs 0 \
         --epochs 5 \
         --sched multistep \
         --Focalloss
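The `--Focalloss` flag in steps 3-5 switches training from plain cross-entropy to a focal loss, which down-weights easy examples so the model focuses on hard, rare classes. A minimal NumPy sketch of the idea; the repo's actual implementation and its `gamma`/`alpha` values may differ:

```python
import numpy as np

def focal_loss(logits, targets, gamma=2.0, alpha=0.25):
    """Multi-class focal loss (NumPy sketch).

    logits:  (N, C) raw scores
    targets: (N,)   integer class ids
    gamma/alpha defaults follow Lin et al.; the --Focalloss
    implementation in this repo may use different values.
    """
    # numerically stable softmax
    z = logits - logits.max(axis=1, keepdims=True)
    p = np.exp(z) / np.exp(z).sum(axis=1, keepdims=True)
    pt = p[np.arange(len(targets)), targets]  # probability of the true class
    # (1 - pt)^gamma shrinks the loss for well-classified examples
    return float((-alpha * (1.0 - pt) ** gamma * np.log(pt)).mean())
```

Compared with cross-entropy, confident correct predictions contribute almost nothing, so gradient signal concentrates on the long tail of the fungi classes.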

2.4. Multi-GPU training

Modify train.sh:

python -m mindspore.distributed.launch --nproc_per_node=4 train_all.py

3. Evaluation

For details, see test.sh:

sh test.sh

The sh test.sh command runs the inference script (it uses the checkpoints hosted on Google Drive; please download them as described in 5. Challenge's final checkpoints and logits).

4. Model Ensemble

Run model_ensemble.ipynb.

Run 3. Evaluation first to produce the logits files required by model_ensemble.ipynb (you can also download them directly from Google Drive; see 5. Challenge's final checkpoints and logits for details).
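Ensembling here amounts to combining the per-model logits and taking the argmax. A minimal NumPy sketch of an (optionally weighted) logit average; the notebook's actual combination scheme may differ:

```python
import numpy as np

def ensemble_logits(logit_arrays, weights=None):
    """Combine per-model logits into final predictions.

    logit_arrays: list of M arrays, each (N, C)
    weights:      optional per-model weights; defaults to a uniform average.
    Returns the (N,) array of predicted class ids.
    """
    stack = np.stack(logit_arrays)                # (M, N, C)
    if weights is None:
        weights = np.ones(len(stack)) / len(stack)
    avg = np.tensordot(weights, stack, axes=1)    # weighted mean -> (N, C)
    return avg.argmax(axis=1)
```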

5. Challenge's final checkpoints and logits

They can be downloaded from Google Drive: https://drive.google.com/file/d/1v9SlsjXXKI5kizg9BTMWfA9faaawAgpv/view?usp=sharing

They can be used directly for model-ensemble inference.

Acknowledgment

  • Thanks to mindcv for the MindSpore implementation.

About

The 2nd place in the CVPR2022 FGVC × FUNGI challenge
