Bi-directional Masks for Efficient N:M Sparse Training (ICML 2023) (Paper Link)

Requirements

python 3.7
pytorch 1.10.2
torchvision 0.11.3

Training

Training models on ImageNet

ResNet-18

cd CnnModels
python imagenet.py --arch resnet18 --lr 0.1 --data_path PATH_TO_DATASETS --label_smoothing 0.1 --num_epochs 120 --job_dir PATH_TO_JOB_DIR --iter 100 --greedy_num 100

ResNet-50

cd CnnModels
python imagenet.py --arch resnet50 --lr 0.1 --data_path PATH_TO_DATASETS --label_smoothing 0.1 --num_epochs 120 --job_dir PATH_TO_JOB_DIR --iter 100 --greedy_num 100

DeiT-small

cd DeiT
python3 -m torch.distributed.launch --nproc_per_node=4  --use_env main.py --model vit_deit_small_patch16_224 --batch-size 256 --data-path PATH_TO_DATASETS --output_dir PATH_TO_JOB_DIR

Training models on CIFAR

VGG-19

cd CnnModels
python cifar.py --arch vgg19_cifar10 --lr 0.1 --weight_decay 0.001 --data_path PATH_TO_DATASETS --label_smoothing 0.1 --num_epochs 300 --job_dir PATH_TO_JOB_DIR

ResNet-32

cd CnnModels
python cifar.py --arch resnet32_cifar10 --lr 0.1 --weight_decay 0.001 --data_path PATH_TO_DATASETS --label_smoothing 0.1 --num_epochs 300 --job_dir PATH_TO_JOB_DIR

MobileNetV2

cd CnnModels
python cifar.py --arch mobilenetv2 --lr 0.1 --weight_decay 0.001 --data_path PATH_TO_DATASETS --label_smoothing 0.1 --num_epochs 300 --job_dir PATH_TO_JOB_DIR

Testing

We provide our trained models and experiment logs at following Table:

Model	Sparse Pattern	Top1	Top5	Link
ResNet-50	2:4	77.4	93.7	Google Drive
ResNet-50	1:4	75.6	92.7	Google Drive
ResNet-50	2:8	76.3	93.0	Google Drive
ResNet-50	4:8	77.5	93.8	Google Drive
ResNet-50	1:16	71.4	90.1	Google Drive
Deit-small	2:4	77.6	93.8	Google Drive

To test, run:

ResNet-50 on ImageNet

cd CnnModels
python eval.py --arch resnet50 --pretrain_dir PATH_TO_CHECKPOINTS --train_batch_size 256 --eval_batch_size 256  --label_smoothing 0.1 --data_path PATH_TO_DATASETS

DeiT-small on ImageNet

cd DeiT
python3 -m torch.distributed.launch --nproc_per_node=4  --use_env main.py --model vit_deit_small_patch16_224 --batch-size 256 --data-path PATH_TO_DATASETS --output_dir PATH_TO_JOB_DIR --resume PATH_TO_CHECKPOINTS --eval

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
CnnModels		CnnModels
DeiT		DeiT
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bi-directional Masks for Efficient N:M Sparse Training (ICML 2023) (Paper Link)

Requirements

Training

Training models on ImageNet

Training models on CIFAR

Testing

About

Releases

Packages

Contributors 2

Languages

zyxxmu/Bi-Mask

Folders and files

Latest commit

History

Repository files navigation

Bi-directional Masks for Efficient N:M Sparse Training (ICML 2023) (Paper Link)

Requirements

Training

Training models on ImageNet

Training models on CIFAR

Testing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages