- [2024/6] Code of SpikeZip-TF is released!
This is the official project repository for the following paper. If you find this repository helpful, please cite:
```
@inproceedings{spikeziptf2024,
  title={SpikeZIP-TF: Conversion is All You Need for Transformer-based SNN},
  author={You, Kang= and Xu, Zekai= and Nie, Chen and Deng, Zhijie and Wang, Xiang and Guo, Qinghai and He, Zhezhi},
  booktitle={Forty-first International Conference on Machine Learning (ICML)},
  year={2024}
}
```
🔥 `=` indicates equal contribution.
## Train the Quantized-ANN from a pre-trained model
The following table provides the pre-trained checkpoints used in the paper:
| | ViT-Small-ReLU | ViT-Base-ReLU | ViT-Large-ReLU |
| --- | --- | --- | --- |
| pre-trained checkpoint | download | download | download |
| md5 | 929f93b | 8d49104 | 91bded0 |
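To verify a download against the `md5` row, compare the first seven hex characters of the file's MD5 digest. A minimal sketch (the checkpoint filename in the comment is a placeholder, not the actual release name):

```python
import hashlib

def md5_prefix_bytes(data: bytes, length: int = 7) -> str:
    """First `length` hex characters of the MD5 digest of `data`."""
    return hashlib.md5(data).hexdigest()[:length]

def md5_prefix_file(path: str, length: int = 7) -> str:
    """Streamed MD5 prefix, suitable for multi-GB checkpoint files."""
    h = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()[:length]

# e.g. md5_prefix_file("vit_small_relu.pth") should return "929f93b"
# for the ViT-Small-ReLU checkpoint (hypothetical filename).
```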
Prepare the ImageNet dataset and run the script below:
```shell
NCCL_P2P_DISABLE=1 OMP_NUM_THREADS=1 CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.run --nproc_per_node=4 --master_port='29500' main_finetune_distill.py \
    --accum_iter 4 \
    --batch_size 64 \
    --model vit_small_patch16 \
    --model_teacher vit_small_patch16 \
    --finetune checkpoint-path \
    --pretrain_teacher checkpoint-path \
    --epochs 100 \
    --blr 1.5e-4 --layer_decay 0.65 --warmup_epochs 0 \
    --weight_decay 0.05 --drop_path 0.1 --drop_rate 0.0 --mixup 0.8 --cutmix 1.0 --reprob 0.25 \
    --dist_eval --data_path dataset-path --output_dir output_path --log_dir log_path \
    --mode "QANN_QAT" --level 16 --act_layer relu --act_layer_teacher relu --temp 2.0 --wandb --print_freq 200 --define_params --mean 0.5 0.5 0.5 --std 0.5 0.5 0.5
```
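Here `--level 16` sets the number of discrete activation states used during quantization-aware training. As a rough sketch of what a 16-level quantized ReLU computes (illustrative only; the repo's actual layer learns the clipping scale `s` during QAT and uses straight-through gradients, both omitted here):

```python
def quantize_relu(x: float, s: float = 1.0, level: int = 16) -> float:
    """Quantized ReLU: clip x to [0, s], then round to one of
    `level` uniform steps. A toy scalar version of QAT activation
    quantization; s is fixed here but learned in practice."""
    step = s / level                       # width of one quantization bin
    q = max(0, min(level, round(x / step)))  # integer state in [0, level]
    return q * step
```

With `s = 1.0` and 16 levels, inputs are snapped to multiples of 1/16 and anything above 1.0 saturates, which is what makes the later lossless SNN conversion possible.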
## Convert your QANN model to an SNN
The following table provides the pre-trained QANN used in the paper:
| | ViT-Small-ReLU-Q32 | ViT-Base-ReLU-Q32 | ViT-Large-ReLU-Q32 |
| --- | --- | --- | --- |
| pre-trained checkpoint | download | download | download |
| md5 | 8207d3e | 7edba1d | d83936c |
Prepare the ImageNet dataset and run the script below:
```shell
NCCL_P2P_DISABLE=1 OMP_NUM_THREADS=1 CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch --nproc_per_node=4 --master_port='29501' main_finetune.py \
    --accum_iter 4 \
    --batch_size 32 \
    --model vit_small_patch16 \
    --finetune QANN-checkpoint-path \
    --resume QANN-checkpoint-path \
    --epochs 100 \
    --blr 3.536e-4 --layer_decay 0.65 \
    --weight_decay 0.05 --drop_path 0.1 --drop_rate 0.0 --mixup 0.8 --cutmix 1.0 --reprob 0.25 \
    --dist_eval --data_path dataset-path --output_dir output_path --log_dir log_path \
    --mode "SNN" --act_layer relu --eval --ratio 0.5 --time_step 64 --encoding_type analog --level 16 --weight_quantization_bit 32 --define_params --mean 0.5 0.5 0.5 --std 0.5 0.5 0.5
```
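The conversion rests on the equivalence between a quantized ReLU and an integrate-and-fire (IF) neuron run for enough time steps: the neuron's spike count, scaled by its threshold, reproduces the quantized activation. A toy scalar illustration of that idea (the repo's neuron implementation differs in details; the half-threshold initial charge below is a common trick, assumed here, that turns rate coding's floor into rounding):

```python
def snn_activation(x: float, threshold: float = 1.0, T: int = 16) -> float:
    """Simulate an IF neuron driven by a constant input x for T steps
    and decode its spike count back into an activation value.
    With T equal to the quantization level, this matches a T-level
    quantized ReLU clipped to [0, threshold]."""
    v, spikes = threshold / 2, 0   # half-threshold init -> rounding
    for _ in range(T):
        v += x                     # integrate the input current
        if v >= threshold:
            v -= threshold         # soft reset keeps residual charge
            spikes += 1            # at most one spike per time step
    return spikes * threshold / T  # decoded (quantized) activation
```

At most one spike can fire per step, so the decoded value saturates at `threshold`, mirroring the clipping of the quantized ReLU.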
## Evaluate the energy of your SNN model

Using a QANN you trained yourself or one from the pre-trained QANN table above, run the script below to evaluate the energy of your SNN model:
```shell
CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.launch --nproc_per_node=1 --master_port='29501' main_finetune.py \
    --accum_iter 4 \
    --batch_size 4 \
    --model vit_small_patch16 \
    --finetune /data1/vit-small-imagenet-relu-q16-80.45.pth \
    --resume /data1/vit-small-imagenet-relu-q16-80.45.pth \
    --epochs 100 \
    --blr 3.536e-4 --layer_decay 0.65 \
    --weight_decay 0.05 --drop_path 0.1 --drop_rate 0.0 --mixup 0.8 --cutmix 1.0 --reprob 0.25 \
    --dist_eval --data_path /data1/ImageNet/ --output_dir /home/kang_you/SpikeZIP_transformer/output/ --log_dir /home/kang_you/SpikeZIP_transformer/output \
    --mode "SNN" --act_layer relu --eval --energy_eval --time_step 32 --encoding_type rate --level 16 --weight_quantization_bit 32 --define_params --mean 0.5 0.5 0.5 --std 0.5 0.5 0.5
```
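SNN energy estimates usually compare multiply-accumulate (MAC) operations in the ANN against accumulate-only synaptic operations (SOPs) in the SNN, which fire only when a spike arrives. A sketch of that accounting, using the widely cited 45 nm per-op energy figures (Horowitz, ISSCC 2014); note these constants and the formula are common conventions assumed here, and `--energy_eval` may use different values:

```python
# Assumed 45 nm energy costs per operation (Horowitz, 2014), in joules.
E_MAC = 4.6e-12  # 32-bit multiply-accumulate (ANN)
E_AC = 0.9e-12   # 32-bit accumulate (SNN synaptic op)

def ann_energy(macs: float) -> float:
    """Energy of an ANN forward pass with the given MAC count."""
    return macs * E_MAC

def snn_energy(syn_ops: float, firing_rate: float, time_steps: int) -> float:
    """Energy of the SNN counterpart: each synaptic connection costs
    one accumulate per incoming spike, so SOPs = ops * rate * T."""
    return syn_ops * firing_rate * time_steps * E_AC
```

For example, at a 10% firing rate over 32 time steps the SNN's per-op advantage (0.9 pJ vs 4.6 pJ) is traded against the `rate * T` repetition factor, which is why low firing rates and short time windows matter for the energy numbers.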