Self-attention generative adversarial networks

Introduction

SAGAN (ICML'2019)

```latex
@inproceedings{zhang2019self,
  title={Self-attention generative adversarial networks},
  author={Zhang, Han and Goodfellow, Ian and Metaxas, Dimitris and Odena, Augustus},
  booktitle={International conference on machine learning},
  pages={7354--7363},
  year={2019},
  organization={PMLR},
  url={https://proceedings.mlr.press/v97/zhang19d.html},
}
```

Results from our SAGAN trained on CIFAR10

Results and models

| Models | Dataset | Inplace ReLU | dist_step | Total Batchsize (BZ_PER_GPU \* NGPU) | Total Iters\* | Iter | IS | FID | Config | Download | Log |
| :--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| SAGAN-32x32-woInplaceReLU Best IS | CIFAR10 | w/o | 5 | 64x1 | 500000 | 400000 | 9.3217 | 10.5030 | config | model | Log |
| SAGAN-32x32-woInplaceReLU Best FID | CIFAR10 | w/o | 5 | 64x1 | 500000 | 480000 | 9.3174 | 9.4252 | config | model | Log |
| SAGAN-32x32-wInplaceReLU Best IS | CIFAR10 | w | 5 | 64x1 | 500000 | 380000 | 9.2286 | 11.7760 | config | model | Log |
| SAGAN-32x32-wInplaceReLU Best FID | CIFAR10 | w | 5 | 64x1 | 500000 | 460000 | 9.2061 | 10.7781 | config | model | Log |
| SAGAN-128x128-woInplaceReLU Best IS | ImageNet | w/o | 1 | 64x4 | 1000000 | 980000 | 31.5938 | 36.7712 | config | model | Log |
| SAGAN-128x128-woInplaceReLU Best FID | ImageNet | w/o | 1 | 64x4 | 1000000 | 950000 | 28.4936 | 34.7838 | config | model | Log |
| SAGAN-128x128-BigGAN Schedule Best IS | ImageNet | w/o | 1 | 32x8 | 1000000 | 826000 | 69.5350 | 12.8295 | config | model | Log |
| SAGAN-128x128-BigGAN Schedule Best FID | ImageNet | w/o | 1 | 32x8 | 1000000 | 826000 | 69.5350 | 12.8295 | config | model | Log |

\* The iteration counting rule in our implementation differs from that of other codebases. If you want to align with other codebases, you can use the following conversion formula:

```
total_iters (biggan/pytorch studio gan) = our_total_iters / dist_step
```
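The conversion above can be sketched as a one-line helper; the function name and the worked example are illustrative, using the `dist_step` and `Total Iters` values from the table:

```python
# Hedged sketch: convert our iteration count to the counting rule used by
# BigGAN / PyTorch-StudioGAN. `dist_step` is the column of the same name
# in the results table above.
def convert_total_iters(our_total_iters: int, dist_step: int) -> int:
    """Map our iteration count to the BigGAN/StudioGAN convention."""
    return our_total_iters // dist_step


# e.g. the CIFAR10 configs above: 500000 iters with dist_step=5
print(convert_total_iters(500_000, 5))  # → 100000
```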

We also provide pre-trained models converted from PyTorch-StudioGAN. Note that PyTorch-StudioGAN uses inplace ReLU in both the generator and the discriminator.

| Models | Dataset | Inplace ReLU | n_disc | Total Iters | IS (Our Pipeline) | FID (Our Pipeline) | IS (StudioGAN) | FID (StudioGAN) | Config | Download | Original Download link |
| :--- | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| SAGAN-32x32 StudioGAN | CIFAR10 | w | 5 | 100000 | 9.116 | 10.2011 | 8.680 | 14.009 | Config | model | model |
| SAGAN-128x128 StudioGAN | ImageNet | w | 1 | 1000000 | 27.367 | 40.1162 | 29.848 | 34.726 | Config | model | model |

- `Our Pipeline` denotes results evaluated with our pipeline.
- `StudioGAN` denotes results released by PyTorch-StudioGAN.

For the IS metric, our implementation differs from PyTorch-StudioGAN in the following aspects:

  1. We use Tero's Inception for feature extraction.
  2. We use bicubic interpolation with the PIL backend to resize images before feeding them to Inception.
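The resize step in point 2 can be sketched as follows; the function name and target size are illustrative assumptions, not the repo's actual API (299 is the standard Inception input resolution):

```python
# Hedged sketch of the preprocessing described above: bicubic resize with
# the PIL backend before feature extraction. `resize_for_inception` is a
# hypothetical helper, not a function from this codebase.
import numpy as np
from PIL import Image


def resize_for_inception(img: np.ndarray, size: int = 299) -> np.ndarray:
    """Resize an HxWx3 uint8 image to (size, size) using PIL bicubic."""
    pil_img = Image.fromarray(img)
    pil_img = pil_img.resize((size, size), resample=Image.BICUBIC)
    return np.asarray(pil_img)


img = np.zeros((32, 32, 3), dtype=np.uint8)  # a CIFAR10-sized dummy image
print(resize_for_inception(img).shape)  # (299, 299, 3)
```

Resizing with a different backend (e.g. OpenCV) or interpolation mode changes the extracted features slightly, which is why IS numbers are not directly comparable across codebases.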

For FID evaluation, the differences between PyTorch-StudioGAN and our implementation lie mainly in the selection of real samples. In MMGen, we follow the BigGAN pipeline, in which the whole training set is used to extract inception statistics. Besides, we also use Tero's Inception for feature extraction.
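For reference, once the inception statistics (feature mean and covariance) are extracted, FID reduces to a closed-form expression. A minimal sketch, assuming the statistics are available as NumPy arrays (the function name is illustrative):

```python
# Hedged sketch: Frechet Inception Distance from precomputed statistics,
# i.e. ||mu1 - mu2||^2 + Tr(C1 + C2 - 2*(C1 C2)^(1/2)).
import numpy as np
from scipy import linalg


def fid_from_stats(mu1, sigma1, mu2, sigma2):
    """Compute FID between two Gaussians given their means/covariances."""
    diff = mu1 - mu2
    covmean, _ = linalg.sqrtm(sigma1 @ sigma2, disp=False)
    if np.iscomplexobj(covmean):  # discard tiny imaginary parts from sqrtm
        covmean = covmean.real
    return float(diff @ diff + np.trace(sigma1 + sigma2 - 2 * covmean))


# identical statistics give an FID of 0
mu, sigma = np.zeros(4), np.eye(4)
print(round(fid_from_stats(mu, sigma, mu, sigma), 6))  # 0.0
```

Because the score depends on which real samples the statistics were computed from, the same model can report different FID under our pipeline and StudioGAN's.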

You can download the preprocessed inception states via the following links: CIFAR10 and ImageNet1k.

You can also use the following commands to extract the inception states yourself:

```shell
# For CIFAR10
python tools/utils/inception_stat.py --data-cfg configs/_base_/datasets/cifar10_inception_stat.py --pklname cifar10.pkl --no-shuffle --inception-style stylegan --num-samples -1 --subset train

# For ImageNet1k
python tools/utils/inception_stat.py --data-cfg configs/_base_/datasets/imagenet_128x128_inception_stat.py --pklname imagenet.pkl --no-shuffle --inception-style stylegan --num-samples -1 --subset train
```