Yunqing Zhao* ,
Keshigeyan Chandrasegaran* ,
Milad Abdollahzadeh* ,
Ngai‑Man Cheung†
Singapore University of Technology and Design
NeurIPS 2022, Ernest N. Morial Convention Center, New Orleans, LA, USA. * Equal Contribution
[Project Page] [Poster] [Slides] [Paper]
We propose Adaptation-Aware Kernel Modulation (AdAM) for few-shot image generation. AdAM aims to identify the kernels in the source GAN that are important for target adaptation.
The model can perform GAN adaptation using very few samples from target domains with different proximity to the source.
- Platform: Linux
- GPU: Tesla V100 (or A100)
- PyTorch 1.7.0
- Python 3.6.9
- lmdb, tqdm
Alternatively, a suitable conda environment named adam can be created and activated with:
git clone https://github.com/yunqing-me/AdAM.git
cd AdAM
conda env create -f environment.yml
conda activate adam
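As a quick sanity check after activating the environment, a short script can confirm that the listed packages are importable. This is a minimal sketch and not part of the repository: the function name `check_environment` is hypothetical, and it only checks that each package is present, not that PyTorch is exactly version 1.7.0.

```python
import importlib.util
import sys

# Values copied from the requirements list above.
REQUIRED_PYTHON = (3, 6, 9)
REQUIRED_MODULES = ("torch", "lmdb", "tqdm")


def check_environment(modules=REQUIRED_MODULES):
    """Return {module_name: bool} telling whether each module can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    current = tuple(sys.version_info[:3])
    if current != REQUIRED_PYTHON:
        print("note: running Python %d.%d.%d, the requirements list 3.6.9" % current)
    for name, found in check_environment().items():
        print("%-6s %s" % (name, "ok" if found else "MISSING"))
```

A `MISSING` line usually means the conda environment is not active in the current shell.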
We analyze the Source ↦ Target domain relation in Sec. 3 of the paper (and in the Supplementary). The related steps for this analysis are:
Step 1. git clone https://github.com/rosinality/stylegan2-pytorch.git
Step 2. Move ./visualization to ./stylegan2-pytorch.
Step 3. Refer to the visualization code in ./visualization.
Prepare the few-shot training dataset in lmdb format
For example, download the 10-shot target sets Babies (Link) and AFHQ-Cat (Link), and organize your directory as follows:
10-shot-{babies/afhq_cat}
└── images
    └── image-1.png
    └── image-2.png
    └── ...
    └── image-10.png
Then, transform to lmdb format:
python prepare_data.py --input_path [your_data_path_of_{babies/afhq_cat}] --output_path ./_processed_train/[your_lmdb_data_path_of_{babies/afhq_cat}]
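Before running the conversion, it can help to confirm that the folder matches the layout shown above. The following is a minimal sketch under stated assumptions: `list_few_shot_images` is a hypothetical helper that is not part of the repository, and the set of accepted file extensions is an assumption.

```python
from pathlib import Path

# Assumption: which file extensions count as images.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def list_few_shot_images(root, expected=10):
    """Return the sorted image paths under <root>/images.

    Raises if the images folder is missing or if the number of images
    differs from `expected` (pass expected=None to skip the count check).
    """
    image_dir = Path(root) / "images"
    if not image_dir.is_dir():
        raise FileNotFoundError("expected a folder at %s" % image_dir)
    images = sorted(
        p for p in image_dir.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
    if expected is not None and len(images) != expected:
        raise ValueError(
            "found %d images in %s, expected %d" % (len(images), image_dir, expected)
        )
    return images
```

The same helper can be pointed at the full evaluation set by passing `expected=None`.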
Prepare the entire target dataset for evaluation
For example, download the entire datasets Babies (Link) and AFHQ-Cat (Link), and organize your directory as follows:
entire-{babies/afhq_cat}
└── images
    └── image-1.png
    └── image-2.png
    └── ...
    └── image-n.png
Then, transform to lmdb format for evaluation:
python prepare_data.py --input_path [your_data_path_of_entire_{babies/afhq_cat}] --output_path ./_processed_test/[your_lmdb_data_path_of_entire_{babies/afhq_cat}]
Download the GAN model pretrained on FFHQ from here. Then, save it to ./_pretrained/style_gan_source_ffhq.pt.
Randomly generate Gaussian noise inputs (with the same dimension as the generator input) for Importance Probing, and save them to ./_noise/:
python noise_generation.py
bash _bash_importance_probing.sh
This step produces the estimated Fisher information of the modulated kernels, which is saved in ./_output_style_gan/args.exp/checkpoints/filter_fisher_g.pt and ./_output_style_gan/args.exp/checkpoints/filter_fisher_d.pt
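For readers unfamiliar with the quantity being saved, a common way to estimate diagonal Fisher information is to average the squared per-sample gradients of the objective with respect to each parameter. The toy below illustrates only that general estimator in plain Python. It is not the repository's implementation, and `diagonal_fisher` is a hypothetical name.

```python
def diagonal_fisher(per_sample_grads):
    """Average the element-wise squared gradients over samples.

    per_sample_grads: list of equal-length lists, one gradient vector per sample.
    Returns one non-negative value per parameter.
    """
    if not per_sample_grads:
        raise ValueError("need at least one gradient vector")
    dim = len(per_sample_grads[0])
    totals = [0.0] * dim
    for grad in per_sample_grads:
        if len(grad) != dim:
            raise ValueError("gradient vectors must share one length")
        for i, g in enumerate(grad):
            totals[i] += g * g
    return [t / len(per_sample_grads) for t in totals]


# Two samples, two parameters:
# (1^2 + 3^2) / 2 = 5.0 and (2^2 + (-2)^2) / 2 = 4.0
print(diagonal_fisher([[1.0, 2.0], [3.0, -2.0]]))  # [5.0, 4.0]
```

A larger value marks a parameter whose gradient is consistently large across samples, which is the sense in which such scores are read as importance.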
# you can tune hyperparameters here
bash _bash_main_adaptation.sh
Training dynamics and evaluation results are logged to wandb.
We note that, ideally, Step 1 and Step 2 can be combined. Here, for simplicity, we run them as two separate steps for demonstration.
Using Babies and AFHQ-Cat as examples: download the images from here, move the unzipped folder into ./cluster_center, then refer to Evaluator in AdAM_main_adaptation.py.
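The ./cluster_center folder name suggests the usual intra-cluster protocol behind intra-LPIPS: each generated sample is assigned to its nearest few-shot target image, pairwise distances are averaged within each cluster, and the per-cluster averages are then averaged. Assuming that protocol, the sketch below shows the bookkeeping with a stand-in distance on plain numbers. The function name and the handling of clusters with fewer than two members are assumptions, and the repository's Evaluator may differ.

```python
from itertools import combinations


def intra_cluster_distance(samples, centers, distance):
    """Assign each sample to its nearest center, average the pairwise
    distances inside each cluster, then average over clusters."""
    clusters = [[] for _ in centers]
    for s in samples:
        nearest = min(range(len(centers)), key=lambda k: distance(s, centers[k]))
        clusters[nearest].append(s)
    per_cluster = []
    for members in clusters:
        pairs = list(combinations(members, 2))
        if pairs:  # assumption: clusters with fewer than two members are skipped
            per_cluster.append(sum(distance(a, b) for a, b in pairs) / len(pairs))
    return sum(per_cluster) / len(per_cluster) if per_cluster else 0.0


def abs_diff(a, b):
    """Stand-in distance; a real evaluation would use LPIPS on images."""
    return abs(a - b)


# Samples 0, 1, 2 fall to center 1.0; samples 10, 12 fall to center 11.0.
score = intra_cluster_distance([0.0, 1.0, 2.0, 10.0, 12.0], [1.0, 11.0], abs_diff)
print(round(score, 4))
```

A higher score indicates more diversity among the samples around each target image.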
The estimated Fisher information (i.e., the output of Importance Probing) and the weights (i.e., the output of the main adaptation, corresponding to Figure 4 in the main paper) can be found here.
We provide all 10-shot target images and models used in our main paper and Supplementary. You can also adapt to images of your own choosing.
Source GAN:
- FFHQ
- LSUN-Church
- LSUN-Cars
- ...
Target Samples: Link
- Babies
- Sunglasses
- MetFaces
- AFHQ-Cat
- AFHQ-Dog
- AFHQ-Wild
- Sketches
- Amedeo Modigliani's Paintings
- Rafael's Paintings
- Otto Dix's Paintings
- Haunted houses
- Van Gogh houses
- Wrecked cars
- ...
Follow the experiment steps in this repo to produce your own customized results.
If you find this project useful in your research, please consider citing our paper:
@inproceedings{zhao2022fewshot,
title={Few-shot Image Generation via Adaptation-Aware Kernel Modulation},
author={Yunqing Zhao and Keshigeyan Chandrasegaran and Milad Abdollahzadeh and Ngai-man Cheung},
booktitle={Advances in Neural Information Processing Systems},
editor={Alice H. Oh and Alekh Agarwal and Danielle Belgrave and Kyunghyun Cho},
year={2022},
url={https://openreview.net/forum?id=Z5SE9PiAO4t}
}
We also present a related work that aims to identify and Remove InCompatible Knowledge (RICK, CVPR 2023) for few-shot image generation:
@inproceedings{zhao2023exploring,
title={Exploring incompatible knowledge transfer in few-shot image generation},
author={Zhao, Yunqing and Du, Chao and Abdollahzadeh, Milad and Pang, Tianyu and Lin, Min and Yan, Shuicheng and Cheung, Ngai-Man},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={7380--7391},
year={2023}
}
We appreciate the wonderful base implementation of StyleGAN-V2 from @rosinality. We thank @mseitzer, @Ojha and @richzhang for their implementations of FID score and intra-LPIPS.
We also thank @Miaoyun for the useful training and evaluation tool used in this work.