This personal project synthesize animal images via Conditional-GAN, extended generative adversarial network.
During the training process, the label is additionally given to the generator as input and the discriminator determines its class. Then, we can manipulate the generator's input during the sampling process to get the desired class.
The project's goal is to check whether we can sample the synthesized animal images by giving multi-labeled input to the generator.
Introduced by Choi et al. in StarGAN v2: Diverse Image Synthesis for Multiple Domains
.
AFHQ is a dataset of animal faces consisting of 15,000 high-quality images at 512 × 512 resolution.
The dataset includes three domains of cat, dog, and wildlife, each providing 5000 images.
All images are vertically and horizontally aligned to have the eyes at the center.
kaggle datasets download -d andrewmvd/animal-faces -p ./dataset/ --unzip
dataset
|-- afhq
| |-- train
| | |-- cat
| | | |-- flickr_cat_000002.jpg
| | | |-- ...
| | |-- dog
| | |-- wild
| |-- val
| | |-- cat
| | |-- ...
TBU
Cat | Dog | Catdog |
---|---|---|
- Reconstruct overall architecture
- .ipynb -> .py