Mind23-2/MindCode-30
CGAN Description

Generative Adversarial Nets were recently introduced as a novel way to train generative models. In this work we introduce the conditional version of generative adversarial nets, which can be constructed by simply feeding the data, y, we wish to condition on to both the generator and discriminator. We show that this model can generate MNIST digits conditioned on class labels. We also illustrate how this model could be used to learn a multi-modal model, and provide preliminary examples of an application to image tagging in which we demonstrate how this approach can generate descriptive tags which are not part of training labels.

Paper: Conditional Generative Adversarial Nets.

Architecture guidelines for Conditional GANs

  • Replace any pooling layers with strided convolutions (discriminator) and fractional-strided convolutions (generator).
  • Use batchnorm in both the generator and the discriminator.
  • Remove fully connected hidden layers for deeper architectures.
  • Use ReLU activation in generator for all layers except for the output, which uses Tanh.
  • Use LeakyReLU activation in the discriminator for all layers (see the sketch after this list).
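
To make these guidelines concrete, here is a minimal MindSpore sketch of a conditional generator and discriminator. The cell names, layer sizes, and the one-hot label conditioning (concatenated to the noise for the generator, tiled into extra input channels for the discriminator) are assumptions for illustration, not the exact cells defined in src/model.py.

import mindspore.nn as nn
import mindspore.ops as ops

class Generator(nn.Cell):
    """Map (noise, one-hot label) to a 1x28x28 image in [-1, 1]."""
    def __init__(self, latent_dim=100, n_classes=10):
        super().__init__()
        self.concat = ops.Concat(axis=1)
        self.reshape = ops.Reshape()
        self.fc = nn.Dense(latent_dim + n_classes, 128 * 7 * 7)
        self.net = nn.SequentialCell([
            nn.BatchNorm2d(128),
            nn.ReLU(),
            # fractional-strided (transposed) convolutions instead of pooling
            nn.Conv2dTranspose(128, 64, 4, stride=2, pad_mode='pad', padding=1),
            nn.BatchNorm2d(64),
            nn.ReLU(),
            nn.Conv2dTranspose(64, 1, 4, stride=2, pad_mode='pad', padding=1),
            nn.Tanh(),  # Tanh only on the output layer
        ])

    def construct(self, noise, label_onehot):
        x = self.fc(self.concat((noise, label_onehot)))
        x = self.reshape(x, (-1, 128, 7, 7))
        return self.net(x)

class Discriminator(nn.Cell):
    """Score (image, one-hot label) pairs as real or generated."""
    def __init__(self, n_classes=10):
        super().__init__()
        self.n_classes = n_classes
        self.concat = ops.Concat(axis=1)
        self.reshape = ops.Reshape()
        self.tile = ops.Tile()
        self.net = nn.SequentialCell([
            # strided convolutions instead of pooling
            nn.Conv2d(1 + n_classes, 64, 4, stride=2, pad_mode='pad', padding=1),
            nn.LeakyReLU(alpha=0.2),  # LeakyReLU on all discriminator layers
            nn.Conv2d(64, 128, 4, stride=2, pad_mode='pad', padding=1),
            nn.BatchNorm2d(128),
            nn.LeakyReLU(alpha=0.2),
            nn.Flatten(),
            nn.Dense(128 * 7 * 7, 1),
            nn.Sigmoid(),
        ])

    def construct(self, image, label_onehot):
        # broadcast the label over the spatial dims as extra input channels
        maps = self.reshape(label_onehot, (-1, self.n_classes, 1, 1))
        maps = self.tile(maps, (1, 1, 28, 28))
        return self.net(self.concat((image, maps)))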

Dataset

Dataset used to train CGAN: MNIST

  • Dataset size: 52.4M, 60,000 28×28 images in 10 classes
    • Train: 60,000 images
    • Test: 10,000 images
  • Data format: binary files
    • Note: data will be processed in dataset.py
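
Since the raw binaries are handled by src/dataset.py, the following is only a hedged sketch of what such a pipeline could look like with mindspore.dataset; the rescaling to [-1, 1] (to match a Tanh generator output) and the batch size are assumptions:

import mindspore.common.dtype as mstype
import mindspore.dataset as ds
import mindspore.dataset.transforms.c_transforms as C
import mindspore.dataset.vision.c_transforms as CV

def create_mnist_dataset(data_dir, batch_size=128):
    """Read the MNIST binaries and yield normalized image/label batches."""
    mnist = ds.MnistDataset(data_dir, shuffle=True)
    rescale = CV.Rescale(2.0 / 255.0, -1.0)  # [0, 255] -> [-1, 1]
    mnist = mnist.map(operations=[rescale, CV.HWC2CHW()], input_columns="image")
    mnist = mnist.map(operations=C.TypeCast(mstype.int32), input_columns="label")
    return mnist.batch(batch_size, drop_remainder=True)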

The dataset directory is expected to look like this:

.
└─data
  └─MNIST_Data
    └─train

The repository is organized as follows:

.
└─CGAN
  ├─README.md               # README
  ├─requirements.txt        # required modules
  ├─scripts                 # shell scripts
  │ ├─run_standalone_train.sh          # training in standalone mode (1 pc)
  │ ├─run_distributed_train_ascend.sh  # training in parallel mode (8 pcs)
  │ └─run_eval_ascend.sh               # evaluation
  ├─src
  │ ├─dataset.py            # dataset creation
  │ ├─cell.py               # network definition
  │ ├─ckpt_util.py          # checkpoint utilities
  │ └─model.py              # discriminator & generator structure
  ├─train.py                # train CGAN
  ├─eval.py                 # evaluate CGAN
  └─export.py               # export MindIR
After preparing the dataset, you can start training and evaluation as follows:

# distributed training
bash run_distributed_train_ascend.sh /path/to/MNIST_Data/train /path/to/hccl_8p_01234567_127.0.0.1.json 8

# standalone training
bash run_standalone_train.sh /path/MNIST_Data/train 0

# evaluating
bash run_eval_ascend.sh /path/to/script/train_parallel/0/ckpt/G_50.ckpt 0

  • Run run_standalone_train.sh for non-distributed training of the CGAN model.

# standalone training
bash run_standalone_train.sh /path/MNIST_Data/train 0

  • Run run_distributed_train_ascend.sh for distributed training of the CGAN model.

# distributed training
bash run_distributed_train_ascend.sh /path/to/MNIST_Data/train /path/to/hccl_8p_01234567_127.0.0.1.json 8

  • Notes
  1. The hccl.json file specified by RANK_TABLE_FILE is needed when you run a distributed task. You can generate it with hccl_tools.

Training results will be stored in img_eval.

  • Run run_eval_ascend.sh for evaluation.

# eval
bash run_eval_ascend.sh /path/to/script/train_parallel/0/ckpt/G_50.ckpt 0

Evaluation results will be stored in the img_eval path. There you can find the generator output in result.png.
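
Conceptually, evaluation just samples one noise vector per class label and renders the generated digits; a minimal sketch along those lines (reusing the hypothetical Generator from the architecture section, not the exact code in eval.py):

import os
import numpy as np
import matplotlib.pyplot as plt
from mindspore import Tensor, load_checkpoint, load_param_into_net

gen = Generator()  # hypothetical cell from the architecture sketch above
load_param_into_net(gen, load_checkpoint("G_50.ckpt"))

noise = Tensor(np.random.normal(size=(10, 100)).astype(np.float32))
labels = Tensor(np.eye(10, dtype=np.float32))  # one-hot labels for digits 0..9
images = gen(noise, labels).asnumpy()          # shape (10, 1, 28, 28), in [-1, 1]

os.makedirs("img_eval", exist_ok=True)
fig, axes = plt.subplots(1, 10, figsize=(10, 1))
for digit, ax in enumerate(axes):
    ax.imshow(images[digit, 0], cmap="gray")
    ax.axis("off")
fig.savefig("img_eval/result.png")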

Model Export

python export.py --ckpt_dir /path/to/train/ckpt/G_50.ckpt
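
Under the hood, exporting to MindIR is typically a single call to mindspore.export with dummy inputs that fix the graph's input shapes; a hedged sketch (the real argument handling lives in export.py, and the input shapes here are assumptions):

import numpy as np
from mindspore import Tensor, export, load_checkpoint, load_param_into_net

gen = Generator()  # hypothetical cell from the architecture sketch above
load_param_into_net(gen, load_checkpoint("/path/to/train/ckpt/G_50.ckpt"))

# dummy inputs record the expected input shapes in the exported graph
noise = Tensor(np.zeros((1, 100), dtype=np.float32))
label = Tensor(np.zeros((1, 10), dtype=np.float32))
export(gen, noise, label, file_name="cgan", file_format="MINDIR")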

Model Description

Performance

Evaluation Performance

Parameters                 | Ascend
---------------------------|-------------------------------------------------------------
Model Version              | V1
Resource                   | CentOS 8.2; Ascend 910; CPU 2.60GHz, 192 cores; Memory 755G
Uploaded Date              | 07/04/2021 (month/day/year)
MindSpore Version          | 1.2.0
Dataset                    | MNIST
Training Parameters        | epoch=50, batch_size=128
Optimizer                  | Adam
Loss Function              | BCELoss
Output                     | predicted class
Loss                       | g_loss: 4.9693, d_loss: 0.1540
Total Time                 | 7.5 mins (8p)
Checkpoint for Fine-tuning | 26.2M (.ckpt file)
Scripts                    | cgan script
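
The g_loss and d_loss values above come from the standard conditional-GAN objective with binary cross-entropy; a minimal sketch of the two loss terms (function names are illustrative, not the exact code in src/cell.py):

import mindspore.common.dtype as mstype
import mindspore.nn as nn
import mindspore.ops as ops

bce = nn.BCELoss(reduction='mean')
ones = ops.Ones()
zeros = ops.Zeros()

def discriminator_loss(real_score, fake_score):
    """D should score real (image, label) pairs as 1 and generated pairs as 0."""
    return (bce(real_score, ones(real_score.shape, mstype.float32)) +
            bce(fake_score, zeros(fake_score.shape, mstype.float32)))

def generator_loss(fake_score):
    """G tries to make D score its (generated image, label) pairs as real."""
    return bce(fake_score, ones(fake_score.shape, mstype.float32))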

We use a fixed random seed in train.py and cell.py for weight initialization.
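
In MindSpore this is usually done through set_seed; a one-line sketch (the actual seed value used in the scripts is an assumption here):

from mindspore.common import set_seed

set_seed(1)  # fix the global RNG so weight initialization is reproducible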

Please check the official homepage.
