Merge pull request #1 from sanghunpark/develop: Add files via upload
sanghunpark committed Dec 3, 2020 (commit db71cfd, parents 22190fd + 73e283c)
Showing 23 changed files with 1,670 additions and 22 deletions.

README.md:
# Neural Crossbreed: Neural Based Image Metamorphosis
[[Paper]](https://arxiv.org/abs/2009.00905)

<p align="left">
<img src="docs/teaser.png">
</p>

__Abstract__ We propose Neural Crossbreed, a feed-forward neural network that can learn a semantic change of input images in a latent space to create the morphing effect. Because the network learns a semantic change, a sequence of meaningful intermediate images can be generated without requiring the user to specify explicit correspondences. In addition, the semantic change learning makes it possible to perform the morphing between the images that contain objects with significantly different poses or camera views. Furthermore, just as in conventional morphing techniques, our morphing network can handle shape and appearance transitions separately by disentangling the content and the style transfer for rich usability. We prepare a training dataset for morphing using a pre-trained BigGAN, which generates an intermediate image by interpolating two latent vectors at an intended morphing value. This is the first attempt to address image morphing using a pre-trained generative model in order to learn semantic transformation. The experiments show that Neural Crossbreed produces high quality morphed images, overcoming various limitations associated with conventional approaches. In addition, Neural Crossbreed can be further extended for diverse applications such as multi-image morphing, appearance transfer, and video frame interpolation.
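The training-data construction described above, interpolating two BigGAN latent vectors at an intended morphing value, can be sketched as follows. This is an illustrative sketch, not the repository's code; the latent dimension and function names are assumptions, and in the actual pipeline the pre-trained BigGAN would decode each interpolated latent into an intermediate training image.

```python
import numpy as np

def lerp(z_a, z_b, alpha):
    """Linearly interpolate two latent vectors at morphing value alpha in [0, 1]."""
    return (1.0 - alpha) * z_a + alpha * z_b

# Hypothetical usage: sample two latents and build a sequence of
# intermediate latents across evenly spaced morphing values.
rng = np.random.default_rng(0)
z_a = rng.standard_normal(128)   # latent for image A (dimension is illustrative)
z_b = rng.standard_normal(128)   # latent for image B
sequence = [lerp(z_a, z_b, a) for a in np.linspace(0.0, 1.0, 5)]
```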


# Requirements
- Python 3.7.4
- PyTorch 1.2.0
- TensorBoard
- PyYAML 5.1.2
- tqdm 4.41.1
- PyTorch pretrained BigGAN 0.1.1
- NLTK 3.4.5
- scikit-image 0.16.2
# Installation

Clone this repository.
```bash
git clone https://github.com/sanghunpark/neural_crossbreed.git
cd neural_crossbreed/
```
>## For [Conda](https://www.anaconda.com/) users:
>```bash
>conda create -n neural_crossbreed python=3.7.4
>conda activate neural_crossbreed
>```
Install PyTorch 1.2.0+ and torchvision from [PyTorch](http://pytorch.org).

Install the other dependencies from the requirements file.

```bash
pip install -r requirements.txt
```
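Based on the versions listed under Requirements, `requirements.txt` presumably pins something like the following (the exact PyPI package names are assumptions; check the file in the repository):

```
torch==1.2.0
tensorboard
PyYAML==5.1.2
tqdm==4.41.1
pytorch-pretrained-biggan==0.1.1
nltk==3.4.5
scikit-image==0.16.2
```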


# Testing

Download the checkpoint file of the pre-trained dog model from the [link](https://drive.google.com/drive/folders/1IhxQ-fus-maSEakuFy7PorP1dkWI1WyR?usp=sharing) and save it in `./train_dir/nc_final/`.

To generate morphed images using the pre-trained dog model, run the following command:
```bash
python test.py --config=./config.yaml \
    --ngpu=1 \
    --gpu_1st=0 \
    --input_a=./sample_images/dog5.png \
    --input_b=./sample_images/dog3.png \
    --niter=5
```
Generated images will be placed under `./test_outdir/out_***.png`.

You can obtain disentangled transition results by adding the `--disentangled` and `--tau` arguments. For example:
```bash
python test.py --config=./config.yaml \
    --ngpu=1 \
    --gpu_1st=0 \
    --input_a=./sample_images/dog5.png \
    --input_b=./sample_images/dog3.png \
    --niter=5 \
    --disentangled \
    --tau=0.3
```

# Training
To train Neural Crossbreed from scratch, run the following command.

```bash
python train.py --config=./config.yaml \
    --ngpu=[NUM_GPUs] \
    --gpu_1st=[1ST_GPU_ID]
```
You can control GPU usage with the `--ngpu` and `--gpu_1st` arguments. For instance, `--ngpu=2` and `--gpu_1st=3` will run training on GPUs 3 and 4. If you omit these arguments, all visible GPUs are used, equivalent to running:
```bash
CUDA_VISIBLE_DEVICES=0,1,... python train.py --config=./config.yaml
```
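The GPU-selection rule described above, `--ngpu` consecutive devices starting at `--gpu_1st`, amounts to the following. This is an illustrative sketch of the argument semantics, not the repository's actual code:

```python
def training_devices(ngpu, gpu_1st):
    """Return the GPU ids used for training: `ngpu` consecutive
    devices starting at `gpu_1st` (e.g. ngpu=2, gpu_1st=3 -> [3, 4])."""
    return list(range(gpu_1st, gpu_1st + ngpu))
```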

The training configuration, such as paths, model architecture, and hyperparameters, can be further customized in [`./config.yaml`](https://github.com/sanghunpark/neural_crossbreed/blob/master/config.yaml).

In particular, to resume training, set the checkpoint path in the [`./config.yaml`](https://github.com/sanghunpark/neural_crossbreed/blob/master/config.yaml) file to your latest checkpoint:
```yaml
# path
...
checkpoints: [MODEL_DIR]/[YOUR_CHECKPOINT].pt
...
```
If a checkpoint file exists in the path, training will resume at the point where a previous training run left off. Otherwise, the network will be trained from scratch.

## Citation
If you find this work useful, please cite our paper:
```
@article {park2020neural,
title = {Neural Crossbreed: Neural Based Image Metamorphosis},
author = {Sanghun Park and Kwanggyoon Seo and Junyong Noh},
journal = {ACM Transactions on Graphics (SIGGRAPH Asia 2020)},
volume = {39},
number = {6},
year = {2020}
}
```

## Acknowledgements
This repository contains pieces of code from the following repositories:

[PyTorch pretrained BigGAN](https://github.com/huggingface/pytorch-pretrained-BigGAN) and [FUNIT](https://github.com/NVlabs/FUNIT).
config.yaml:

```yaml
# log frequency
log_freq: 50
checkpoint_freq: 100

# path
root_dir: ./train_dir/
checkpoints: nc_final/best_00174000.pt
out_path: ./test_outdir/out.png

# data
img_size: 128
biggan_model: 128
batch_size: 2
num_workers: 8
truncation: 0.25
n_inter: 100
max_epochs: 2000

# metric
which_best: 'FID'

# model
gen:
  n_f: 32
  n_cont_down: 3
  n_cont_res_blks: 3

  n_style_down: 6 # 2^n : image size
  n_style_res_blks: 0
  n_mlp_f: 512
  n_mlp_blks: 3

dis:
  n_res: 4
  n_f: 32

# loss weight
adv_w: 1
rec_w: 0.1
gp_w: 1

# training parameter
G_lr: 0.0001 # Adam
D_lr: 0.0004 # TTUR
beta1: 0.5
beta2: 0.999
eps: 0.00000001 # 1e-08
```