
Diverse Image-to-Image Translation via Disentangled Representations (High resolution)

PyTorch implementation of multi-modal image-to-image (I2I) translation on tasks with high-resolution images. We adopt a multi-scale generator and discriminator architecture to stabilize training and enhance the quality of the generated images. This project is an extension of "Diverse Image-to-Image Translation via Disentangled Representations" (https://arxiv.org/abs/1808.00948), ECCV 2018.
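As a rough illustration of the multi-scale idea, the sketch below shows a discriminator that scores the same image at several resolutions. The layer widths, number of scales, and class names are assumptions for illustration only, not the exact modules used in this repository.

```python
# Minimal sketch of a multi-scale patch discriminator (illustrative, not the repo's code).
import torch
import torch.nn as nn

class PatchDiscriminator(nn.Module):
    """A small patch discriminator operating on a single image scale."""
    def __init__(self, in_ch=3, ndf=64):
        super().__init__()
        layers = [nn.Conv2d(in_ch, ndf, 4, stride=2, padding=1), nn.LeakyReLU(0.2, inplace=True)]
        ch = ndf
        for _ in range(3):
            layers += [nn.Conv2d(ch, ch * 2, 4, stride=2, padding=1),
                       nn.InstanceNorm2d(ch * 2), nn.LeakyReLU(0.2, inplace=True)]
            ch *= 2
        layers += [nn.Conv2d(ch, 1, 4, stride=1, padding=1)]  # patch-level real/fake map
        self.model = nn.Sequential(*layers)

    def forward(self, x):
        return self.model(x)

class MultiScaleDiscriminator(nn.Module):
    """Runs a separate discriminator on progressively downsampled copies of the image."""
    def __init__(self, num_scales=3):
        super().__init__()
        self.discs = nn.ModuleList(PatchDiscriminator() for _ in range(num_scales))
        self.downsample = nn.AvgPool2d(3, stride=2, padding=1, count_include_pad=False)

    def forward(self, x):
        outputs = []
        for disc in self.discs:
            outputs.append(disc(x))
            x = self.downsample(x)  # feed a coarser version to the next discriminator
        return outputs
```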

Contact: Hsin-Ying Lee (hlee246@ucmerced.edu) and Hung-Yu Tseng (htseng6@ucmerced.edu)

Paper

Diverse Image-to-Image Translation via Disentangled Representations
Hsin-Ying Lee*, Hung-Yu Tseng*, Jia-Bin Huang, Maneesh Kumar Singh, and Ming-Hsuan Yang
European Conference on Computer Vision (ECCV), 2018 (oral) (* equal contribution)

Please cite our paper if you find the code or dataset useful for your research.

@inproceedings{DRIT,
  author = {Lee, Hsin-Ying and Tseng, Hung-Yu and Huang, Jia-Bin and Singh, Maneesh Kumar and Yang, Ming-Hsuan},
  booktitle = {European Conference on Computer Vision},
  title = {Diverse Image-to-Image Translation via Disentangled Representations},
  year = {2018}
}

Example Results

Usage

Prerequisites

Install

  • Clone this repo:
git clone https://github.com/hytseng0509/DRIT_hr.git
cd DRIT_hr

Datasets

  • We validate our model on street scene datasets: GTA and Cityscapes
cd datasets/gta2cityscapes
mkdir trainA trainB
  • Download images from the two domains and place them in the trainA and trainB folders respectively (a minimal loading sketch follows this list)
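For reference, here is a minimal sketch of how an unpaired two-folder dataset over the trainA/trainB layout above might be loaded in PyTorch. The class name, crop size, and transforms are illustrative assumptions, not the repository's actual data loader.

```python
# Illustrative unpaired dataset over <root>/trainA and <root>/trainB (not the repo's loader).
import os
import random

from PIL import Image
from torch.utils.data import Dataset
import torchvision.transforms as T

class UnpairedFolderDataset(Dataset):
    """Loads random unaligned image pairs from the two domain folders."""

    def __init__(self, root, crop_size=512):
        self.paths_a = sorted(
            os.path.join(root, "trainA", f) for f in os.listdir(os.path.join(root, "trainA"))
        )
        self.paths_b = sorted(
            os.path.join(root, "trainB", f) for f in os.listdir(os.path.join(root, "trainB"))
        )
        self.transform = T.Compose([
            T.Resize(crop_size),
            T.RandomCrop(crop_size),
            T.RandomHorizontalFlip(),
            T.ToTensor(),
            T.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5)),
        ])

    def __len__(self):
        # Iterate over the larger domain; the other domain is sampled at random.
        return max(len(self.paths_a), len(self.paths_b))

    def __getitem__(self, idx):
        img_a = Image.open(self.paths_a[idx % len(self.paths_a)]).convert("RGB")
        img_b = Image.open(random.choice(self.paths_b)).convert("RGB")
        return self.transform(img_a), self.transform(img_b)
```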

Training and Testing

  • Training
python3 train.py --dataroot ../datasets/gta2cityscapes --name NAME --display_dir DISPLAY_DIR --result_dir RESULT_DIR
tensorboard --logdir DISPLAY_DIR/NAME

Results and saved models can be found at RESULT_DIR/NAME.

  • Generate results with randomly sampled attributes
    • Require folder testA (for a2b) or testB (for b2a) under dataroot
python3 test.py --dataroot ../datasets/gta2cityscapes --name NAME --output_dir OUTPUT_DIR --resume MODEL_FILE --num NUM_PER_IMG
  • Generate results with attributes encoded from given images
    • Require both folders testA and testB under dataroot
python3 test_transfer.py --dataroot ../datasets/gta2cityscapes --name NAME --output_dir OUTPUT_DIR --resume MODEL_FILE
  • Results can be found at OUTPUT_DIR/NAME

Note

  • The feature-wise transformation (i.e., --concat 0) has not been fully tested yet
  • We also adopt the mode seeking loss; specify --ms to apply it during training (see the sketch after this list)
  • Due to the large number of training images in the GTA dataset, the default number of training epochs is set to 90. Please refer to the default setting in the original DRIT if the number of training images is around 1K.
  • Feel free to contact the authors with any suggestions for improving the code
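For reference, below is a hedged sketch of the mode seeking regularization (Mao et al., "Mode Seeking Generative Adversarial Networks", CVPR 2019) that --ms enables. The function name and epsilon value are illustrative assumptions, not the exact code in this repository.

```python
# Illustrative mode seeking regularization term (not the repo's exact implementation).
import torch

def mode_seeking_loss(img1, img2, z1, z2, eps=1e-5):
    """Encourages images generated from different latent codes to differ:
    the ratio |G(z1) - G(z2)| / |z1 - z2| is maximized by minimizing its reciprocal."""
    ratio = torch.mean(torch.abs(img1 - img2)) / torch.mean(torch.abs(z1 - z2))
    return 1.0 / (ratio + eps)
```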
