A tensorflow v2.3.0 implementation of Pix2Pix.
The models were trained on the cityscapes and maps datasets (found here) for 200 epochs each and batch sizes of 10 and 1 respectively.
The model did noticeably better at the maps dataset which can be attributed to it having a simpler output distribution vs the cityscapes dataset.
python train.py [-i img_path] [-o out_path] [-c ckpt_path] [-e epochs] [-l learning_rate] [-b batch_size] [--cont]
- -i: Path to dataset directory(default: ./data/maps/). Directory structure:
.
+-- dataset
| +-- train
| | +--image1.png
| | +--image2.png
| +-- val
| | +--image1.png
| | +--image2.png
| +-- out
- -o: Path to output directory(default: ./data/maps/out/).
- -c: Path to checkpoint directory(default: ./data/ckpt).
- -e: Number of epochs as integer(default: 200).
- -l: Learning rate as float(default: 0.0002)
- -b: Batch size in integer(default: 1).
- --cont: Flag for continuing training. Training will start from scratch unless included.